Skip to content
View rzo1's full-sized avatar

Sponsors

@tomitribe

Organizations

@crawler-commons

Block or report rzo1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
58 stars written in Java
Clear filter

Free and Open Source, Distributed, RESTful Search Engine

Java 76,508 25,833 Updated Apr 12, 2026

Graphs for Everyone

Java 16,293 2,587 Updated Apr 1, 2026

Cryptomator for Windows, macOS, and Linux: Secure client-side encryption for your cloud storage, ensuring privacy and control over your data.

Java 14,927 1,278 Updated Apr 9, 2026

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impl…

Java 11,565 2,121 Updated Apr 10, 2026

Picocli is a modern framework for building powerful, user-friendly, GraalVM-enabled command line apps with ease. It supports colors, autocompletion, subcommands, and more. In 1 source file so apps …

Java 5,350 456 Updated Oct 30, 2025

Open Source Web Crawler for Java

Java 4,625 1,908 Updated Nov 4, 2021

Apache Nutch is an extensible and scalable web crawler

Java 3,149 1,263 Updated Feb 27, 2026

fastutil extends the Java™ Collections Framework by providing type-specific maps, sets, lists and queues.

Java 2,131 221 Updated Dec 2, 2025

TwelveMonkeys ImageIO: Additional plug-ins and extensions for Java's ImageIO

Java 2,109 323 Updated Apr 6, 2026

Awesome Procedures On Cypher for Neo4j - codenamed "apoc"                     If you like it, please ★ above ⇧            

Java 1,862 503 Updated Apr 11, 2026

Apache OpenNLP

Java 1,594 492 Updated Apr 7, 2026

Terminal-based progress bar for Java / JVM

Java 1,170 110 Updated Mar 1, 2026

A scalable, mature and versatile web crawler based on Apache Storm

Java 974 273 Updated Apr 10, 2026

...will provide a platform neutral way for running mongodb in unittests.

Java 935 155 Updated Jan 24, 2026

Efficient Graph Algorithms for Neo4j

Java 775 195 Updated Apr 22, 2020

Generated Java code for Google APIs

Java 720 390 Updated Apr 12, 2026

Java embedded PostgreSQL component for testing

Java 712 190 Updated Feb 13, 2026

INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.

Java 686 168 Updated Apr 12, 2026

Fast, scalable, self-contained, single-threaded Java web server

Java 597 63 Updated Apr 3, 2025

Apache TomEE

Java 472 693 Updated Apr 12, 2026

A Java SDK for the Twitter API

Java 278 98 Updated Jul 12, 2025

Twitter API client for Java developers

Java 256 66 Updated Jul 31, 2024

A set of reusable Java components that implement functionality common to any web crawler

Java 255 90 Updated Feb 26, 2026

Unit testing for CDI applications

Java 103 55 Updated Apr 7, 2026

Maven Build Time Profiler

Java 94 14 Updated Nov 24, 2025

The open source PII and PHI redaction and de-identification engine

Java 93 13 Updated Apr 2, 2026

DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all information in Wikipedia.

Java 90 35 Updated Apr 6, 2026

A JUnit5 Extension to help write tests that call System.exit()

Java 58 7 Updated Oct 26, 2024

API definition, resources and reference implementation of URL Frontiers

Java 59 12 Updated Jan 23, 2026

Apache OpenNLP Sandbox

Java 47 35 Updated Mar 30, 2026
Next