chrismattmann

Highlights

  • Pro

Organizations

@ESIPFed

113 results for source starred repositories written in Java

Free and Open Source, Distributed, RESTful Search Engine

Java 75,638 25,703 Updated Dec 14, 2025

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…

Java 14,162 3,856 Updated Dec 13, 2025

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 10,017 2,717 Updated Nov 27, 2025

Open source routing engine for OpenStreetMap. Use it as Java library or standalone web server.

Java 6,134 1,847 Updated Dec 13, 2025

Apache NiFi

Java 5,869 2,908 Updated Dec 13, 2025

A scalable, distributed Time Series Database.

Java 5,061 1,239 Updated Dec 12, 2024

A machine learning software for extracting information from scholarly documents

Java 4,502 524 Updated Dec 2, 2025

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Java 3,463 894 Updated Dec 14, 2025

Apache Nutch is an extensible and scalable web crawler

Java 3,099 1,261 Updated Dec 11, 2025

Autopsy® is a digital forensics platform and graphical interface to The Sleuth Kit® and other digital forensics tools. It can be used by law enforcement, military, and corporate examiners to invest…

Java 2,918 640 Updated Oct 25, 2025

Deeplearning4j Examples (DL4J, DL4J Spark, DataVec)

Java 2,511 1,825 Updated Nov 18, 2025

Please visit https://github.com/h2oai/h2o-3 for latest H2O

Java 2,292 556 Updated Oct 24, 2024

Extract tables from PDF files

Java 1,985 444 Updated Mar 19, 2025

Apache OpenNLP

Java 1,570 489 Updated Dec 14, 2025

Pure Java speech recognition library

Java 1,438 586 Updated Oct 18, 2022

Elasticsearch File System Crawler (FS Crawler)

Java 1,420 306 Updated Dec 8, 2025

Java API for GeoIP2 webservice client and database reader

Java 850 206 Updated Dec 12, 2025

A programmable, embeddable web browser driver compatible with the Selenium WebDriver spec -- headless, WebKit-based, pure Java

Java 815 143 Updated Jul 29, 2024

args4j

Java 798 188 Updated Mar 6, 2024
Java 460 108 Updated Mar 24, 2023

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

Java 419 139 Updated Mar 30, 2023

Neural Adaptive Machine Translation that adapts to context and learns from corrections.

Java 349 73 Updated Jul 7, 2022

Wicketstuff-core projects are bundled user contributions for use with Apache Wicket (https://wicket.apache.org/). They are released in step with Wicket releases to make them easy to use.

Java 346 292 Updated Dec 14, 2025

Source code for Big Data: Principles and best practices of scalable realtime data systems

Java 332 164 Updated Jun 8, 2024
Java 271 40 Updated Jun 17, 2015

Distributed P2P Data-driven Workflow Framework

Java 269 128 Updated Dec 13, 2025

Java library and command-line application for converting Apache Spark ML pipelines to PMML

Java 269 80 Updated Nov 29, 2025

A set of reusable Java components that implement functionality common to any web crawler

Java 251 88 Updated Dec 13, 2025

Obsolete - superseded by Apache Calcite

Java 235 89 Updated Jan 20, 2021

Putting LIRE into Solr - an ongoing project

Java 183 40 Updated May 26, 2020