22 stars written in Java

MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Java 13,960 2,872 Updated Feb 3, 2026

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 10,062 2,718 Updated Feb 10, 2026

Tools for keeping your cloud operating in top form. Chaos Monkey is a resiliency tool that helps applications tolerate random instance failures.

Java 7,983 1,123 Updated Dec 18, 2018

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Java 3,208 781 Updated Mar 10, 2026

An open-source geocoder for OpenStreetMap data

Java 2,683 344 Updated Mar 15, 2026

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

Java 2,147 229 Updated Feb 17, 2025

Extract tables from PDF files

Java 2,021 450 Updated Mar 19, 2025

Open Source ML Model Versioning, Metadata, and Experiment Management

Java 1,745 287 Updated Jul 23, 2024

Converts a pdf file into a text file while keeping the layout of the original pdf. Useful to extract the content from a table in a pdf file for instance. This is a subclass of PDFTextStripper class…

Java 1,604 214 Updated Dec 17, 2023

Linear Algebra for Java

Java 604 152 Updated Jul 7, 2023

ABBYY Cloud OCR SDK

Java 530 481 Updated May 22, 2023

Content ExtRactor and MINEr

Java 513 99 Updated Jun 30, 2022

News crawling with StormCrawler - stores content as WARC

Java 364 40 Updated Feb 19, 2025
Java 270 40 Updated Jun 17, 2015

PDF Toolkit. 📎 🔨 🔧 ✂️ 📑 📎 🔖 🚧 👷

Java 268 26 Updated Aug 26, 2025

DistML provides a supplement to MLlib to support model parallelism on Spark

Java 169 75 Updated Feb 6, 2017

Machine Learning Tool Kit

Java 139 75 Updated Oct 21, 2020

A Pi Zero and Motion based webcamera that forwards images to Amazon Web Services for Image Processing

Java 118 31 Updated Apr 14, 2019

Source code for "Engineering Deep Learning Platforms"

Java 55 16 Updated May 4, 2025

A Java K-means Clustering implementation

Java 23 16 Updated Mar 1, 2017

Java OCR and Parser for Warren's TV and Cable Factbook (From 2013)

Java 2 Updated Dec 22, 2017