Stars
Style and Grammar Checker for 25+ Languages
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Please visit https://github.com/h2oai/h2o-3 for latest H2O
A high performance replicated log service. (The development is moved to Apache Incubator)
Elegant parsing in Java and Scala - lightweight, easy-to-use, powerful.
Fast Parallel Async HTTP/SSH/TCP/UDP/Ping Client Java Library. Aggregate 100,000 APIs & send anywhere in 20 lines of code. Ping/HTTP Calls 8000 servers in 12 seconds. (Akka) www.parallec.io
Library and tools for advanced feature engineering
Fast Entity Linker Toolkit for training models to link entities to KnowledgeBase (Wikipedia) in documents and queries.
The PSL software from the University of Maryland and the University of California Santa Cruz
Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump
Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.
Experiments codes for SIGIR'16 paper "Fast Matrix Factorization for Online Recommendation with Implicit Feedback "
Original java implementation of CraftML, an efficient Clustering-based Random Forest for Extreme multi-label Learning