Starred repositories
Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running mat…
OpenRefine is a free, open source power tool for working with messy data and improving it
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Deep Learning (Python, C, C++, Java, Scala, Go)
中文自然语言处理工具包 Toolkit for Chinese natural language processing
A Property Graph Model Interface (no longer active - see Apache TinkerPop)
A Question Answering system built on top of the Apache UIMA framework.
Library and tools for advanced feature engineering
啊哈自然语言处理包,提供包括分词、依存句法分析、语义角色标注、自动摘要、语义相似度计算、LDA 主题预测、词云等服务。
Dexter is a framework that implements some popular algorithms and provides all the tools needed to develop any entity linking technique.
AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics
DistML provide a supplement to mllib to support model-parallel on Spark
AGDISTIS - Agnostic Named Entity Disambiguation
Graph-algorithm inferences over local groundings of first-order logic programs
An open source toolkit for mining Wikipedia
Implementation of Vision Based Page Segmentation algorithm in Java
Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法
Improving Knowledge Graph Embedding Using Simple Constraints (ACL-2108)
LASER-A Scalable Response Prediction Platform For Online Advertising
An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.