- Washington
Starred repositories
Statistical Machine Intelligence & Learning Engine
Machine learning software for extracting information from scholarly documents
Collect, aggregate, and visualize a data ecosystem's metadata
A native library providing a Tinder-like cards effect. A card can be constructed using an image and displayed with animation effects, dismiss-to-like and dismiss-to-unlike, and use different sortin…
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Anserini is a Lucene toolkit for reproducible information retrieval research
A Java HTTP client for consuming Twitter's realtime Streaming API
A programmable, embeddable web browser driver compatible with the Selenium WebDriver spec -- headless, WebKit-based, pure Java
INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
Duke is a fast and flexible deduplication engine written in Java
Latent Dirichlet Allocation (LDA) model for microblogs (Twitter, Weibo, etc.)
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
A Java library implementing the SOCKS5 protocol, including client and server
An open source, highly scalable toolkit in Java for Entity Resolution.
Warcbase is an open-source platform for managing and analyzing web archives
Android app for saving webpages for offline reading.
A toolbox for statistical relational learning and reasoning.
neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität Berlin.
A spring-boot-starter application with user authentication, registration, and JPA using MySQL.
BoostSRL: "Boosting for Statistical Relational Learning." A gradient-boosting based approach for learning different types of SRL models.
Simple Kafka producer that ingests data from the Twitter Streaming API into a Kafka broker
Egonet is a program for the collection and analysis of egocentric network data. It helps you create the questionnaire, collect data, and provide general global network measures and data matrixes th…