
Organizations

@dssg

Starred repositories

39 stars written in Java

Apache Iceberg

Java 8,177 2,855 Updated Nov 6, 2025

AI + Data, online. https://vespa.ai

Java 6,550 675 Updated Nov 6, 2025

Statistical Machine Intelligence & Learning Engine

Java 6,289 1,149 Updated Nov 6, 2025

A machine learning software for extracting information from scholarly documents

Java 4,415 517 Updated Nov 6, 2025

Collect, aggregate, and visualize a data ecosystem's metadata

Java 2,054 376 Updated Nov 5, 2025

A native library providing a Tinder-like cards effect. A card can be constructed using an image and displayed with animation effects, dismiss-to-like and dismiss-to-unlike, and use different sortin…

Java 1,472 362 Updated Nov 16, 2020

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 1,108 144 Updated Nov 5, 2025

Anserini is a Lucene toolkit for reproducible information retrieval research

Java 1,084 538 Updated Nov 5, 2025

A Java HTTP client for consuming Twitter's realtime Streaming API

Java 958 367 Updated Apr 6, 2022

Align and compare tables

Java 867 71 Updated Aug 6, 2025

A programmable, embeddable web browser driver compatible with the Selenium WebDriver spec -- headless, WebKit-based, pure Java

Java 815 142 Updated Jul 29, 2024

INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.

Java 666 163 Updated Nov 5, 2025

Duke is a fast and flexible deduplication engine written in Java

Java 626 190 Updated Oct 11, 2023

ReverseProxy-Android

Java 512 159 Updated Apr 16, 2018

Latent Dirichlet Allocation (LDA) model for microblogs (Twitter, Weibo, etc.)

Java 320 108 Updated May 4, 2018

Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)

Java 315 47 Updated Feb 4, 2022

A Java library of SOCKS5 protocol including client and server

Java 302 109 Updated Jul 16, 2023

Mirror of Apache Samoa (Incubating)

Java 251 105 Updated Apr 16, 2023

An open source, high scalability toolkit in Java for Entity Resolution.

Java 221 45 Updated Jul 12, 2025

Flexible classic and NeurAl Retrieval Toolkit

Java 220 35 Updated Jun 28, 2025

Twitter Tools

Java 220 97 Updated Feb 18, 2018

Warcbase is an open-source platform for managing and analyzing web archives

Java 161 47 Updated Dec 8, 2017

Android app for saving webpages for offline reading.

Java 140 45 Updated Jul 15, 2021

A toolbox for statistical relational learning and reasoning.

Java 102 26 Updated Jul 6, 2022

Artificial Intelligence for Digital Response

Java 102 38 Updated Nov 21, 2018

neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität Berlin.

Java 68 10 Updated Feb 13, 2019

A spring-boot-starter application with user authentication, registration, and JPA using MySQL.

Java 49 22 Updated Oct 3, 2024

BoostSRL: "Boosting for Statistical Relational Learning." A gradient-boosting based approach for learning different types of SRL models.

Java 32 24 Updated Sep 11, 2023

Simple Kafka producer that ingests data from the Twitter Streaming API into a Kafka broker

Java 28 32 Updated Sep 19, 2016

Egonet is a program for the collection and analysis of egocentric network data. It helps you create the questionnaire, collect data, and provide general global network measures and data matrixes th…

Java 25 10 Updated Feb 1, 2022