
Track data flowing through Java programs

Java 369 12 Updated Oct 7, 2024

The Open Source Feature Store for AI/ML

Python 7,092 1,344 Updated Jun 13, 2026

A FiveThirtyEight/The Marshall Project effort to collect comprehensive data on police misconduct settlements from 2010-19.

R 151 38 Updated Jan 19, 2022

NYC Subway Turnstile Data

Ruby 121 10 Updated Aug 19, 2023

An ongoing list of pandas quirks

Jupyter Notebook 996 133 Updated May 8, 2023

Parse and analyze the data that the sleep tracking app Pillow exports.

Jupyter Notebook 7 Updated Jun 16, 2018

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

29,756 3,947 Updated Jul 18, 2024

Transportation planning and traffic simulation software for creating cities friendlier to walking, biking, and public transit

Rust 8,140 379 Updated Sep 10, 2025

A repository of data on coronavirus cases and deaths in the U.S.

6,973 3,392 Updated Apr 2, 2024

Data and methodology for the Big Mac index

Jupyter Notebook 1,734 442 Updated Mar 30, 2026

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,683 1,926 Updated May 8, 2026

Below are some simple methods for exiting vim.

7,191 327 Updated Mar 14, 2026

MusicBrainz Spotify integration hack for SF Music Hack Day 2014

Python 69 16 Updated Jan 20, 2022

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 10,431 1,615 Updated Apr 15, 2023

Python package + CLI to generate wordclouds of Twitter tweets.

Python 78 4 Updated Nov 27, 2019

Library to scrape and clean web pages to create massive datasets.

Python 2,261 323 Updated Nov 11, 2020

Reusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.

JavaScript 7,095 237 Updated Apr 26, 2024

Datasets, tools, and benchmarks for representation learning of code.

Jupyter Notebook 2,436 408 Updated Jan 31, 2022

Python library for Multi-Armed Bandits

Jupyter Notebook 769 157 Updated Feb 11, 2020

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Python 16,384 2,764 Updated Feb 23, 2023

xkcd styled chart lib

JavaScript 7,784 205 Updated Jun 12, 2026

Library of contextual bandits algorithms

Jupyter Notebook 342 85 Updated Mar 14, 2024

A library of extension and helper modules for Python's data analysis and machine learning libraries.

Python 5,151 907 Updated Jun 12, 2026

Machine learning, in numpy

Python 16,342 3,762 Updated Oct 29, 2023

Statistics for each published edition of Data Is Plural.

Python 17 1 Updated Jun 1, 2021

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 25,520 3,730 Updated Jun 12, 2026

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

Python 20,226 4,629 Updated May 8, 2026

Build Accelerated Mobile Page versions of your Jekyll posts

Ruby 280 53 Updated Nov 8, 2019

Populate a database with NBA shot data

Ruby 122 14 Updated Aug 19, 2023

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,930 5,885 Updated Aug 14, 2024