Starred repositories
A topic-centric list of HQ open datasets.
🦀 Small exercises to get you used to reading and writing Rust code!
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Extremely fast Query Engine for DataFrames, written in Rust
🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
A game theoretic approach to explain the output of any machine learning model.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
State-of-the-Art Text Embeddings
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
A computer algebra system written in pure Python
A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.
Statsmodels: statistical modeling and econometrics in Python
A unified framework for machine learning with time series
Bayesian Modeling and Probabilistic Programming in Python
A python library for user-friendly forecasting and anomaly detection on time series.
Deep universal probabilistic programming with Python and PyTorch
Open Source Feature Flags, Experimentation, and Product Analytics
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristi…
Model interpretability and understanding for PyTorch
A unified, comprehensive and efficient recommendation library