thomasjpfan

Organizations

@scikit-learn @numfocus @NYCPython @conda-forge @scikit-learn-contrib @pyOpenSci @scientific-python @skorch-dev


Machine learning with dataframes

Python 1,620 257 Updated Jun 11, 2026

Documentation sites for Python packages: start simple, go deep

Python 204 10 Updated Jun 11, 2026

Polars extension for general data science use cases

Rust 643 43 Updated Jun 5, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 16,455 1,405 Updated Jun 12, 2026

A Python package for the statistical analysis of A/B tests

Python 328 10 Updated Jun 7, 2026

Override and customize Python packages without touching their code

Python 476 6 Updated Mar 24, 2026

JAX in JavaScript – ML library for the web, running on WebGPU & Wasm

TypeScript 819 47 Updated Jun 12, 2026

An extremely fast Python type checker and language server, written in Rust.

Python 18,924 298 Updated Jun 12, 2026

RFC document, tooling and other content related to the array API standard

Python 267 54 Updated Apr 23, 2026

The most widely used Python to C compiler

Cython 10,768 1,613 Updated Jun 12, 2026

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,652 4,687 Updated May 19, 2026

A central repository to keep track of the status of work on and support for free-threaded CPython (see PEP 703), with a focus on the scientific and ML/AI ecosystem

288 48 Updated Jun 9, 2026

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.

Go 7,082 829 Updated Jun 12, 2026

D-Adaptation for SGD, Adam and AdaGrad

Python 532 24 Updated Jan 22, 2025

RFC document, tooling and other content related to the dataframe API standard

Python 106 22 Updated Mar 29, 2024

Extremely fast Query Engine for DataFrames, written in Rust

Rust 38,738 2,877 Updated Jun 12, 2026

Model interpretability and understanding for PyTorch

Python 5,650 559 Updated Jun 11, 2026

Luminaire is a Python package that provides ML-driven solutions for monitoring time series data.

Python 805 67 Updated Jun 2, 2026

Script to help maintain a wheelhouse folder on cloud storage.

Python 33 11 Updated Aug 4, 2020

Hummingbird compiles trained ML models into tensor computation for faster inference.

Python 3,536 292 Updated Jul 17, 2025

A repository in preparation for open-sourcing lottery ticket hypothesis code.

Python 641 118 Updated Sep 6, 2022

📝 Python package to calculate readability statistics of a text object: paragraphs, sentences, articles.

Python 1,372 183 Updated Feb 18, 2026

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 13,451 435 Updated Jun 5, 2026

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,510 900 Updated Jan 13, 2026

Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS

Python 1,582 217 Updated Jun 13, 2024

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 11,609 1,836 Updated Jun 12, 2026

Discrete Hidden Markov Models with Numba

Python 12 3 Updated Aug 31, 2021

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,617 2,580 Updated Jun 4, 2026

apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.rea…

Jupyter Notebook 532 53 Updated Nov 17, 2025

A collection of various deep learning architectures, models, and tips

Jupyter Notebook 17,531 4,107 Updated Feb 8, 2024