A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,586 13,218 Updated Feb 5, 2026

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 42,641 3,810 Updated Feb 5, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,549 4,704 Updated Feb 5, 2026

Extremely fast Query Engine for DataFrames, written in Rust

Rust 37,323 2,598 Updated Feb 5, 2026

DuckDB is an analytical in-process SQL database management system

C++ 35,907 2,906 Updated Feb 5, 2026

DSPy: The framework for programming, not prompting, language models

Python 32,016 2,607 Updated Feb 5, 2026

An open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 29,974 2,702 Updated Feb 5, 2026

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Jupyter Notebook 29,839 13,251 Updated Jun 13, 2024

A playbook for systematically maximizing the performance of deep learning models.

29,771 2,414 Updated Jun 18, 2024

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 28,601 2,029 Updated Feb 5, 2026

An open autonomous driving platform

C++ 26,420 9,961 Updated Feb 2, 2026

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

Jupyter Notebook 25,860 12,868 Updated Oct 3, 2023

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 25,001 3,474 Updated Feb 3, 2026

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,638 5,042 Updated Feb 5, 2026

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,099 2,511 Updated Feb 1, 2026

📊 Cube Core is an open-source semantic layer for AI, BI and embedded analytics

Rust 19,421 1,947 Updated Feb 5, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,194 1,373 Updated Oct 6, 2025

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,975 3,718 Updated Jun 2, 2023

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

Python 15,255 3,564 Updated Feb 5, 2026

An orchestration platform for the development, production, and observation of data assets.

Python 14,901 1,969 Updated Feb 5, 2026

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,664 2,256 Updated Dec 1, 2025

🔎 Open source distributed and RESTful search engine.

Java 12,333 2,393 Updated Feb 5, 2026

OpenZFS on Linux and FreeBSD

C 11,954 1,944 Updated Feb 4, 2026

A framework for few-shot evaluation of language models.

Python 11,366 3,018 Updated Feb 5, 2026

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 11,350 2,303 Updated Feb 5, 2026

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,303 882 Updated Jan 13, 2026

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,847 1,390 Updated Nov 4, 2024

Modin: Scale your Pandas workflows by changing a single line of code

Python 10,358 673 Updated Oct 2, 2025

An open-source, low-code machine learning library in Python

Jupyter Notebook 9,688 1,856 Updated Apr 21, 2025