Stars
Robust Speech Recognition via Large-Scale Weak Supervision
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Get your documents ready for gen AI
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
You like pytorch? You like micrograd? You love tinygrad! ❤️
DSPy: The framework for programming—not prompting—language models
AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
An open-source RAG-based tool for chatting with your documents.
🤗 smolagents: a barebones library for agents that think in code.
Official inference framework for 1-bit LLMs
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
🚀 The fast, Pythonic way to build MCP servers and clients
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
A TTS model capable of generating ultra-realistic dialogue in one pass.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
SQL databases in Python, designed for simplicity, compatibility, and robustness.
LLM agents built for control. Designed for real-world use. Deployed in minutes.
Machine Learning Engineering Open Book
End-to-End Object Detection with Transformers