Stars
Stable Diffusion web UI
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The Web framework for perfectionists with deadlines.
The world's simplest facial recognition API for Python and the command line
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
The definitive Web UI for local AI, with powerful features and easy setup.
Get your documents ready for gen AI
aider is AI pair programming in your terminal
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A community-maintained Python framework for creating mathematical animations.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
A book-in-progress about the Linux kernel and its insides.
Deezer source separation library including pretrained models.
State-of-the-art 2D and 3D Face Analysis Project
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Fast and memory-efficient exact attention
Open standard for machine learning interoperability
A TTS model capable of generating ultra-realistic dialogue in one pass.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Let's make video diffusion practical!
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Wan: Open and Advanced Large-Scale Video Generative Models
A Conversational Speech Generation Model