Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🦜🔗 The platform for reliable agents.
Robust Speech Recognition via Large-Scale Weak Supervision
A latent text-to-image diffusion model
An extremely fast Python linter and code formatter, written in Rust.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🔊 Text-Prompted Generative Audio Model
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Cross-platform, customizable ML solutions for live and streaming media.
Generative Models by Stability AI
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Official inference framework for 1-bit LLMs
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
GUI for a Vocal Remover that uses Deep Neural Networks.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🚀 The fast, Pythonic way to build MCP servers and clients
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
The official Python SDK for Model Context Protocol servers and clients
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Instruct-tune LLaMA on consumer hardware
A TTS model capable of generating ultra-realistic dialogue in one pass.
Neural Networks: Zero to Hero
State-of-the-Art Text Embeddings
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A multi-voice TTS system trained with an emphasis on quality