Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Robust Speech Recognition via Large-Scale Weak Supervision
Models and examples built with TensorFlow
Clone a voice in 5 seconds to generate arbitrary speech in real-time
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Deezer source separation library including pretrained models.
A TTS model capable of generating ultra-realistic dialogue in one pass.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Style transfer, deep learning, feature transform
Distributed Asynchronous Hyperparameter Optimization in Python
Multilingual Document Layout Parsing in a Single Vision-Language Model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
Noise supression using deep filtering
Marshalling / communication library for drones.
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Reversi reinforcement learning by AlphaGo Zero methods.
Export Hugging Face models to Core ML and TensorFlow Lite
MAVLink proxy and command line ground station
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
TLS implementation in pure python, focused on interoperability testing
This program calculates the word error rate of hypothesis in ASR and print the aligned result.