Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Robust Speech Recognition via Large-Scale Weak Supervision
Models and examples built with TensorFlow
A curated list of awesome Machine Learning frameworks, libraries and software.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Interact with your documents using the power of GPT, 100% privately, no data leaks
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The world's simplest facial recognition api for Python and the command line
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A generative speech model for daily dialogue.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Instant voice cloning by MIT and MyShell. Audio foundation model.
💫 Industrial-strength Natural Language Processing (NLP) in Python
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
State-of-the-art 2D and 3D Face Analysis Project
Generative Models by Stability AI
Fully open reproduction of DeepSeek-R1
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Code for the paper "Language Models are Unsupervised Multitask Learners"
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python