Stars
All Algorithms implemented in Python
Stable Diffusion web UI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A latent text-to-image diffusion model
Examples and guides for using the OpenAI API
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Lightweight coding agent that runs in your terminal
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official Code for DragGAN (SIGGRAPH 2023)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.
Real-time face swap for PC streaming or video calls
Open-Sora: Democratizing Efficient Video Production for All
Generative Models by Stability AI
Industry leading face manipulation platform
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Tongyi Deep Research, the Leading Open-source Deep Research Agent
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
PyTorch implementations of Generative Adversarial Networks.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything