-
Sichuan University
- Chengdu
Stars
A toolkit for developing and comparing reinforcement learning algorithms.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Official Code for DragGAN (SIGGRAPH 2023)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
OpenMMLab Detection Toolbox and Benchmark
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official inference repo for FLUX.1 models
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Lets make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
An open source implementation of CLIP.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Enjoy the magic of Diffusion models!
ImageBind One Embedding Space to Bind Them All
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Matplotlib styles for scientific plotting
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"