Lists (3)
Sort Name ascending (A-Z)
Starred repositories
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Generate 3D objects conditioned on text or images
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
An open-source tool-augmented conversational language model from Fudan University
Official implementation of AnimateDiff.
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
手写实现李航《统计学习方法》书中全部算法
Wan: Open and Advanced Large-Scale Video Generative Models
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
HunyuanVideo: A Systematic Framework For Large Video Generation Model
StyleGAN2 - Official TensorFlow Implementation
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
PyTorch package for the discrete VAE used for DALL·E.
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
A collaboration friendly studio for NeRFs
🐍 Geometric Computer Vision Library for Spatial AI
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Enjoy the magic of Diffusion models!
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Hackable and optimized Transformers building blocks, supporting a composable construction.
Refine high-quality datasets and visual AI models
Hydra is a framework for elegantly configuring complex applications
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support