Stars
Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
The best OSS video generation models, created by Genmo
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Efficient vision foundation models for high-resolution generation and perception.
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
Code and dataset for photorealistic Codec Avatars driven from audio
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
A unified inference and post-training framework for accelerated video generation.
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
PyTorch code and models for VJEPA2 self-supervised learning from video.
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
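This project shares course materials, notes, final exam papers, and other useful resources from the undergraduate and graduate programs of the School of Computer Science at Sun Yat-sen University. We hope it helps with your studies ❤️. If you like it, remember to give it a star 🌟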
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!