- Hong Kong
-
21:33
(UTC +08:00) - http://fuxiao0719.github.io/
- @lemonaddie0909
Stars
Making large AI models cheaper, faster and more accessible
Official Code for DragGAN (SIGGRAPH 2023)
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
A generative world for general-purpose robotics & embodied AI learning.
Fully open reproduction of DeepSeek-R1
verl: Volcano Engine Reinforcement Learning for LLMs
Wan: Open and Advanced Large-Scale Video Generative Models
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Wan: Open and Advanced Large-Scale Video Generative Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Stable Diffusion built-in to Blender
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
The best OSS video generation models, created by Genmo
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
[CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
A unified inference and post-training framework for accelerated video generation.