Stars
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
[ICCV2025] DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation
[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome list for "Controllable video generation: A survey"
A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.
(CVPR 2025) DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
The official implementation of "RepVideo: Rethinking Cross-Layer Representation for Video Generation"
Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.
A generative world for general-purpose robotics & embodied AI learning.
[ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Official inference repo for FLUX.1 models
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Emu Series: Generative Multimodal Models from BAAI
[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
[ICCV 2023] GETAvatar: Generative Textured Meshes for Animatable Human Avatars
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.