cuijh26

Organizations

@fudan-generative-vision

398 stars written in Python

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,960 133 Updated Aug 20, 2024

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,952 117 Updated Apr 3, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,890 134 Updated Dec 6, 2024

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

Python 1,879 133 Updated Jul 5, 2024

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,870 179 Updated Aug 7, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,782 107 Updated Sep 27, 2024

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,763 76 Updated Oct 22, 2025

Unofficial PyTorch implementation of Palette: Image-to-Image Diffusion Models

Python 1,757 228 Updated Jul 7, 2023

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,715 176 Updated Oct 4, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)

Python 1,686 128 Updated Jul 25, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,653 76 Updated Apr 18, 2025

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,583 191 Updated Sep 18, 2025

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,581 124 Updated Aug 20, 2025

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python 1,554 68 Updated Jun 19, 2025

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,471 71 Updated Oct 13, 2025

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,402 64 Updated Mar 16, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,325 175 Updated Mar 13, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,324 77 Updated Sep 12, 2025

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,321 124 Updated Oct 22, 2025

[ICCV'25] DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,308 72 Updated Oct 17, 2025

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Python 1,308 102 Updated Apr 25, 2024

Official code for "Style Aligned Image Generation via Shared Attention"

Python 1,304 98 Updated Dec 29, 2023

[AAAI 2025] 👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,300 116 Updated Sep 30, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,272 92 Updated Nov 6, 2025

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,267 42 Updated Jun 12, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,257 58 Updated Oct 13, 2025

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

Python 1,236 278 Updated Aug 20, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,218 142 Updated Mar 14, 2025

[CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,208 56 Updated Jul 9, 2025