cuijh26

🤗

Jiahao Cui cuijh26

🤗

43 followers · 123 following

Fusion Lab
China
https://cuijh26.github.io/

Achievements

Organizations

Lists (1)

Sort

awesome

5 repositories

Stars

398 stars written in Python

Clear filter

3DTopia / LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,960 133 Updated Aug 20, 2024

MoonshotAI / MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,952 117 Updated Apr 3, 2025

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,890 134 Updated Dec 6, 2024

junshutang / Make-It-3D

[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior

Python 1,879 133 Updated Jul 5, 2024

muskie82 / MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,870 179 Updated Aug 7, 2024

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,782 107 Updated Sep 27, 2024

showlab / Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,763 76 Updated Oct 22, 2025

Janspiry / Palette-Image-to-Image-Diffusion-Models

Unofficial implementation of Palette: Image-to-Image Diffusion Models by Pytorch

Python 1,757 228 Updated Jul 7, 2023

SkyworkAI / Matrix-Game

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,715 176 Updated Oct 4, 2025

Xiaojiu-z / EasyControl

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)

Python 1,686 128 Updated Jul 25, 2025

policy-gradient / GRPO-Zero

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,653 76 Updated Apr 18, 2025

ZiqiaoPeng / SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,583 191 Updated Sep 18, 2025

Fantasy-AMAP / fantasy-talking

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,581 124 Updated Aug 20, 2025

menyifang / MIMO

Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"

Python 1,554 68 Updated Jun 19, 2025

Gen-Verse / MMaDA

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,471 71 Updated Oct 13, 2025

EvolvingLMMs-Lab / open-r1-multimodal

A fork to add multimodal model training to open-r1

Python 1,416 70 Updated Feb 8, 2025

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,402 64 Updated Mar 16, 2025

fudan-generative-vision / hallo3

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,325 175 Updated Mar 13, 2025

bytedance / UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,324 77 Updated Sep 12, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,321 124 Updated Oct 22, 2025

wenqsun / DimensionX

[ICCV'25]DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Python 1,308 72 Updated Oct 17, 2025

wyhuai / DDNM

[ICLR 2023 Oral] Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model

Python 1,308 102 Updated Apr 25, 2024

google / style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Python 1,304 98 Updated Dec 29, 2023

muzishen / IMAGDressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,300 116 Updated Sep 30, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,272 92 Updated Nov 6, 2025

hustvl / LightningDiT

[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Python 1,267 42 Updated Jun 12, 2025

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,257 58 Updated Oct 13, 2025

TimoBolkart / voca

This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character mesh.

Python 1,236 278 Updated Aug 20, 2024

Fictionarry / ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,218 142 Updated Mar 14, 2025

alibaba / Tora

[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,208 56 Updated Jul 9, 2025

Previous Next