[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.

Python 1,032 104 Updated Oct 24, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,495 40 Updated Oct 15, 2025

KwaiVGI / VideoCanvas

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

60 Updated Oct 10, 2025

NVlabs / rcm

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 266 13 Updated Nov 5, 2025

NVlabs / LongLive

LongLive: Real-time Interactive Long Video Generation

Python 801 49 Updated Nov 3, 2025

nvidia-cosmos / cosmos-transfer2.5

Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.

Python 172 18 Updated Nov 7, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 346 27 Updated Nov 7, 2025

nvidia-cosmos / cosmos-reason1

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 781 65 Updated Nov 7, 2025

runjiali-rl / vmem

[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Python 389 14 Updated Jul 25, 2025

SkyworkAI / Matrix-Game

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,714 176 Updated Oct 4, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,477 1,279 Updated Oct 12, 2025

liruilong940607 / prope

Cameras as Relative Positional Encoding

Python 606 10 Updated Oct 20, 2025

NVlabs / Long-RL

Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)

Python 649 23 Updated Sep 24, 2025

UnrealZoo / unrealzoo-gym

Forked from zfw1226/gym-unrealcv

[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI

Python 206 11 Updated Nov 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiao Fu fuxiao0719

Achievements

Achievements

Block or report fuxiao0719

Stars

nv-tlabs / lyra

baaivision / Emu3.5

imlixinyang / FlashWorld

JunyaoHu / common_metrics_on_video_quality

nv-tlabs / vipe

Parskatt / RoMa

bytetriper / RAE

KwaiVGI / VideoCanvas

NVlabs / rcm

NVlabs / LongLive

nvidia-cosmos / cosmos-transfer2.5

nvidia-cosmos / cosmos-predict2.5

nvidia-cosmos / cosmos-reason1

runjiali-rl / vmem

SkyworkAI / Matrix-Game

Wan-Video / Wan2.2

liruilong940607 / prope

NVlabs / Long-RL

UnrealZoo / unrealzoo-gym

showlab / Awesome-Unified-Multimodal-Models

yaotingwangofficial / Awesome-MCoT

volcengine / verl

fuxiao0719 / PanopticNeRF

nvidia-cosmos / cosmos-predict2

YichongLu / Orientation_Matters

guandeh17 / Self-Forcing

pandayuanyu / generative-photography

xizaoqu / WorldMem

ByteDance-Seed / Bagel

KwaiVGI / RoboMaster