- Hong Kong (UTC +08:00)
- http://fuxiao0719.github.io/
- @lemonaddie0909

Stars
Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
A unified inference and post-training framework for accelerated video generation.
Cosmos-Reason1 models understand physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning.
A generative world for general-purpose robotics & embodied AI learning.
Making large AI models cheaper, faster and more accessible
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
verl: Volcano Engine Reinforcement Learning for LLMs
A Paper List for Humanoid Robot Learning.
UnrealZoo / unrealzoo-gym
Forked from zfw1226/gym-unrealcv. [ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
rCM: SOTA Diffusion Distillation & Few-Step Video Generation
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
LongLive: Real-time Interactive Long Video Generation
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
A growing curation of Text-to-3D, Diffusion-to-3D works.
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.