Skip to content
View fuxiao0719's full-sized avatar

Block or report fuxiao0719

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control inputs.

Python 174 18 Updated Nov 8, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,517 6,482 Updated Nov 8, 2025

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 347 27 Updated Nov 7, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,535 194 Updated Nov 7, 2025

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 781 65 Updated Nov 7, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,564 2,536 Updated Nov 7, 2025

Making large AI models cheaper, faster and more accessible

Python 41,228 4,540 Updated Nov 7, 2025

Native Multimodal Models are World Learners

Python 1,176 41 Updated Nov 7, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,518 111 Updated Nov 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,222 2,445 Updated Nov 7, 2025

A Paper List for Humanoid Robot Learning.

1,181 60 Updated Nov 7, 2025

[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI

Python 206 11 Updated Nov 5, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 10,925 997 Updated Nov 5, 2025

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 266 13 Updated Nov 5, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,553 84 Updated Nov 4, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,103 1,210 Updated Nov 4, 2025

LongLive: Real-time Interactive Long Video Generation

Python 801 49 Updated Nov 3, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,830 187 Updated Nov 3, 2025

A growing curation of Text-to-3D, Diffusion-to-3D works.

TeX 566 34 Updated Nov 2, 2025

GLOMAP - Global Structured-from-Motion Revisited

C++ 2,068 153 Updated Oct 31, 2025

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 723 98 Updated Oct 29, 2025

[ICLR 2025] Autoregressive Video Generation without Vector Quantization

Python 589 19 Updated Oct 29, 2025

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 658 89 Updated Oct 29, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,300 1,208 Updated Oct 28, 2025

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

2,028 123 Updated Oct 27, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,596 177 Updated Oct 27, 2025

Open-source unified multimodal model

Python 5,258 455 Updated Oct 27, 2025

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Python 1,008 48 Updated Oct 26, 2025

[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory

Python 267 11 Updated Oct 25, 2025

[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.

Python 1,032 104 Updated Oct 24, 2025
Next