RobbinW


WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Jupyter Notebook 87 2 Updated Apr 1, 2026

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 349 17 Updated Apr 3, 2026

[CVPR'26] Semi-Supervised Conformal Prediction With Unlabeled Nonconformity Score

Python 3 Updated Mar 26, 2026

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 1,894 189 Updated Mar 27, 2026

EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards

Python 30 1 Updated Mar 25, 2026

PAct: Part-Decomposed Single-View Articulated Object Generation

Jupyter Notebook 42 Updated Mar 2, 2026

Causal video-action world model for generalist robot control

Python 949 64 Updated Feb 27, 2026

Advancing Open-source World Models

Python 3,310 272 Updated Apr 2, 2026

(arXiv) MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Python 1,121 49 Updated Feb 26, 2026

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 10,009 800 Updated Mar 30, 2026

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 1,200 67 Updated Nov 9, 2025

[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"

Python 649 32 Updated Jul 1, 2025

AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation

Python 35 1 Updated Jul 25, 2025

Official repo for vidar and vidarc: video foundation model for robotics.

Python 40 1 Updated Dec 22, 2025

WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine, reason, and act in the physical world. Unlike passive vide…

Jupyter Notebook 150 11 Updated Jan 4, 2026

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,497 97 Updated Sep 11, 2025

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,808 277 Updated Apr 2, 2026

A pipeline parallel training script for diffusion models.

Python 1,907 266 Updated Feb 8, 2026

Enjoy the magic of Diffusion models!

Python 12,164 1,183 Updated Apr 2, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,030 1,821 Updated Mar 17, 2026

[ICRA 2026] VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos

Python 343 19 Updated Feb 24, 2026

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,584 1,268 Updated Nov 4, 2025

Pusa: Thousands Timesteps Video Diffusion Model

Python 677 47 Updated Feb 13, 2026

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,743 803 Updated Dec 8, 2022

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,331 71 Updated Jan 27, 2026

An end-to-end, GPU-accelerated, and modular platform for building generalized Embodied Intelligence.

Python 134 9 Updated Apr 3, 2026

Official code of Motus: A Unified Latent Action World Model

Python 925 42 Updated Jan 5, 2026

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,741 241 Updated Dec 17, 2025