Skip to content
View zkz515's full-sized avatar

Block or report zkz515

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ICML 2026] 🏂 World Guidance: World Modeling in Condition Space for Action Generation

Python 140 4 Updated Apr 28, 2026

《XQuant:人人都是量化交易员》开源书稿

TypeScript 268 44 Updated Jun 16, 2026

Source code for 👏🏻"CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos"

Python 35 Updated Jun 14, 2026

When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering (RSS 2026)

Python 2 Updated Jun 3, 2026

Next Forcing: World Action Modeling with Multi-Chunk Prediction (MCP)

JavaScript 65 Updated Jun 12, 2026

STEP: Warm-Started Visuomotor Policies with Spatiotemporal Consistency Prediction

Python 8 Updated Mar 31, 2026
Python 225 12 Updated Jun 1, 2026

The official repository of Qwen-VLA

614 24 Updated May 29, 2026

ViGeo: Towards Consistent Video Geometry Estimation

Python 81 1 Updated Jun 18, 2026

Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Python 626 9 Updated Jun 17, 2026

Official implementation of No Pose, No Problem in 4D: Feed-Forward Dynamic Gaussians from Unposed Multi-View Videos

Python 61 1 Updated May 27, 2026

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Python 761 37 Updated Jun 3, 2026

Official Implemenation for RAEv2: Improved Baselines with Representation Autoencoders

Python 272 11 Updated May 21, 2026

OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation

54 1 Updated Mar 25, 2026

OPENTOUCH: Bringing Full-Hand Touch to Real-World Interaction

Python 57 6 Updated Mar 18, 2026

Interactive World Model papers organized by core research challenges.

Python 238 8 Updated Jun 11, 2026

A platform for reproducible world model research and evaluation

Python 1,886 215 Updated Jun 18, 2026

MotuBrain: An Advanced World Action Model for Robot Control

37 Updated Jun 14, 2026

A Minimalist, Batteries-included Repository for Advancing World Model Science.

Python 624 34 Updated Jun 15, 2026
Python 558 25 Updated Jun 8, 2026

[ICML 2026] ResVLA: From Noise to Intent: Anchoring Generative VLA Policies with Residual Bridges

Python 22 Updated Jun 1, 2026

Code for "Predicting What Matters: Robust Generalist Robot Policy Learning via Future Semantic Mask".

Python 23 Updated Jun 8, 2026
Python 151 8 Updated Jun 10, 2026

A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.

HTML 851 21 Updated Jun 18, 2026

[CVPR 2026 Oral] VGGT Omega

Python 3,067 134 Updated May 18, 2026

Implementation for paper "Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models".

Python 112 4 Updated May 17, 2026

repository for training action-conditioned latent diffusion world models for robot video generation

Python 66 2 Updated May 29, 2026

BAGEL as world models for VLA

Python 6 Updated May 17, 2026

Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from Deepmind

Python 71 Updated Jun 8, 2026

[CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface

Python 82 4 Updated Mar 10, 2026
Next