zkz515

fenbiHE515 zkz515

16 followers · 196 following

Fudan University

Lists (8)

Starred repositories

Selen-Suyue / WoG

[ICML 2026] 🏂 World Guidance: World Modeling in Condition Space for Action Generation

Python 140 4 Updated Apr 28, 2026

xingwudao / xquant-beginner

Open-source manuscript of the book "XQuant: Everyone Is a Quant Trader" (《XQuant：人人都是量化交易员》)

TypeScript 268 44 Updated Jun 16, 2026

LinShan-Bin / OpenCLAP

Source code for 👏🏻"CLAP: Contrastive Latent Action Pretraining for Learning Vision-Language-Action Models from Human Videos"

Python 35 Updated Jun 14, 2026

CMU-IntentLab / uncertainty_aware_policy_steering

When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering (RSS 2026)

Python 2 Updated Jun 3, 2026

gangweix / next-forcing

Next Forcing: World Action Modeling with Multi-Chunk Prediction (MCP)

JavaScript 65 Updated Jun 12, 2026

Kimho666 / STEP

STEP: Warm-Started Visuomotor Policies with Spatiotemporal Consistency Prediction

Python 8 Updated Mar 31, 2026

sii-research / tau-0-wm

Python 225 12 Updated Jun 1, 2026

QwenLM / Qwen-VLA

The official repository of Qwen-VLA

614 24 Updated May 29, 2026

aigc3d / ViGeo

ViGeo: Towards Consistent Video Geometry Estimation

Python 81 1 Updated Jun 18, 2026

nv-tlabs / Gamma-World

Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Python 626 9 Updated Jun 17, 2026

bralani / NoPo4D

Official implementation of No Pose, No Problem in 4D: Feed-Forward Dynamic Gaussians from Unposed Multi-View Videos

Python 61 1 Updated May 27, 2026

nv-tlabs / PiD

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Python 761 37 Updated Jun 3, 2026

nanovisionx / RAEv2

Official Implementation for RAEv2: Improved Baselines with Representation Autoencoders

Python 272 11 Updated May 21, 2026

MrSecant / OmniVTA

OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation

54 1 Updated Mar 25, 2026

OpenTouch-MIT / opentouch

OPENTOUCH: Bringing Full-Hand Touch to Real-World Interaction

Python 57 6 Updated Mar 18, 2026

EasonTuT / Awesome-Interactive-World-Model

Interactive World Model papers organized by core research challenges.

Python 238 8 Updated Jun 11, 2026

galilai-group / stable-worldmodel

A platform for reproducible world model research and evaluation

Python 1,886 215 Updated Jun 18, 2026

shengshu-ai / Motubrain

MotuBrain: An Advanced World Action Model for Robot Control

37 Updated Jun 14, 2026

simchowitzlabpublic / nano-world-model

A Minimalist, Batteries-included Repository for Advancing World Model Science.

Python 624 34 Updated Jun 15, 2026

Lijiaxin0111 / Open-d4rt

Python 558 25 Updated Jun 8, 2026

4DVLab / ResVLA

[ICML 2026] ResVLA: From Noise to Intent: Anchoring Generative VLA Policies with Residual Bridges

Python 22 Updated Jun 1, 2026

LYFCLOUDFAN / mask-world-model

Code for "Predicting What Matters: Robust Generalist Robot Policy Learning via Future Semantic Mask".

Python 23 Updated Jun 8, 2026

meituan-longcat / LARYBench

Python 151 8 Updated Jun 10, 2026

OpenMOSS / Awesome-WAM

A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.

HTML 851 21 Updated Jun 18, 2026

facebookresearch / vggt-omega

[CVPR 2026 Oral] VGGT Omega

Python 3,067 134 Updated May 18, 2026

zju-jiyicheng / Forcing-KV

Implementation for paper "Forcing-KV: Hybrid KV Cache Compression for Efficient Autoregressive Video Diffusion Models".

Python 112 4 Updated May 17, 2026

chandar-lab / semantic-wm

Repository for training action-conditioned latent diffusion world models for robot video generation

Python 66 2 Updated May 29, 2026

VLARLKit / BAGEL

BAGEL as world models for VLA

Python 6 Updated May 17, 2026

lucidrains / d4rt

Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, from DeepMind

Python 71 Updated Jun 8, 2026

Real2Edit2Real / Real2Edit2Real

[CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface

Python 82 4 Updated Mar 10, 2026

Lists (8)

AI agent

ai trading

awesome lists

foundation model

reward model

video model

vla

vlm for spatial reasoning

Starred repositories

dexterous-manipulation