Xin Zhou LMD0311

🎯

Focusing

98 followers · 36 following

Huazhong University of Science & Technology
Wuhan, Hubei Province, China
https://orcid.org/0009-0009-4752-6118
@THELMDOFZHOUXIN
https://lmd0311.github.io/

Lists (1)

🚀 My stack

1 repository

Stars

370 results for source starred repositories

dreamzero0 / dreamzero

Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0

Python 461 10 Updated Feb 5, 2026

Robbyant / lingbot-va

Causal video-action world model for generalist robot control

Python 537 21 Updated Feb 6, 2026

Robbyant / lingbot-world

Advancing Open-source World Models

Python 2,610 208 Updated Feb 2, 2026

JaceyHuang / Gen3R

Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction

Python 172 3 Updated Jan 14, 2026

IamCreateAI / NeoVerse

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

342 7 Updated Jan 5, 2026

InternRobotics / InternVLA-A1

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Jupyter Notebook 325 20 Updated Feb 3, 2026

DreamLM / Dream-VLX

Dream-VL and Dream-VLA, a diffusion VLM and a diffusion VLA.

Python 101 4 Updated Jan 14, 2026

pengxuanyang / WorldRFT

[AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving

29 Updated Dec 23, 2025

thu-ml / Motus

Official code of Motus: A Unified Latent Action World Model

Python 683 20 Updated Jan 5, 2026

bingreeky / MemEvolve

MemEvolve & EvolveLab

Python 159 21 Updated Dec 23, 2025

allenai / molmo2

Code for the Molmo2 Vision-Language Model

153 4 Updated Dec 16, 2025

MiniMax-AI / VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 439 10 Updated Dec 16, 2025

wzzheng / DVGT

Visual Geometry Transformer for Autonomous Driving

Python 181 8 Updated Dec 19, 2025

2toinf / X-VLA

[ICLR 2026] The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 498 36 Updated Feb 2, 2026

DepthAnything / PromptDA

[CVPR 2025] Prompt Depth Anything

Python 1,049 63 Updated Jan 29, 2026

Tencent-Hunyuan / HY-WorldPlay

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,109 90 Updated Jan 13, 2026

microsoft / TRELLIS.2

Native and Compact Structured Latents for 3D Generation

Python 3,468 328 Updated Jan 10, 2026

alibaba-damo-academy / T2I-Distill

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python 338 22 Updated Dec 31, 2025

xiaomi-mlab / MindDrive

Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”

118 5 Updated Jan 31, 2026

RuiningLi / particulate

Official Implementation of Particulate: Feed-Forward 3D Object Articulation

Python 107 6 Updated Jan 25, 2026

KlingTeam / SVG-T2I

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Python 130 7 Updated Dec 18, 2025

Ivan-Tang-3D / 3DGen-R1

The official implementation of the paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"

Python 95 1 Updated Dec 28, 2025

End2End-Diffusion / iREPA

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 216 9 Updated Dec 15, 2025

MoyangSensei / AwesomeRobustDWM

Repository of the survey: Progressive Robustness-Aware World Models in Autonomous Driving: A Review and Outlook

15 1 Updated Dec 15, 2025

jin-cao-tma / WonderZoom

Code release for https://wonderzoom.github.io/

152 4 Updated Dec 11, 2025

showlab / H2R-Grounder

A V2V framework that translates human interaction videos into robot manipulation videos.

22 1 Updated Dec 12, 2025

alibaba-damo-academy / RynnVLA-002

RynnVLA-002: A Unified Vision-Language-Action and World Model

Python 875 49 Updated Dec 2, 2025

EternalEvan / Astra

[ICLR 2026] Astra: General Interactive World Model with Autoregressive Denoising

Python 208 5 Updated Feb 2, 2026

worldbench / WorldLens

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 178 14 Updated Jan 18, 2026

Dynamics-X / DynamicVerse

[NeurIPS 2025] "DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"

Python 93 3 Updated Dec 21, 2025