Organizations

@H-EmbodVis

Visual Geometry Transformer for Autonomous Driving

Python 75 2 Updated Dec 19, 2025

The official implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 374 25 Updated Dec 3, 2025

[CVPR 2025] Prompt Depth Anything

Python 1,018 58 Updated Sep 2, 2025

WorldPlay: Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 667 36 Updated Dec 19, 2025

Native and Compact Structured Latents for 3D Generation

Python 2,105 138 Updated Dec 17, 2025

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python 230 11 Updated Dec 16, 2025

Official code of “MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning”

86 3 Updated Dec 17, 2025

Official Implementation of Particulate: Feed-Forward 3D Object Articulation

Python 63 4 Updated Dec 15, 2025

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Python 112 2 Updated Dec 18, 2025

The official implementation of the paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"

Python 69 Updated Dec 17, 2025

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 140 7 Updated Dec 15, 2025

Repository of the survey: Progressive Robustness-Aware World Models in Autonomous Driving: A Review and Outlook

10 Updated Dec 15, 2025

Code release for https://wonderzoom.github.io/

53 1 Updated Dec 11, 2025

A V2V framework that translates human interaction videos into robot manipulation videos.

17 1 Updated Dec 12, 2025

RynnVLA-002: A Unified Vision-Language-Action and World Model

Python 787 47 Updated Dec 2, 2025

The official repository of "Astra: General Interactive World Model with Autoregressive Denoising"

Python 166 3 Updated Dec 19, 2025

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 151 10 Updated Dec 19, 2025

[NeurIPS 2025] "DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"

Python 87 3 Updated Dec 21, 2025

Running VLA at 30Hz frame rate and 480Hz trajectory frequency

Python 323 23 Updated Dec 14, 2025

DGGT: Feedforward 4D Reconstruction of Dynamic Driving Scenes using Unposed Images

Python 337 29 Updated Dec 11, 2025

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Python 149 13 Updated Dec 15, 2025

Official implementation of "C3G: Learning Compact 3D Representations with 2K Gaussians"

Python 111 3 Updated Dec 16, 2025

[arXiv] Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration

Jupyter Notebook 29 2 Updated Dec 1, 2025

Learning to Drive via Real-World Simulation at Scale

102 4 Updated Dec 8, 2025

Visual Generation Tuning

Python 75 Updated Dec 1, 2025

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 230 4 Updated Nov 27, 2025

One4D: Unified 4D Generation and Reconstruction

56 1 Updated Dec 2, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,063 99 Updated Dec 19, 2025

[AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution

Python 355 11 Updated Dec 12, 2025