StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
-
Updated
Jun 12, 2026 - Python
StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
Dexbotic: Open-Source Vision-Language-Action Toolbox
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
Unified Codebase for Advanced World Models.
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models
[CVPR 2025, Spotlight] SimLingo (CarLLava): Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
[CVPR 2026] WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., Pi0, Pi0.5, GR00TN1.5. Fully open-sourced.
1st place solution of 2025 BEHAVIOR Challenge
CLI for Robot Training and Deployment
WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)
Add a description, image, and links to the vla topic page so that developers can more easily learn about it.
To associate your repository with the vla topic, visit your repo's landing page and select "manage topics."