Stars
MiniMax-M2, a model built for Max coding & agentic workflows.
[ECCV 2024] GenAD: Generative End-to-End Autonomous Driving
This is the official implementation of "OpenREAD:Reinforced Open-Ended Reasoning for End-to-End Autonomous Driving with LLM-as-Critic"
[AAAI 2026] OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
[ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
[CoRL 2025] CaRL: Learning Scalable Planning Policies with Simple Rewards
A collection of reference environments for offline reinforcement learning
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
[CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
[NeurIPS 2024] Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
[ICCV 2025] ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models
[CVPR 2026] Drive-π0 and DriveMoE on End-to-end Autonomous Driving
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Latest Advances on Vison-Language-Action Models.
implementation of dualformer
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model