Stars
Elevate your AI research writing, no more tedious polishing ✨
Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
Robotics research demonstrating reliability and robustness in the real world (continuously updated)
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" (ICLR2026)
A Survey on Reinforcement Learning of Vision-Language-Action Models for Robotic Manipulation
Dexbotic: Open-Source Vision-Language-Action Toolbox
[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., Pi0, Pi0.5, GR00TN1.5. Fully open-sourced.
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.
This tool has been deprecated. Use Agentic Document Extraction instead.
😼 优雅地使用基于 clash/mihomo 的代理环境
A Self-Training Framework for Vision-Language Reasoning
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
MichalZawalski / embodied-CoT
Forked from openvla/openvlaEmbodied Chain of Thought: A robotic policy that reason to solve the task.
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.