Stars
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementations.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence