Stars
A MuJoCo/Gym environment for robot control using Reinforcement Learning. The task of agents in this environment is pixel-wise prediction of grasp success chances.
Reinforcement Learning with Model Predictive Control
🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
[RSS 2024] Agile But Safe: Learning Collision-Free High-Speed Legged Locomotion
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
This is a new repo used for training UAV navigation (local path planning) policy using DRL methods.
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
Reinforcement learning algorithms for MuJoCo tasks
Educational Python library for manipulator motion planning
A project for 3D multi-object tracking
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
Official implementation of Implicit Behavioral Cloning, as described in our CoRL 2021 paper, see more at https://implicitbc.github.io/
Code for the paper "3D Diffuser Actor: Policy Diffusion with 3D Scene Representations"
Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)
MichalZawalski / embodied-CoT
Forked from openvla/openvlaEmbodied Chain of Thought: A robotic policy that reason to solve the task.
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
Repository to accompany RSS 2018 paper on dexterous hand manipulation
The suite of modeling video with Mamba
Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".
Implementation of 6-DoF GraspNet with tensorflow and python. This repo has been tested with python 2.7 and tensorflow 1.12.
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)