The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,909 6,316 Updated Sep 18, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,611 489 Updated Aug 7, 2024

X-Square-Robot / wall-x

Building General-Purpose Robots Based on Embodied Foundation Model

Python 817 68 Updated Apr 7, 2026

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 23,152 4,231 Updated Apr 12, 2026

OpenTeleVision / TeleVision

[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Python 1,230 129 Updated Sep 27, 2024

EleutherAI / pythia

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,767 209 Updated Nov 15, 2025

GreatenAnoymous / RGBTrack

Python 69 5 Updated Mar 24, 2025

OpenHelix-Team / ReconVLA

Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.

Python 236 19 Updated Apr 1, 2026

MarkFzp / act-plus-plus

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,596 645 Updated May 15, 2024

RoboTwin-Platform / RoboTwin

RoboTwin 2.0 Offical Repo

Python 2,179 333 Updated Apr 10, 2026

gemcollector / maniwhere

This is the repo of CoRL 2024 paper "Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning"

Python 84 5 Updated Dec 13, 2024

CNJianLiu / SinRef-6D

Code for "Novel Object 6D Pose Estimation with a Single Reference View".

51 2 Updated Aug 18, 2025

taeyeopl / Any6D

[CVPR 2025] Any6D: Model-free 6D Pose Estimation of Novel Objects

Jupyter Notebook 420 36 Updated Aug 31, 2025

SpatialVLA / SpatialVLA

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 678 47 Updated Jun 23, 2025

oymotion / roh_firmware

Docs and firmware for OYMotion robotic hand.

C 12 5 Updated Apr 2, 2026

microsoft / Magma

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,916 158 Updated Mar 3, 2026

aiming-lab / GRAPE

GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization

Python 159 8 Updated Apr 6, 2025

Psi-Robot / DexGraspVLA

[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 502 41 Updated Aug 10, 2025

geopavlakos / hamer

HaMeR: Reconstructing Hands in 3D with Transformers

Python 939 137 Updated Feb 7, 2026

facebookresearch / hot3d

HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos, CVPR 2025

Python 234 27 Updated Apr 7, 2026

shreyashampali / HOnnotate

CVPR2020. HOnnotate: A method for 3D Annotation of Hand and Object Poses

Python 182 15 Updated Jul 29, 2020

facebookresearch / foundpose

FoundPose: Unseen Object Pose Estimation with Foundation Features, ECCV 2024

Python 122 15 Updated Sep 1, 2025

drshashwat / pointcloud_registration

This module provides functions for point cloud registration using Open3D. It includes functions for preprocessing point clouds, executing global registration, refining registration using ICP, and p…

Python 4 Updated Feb 6, 2024

ktgiahieu / PPF-MEAM

Point Pair Feature-Based Pose Estimation with Multiple Edge Appearance Models (PPF-MEAM) for Robotic Bin Picking

C++ 59 11 Updated Dec 14, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jingranxia

Block or report Jingranxia

Stars

NVlabs / Fast-FoundationStereo

humansensinglab / Hamba

Physical-Intelligence / openpi

datawhalechina / leedl-tutorial

datawhalechina / easy-rl

RLinf / RLinf

facebookresearch / segment-anything