Awesome work on hand pose estimation/tracking

Python 3,355 530 Updated Dec 1, 2025

[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth

Python 1,179 69 Updated Apr 1, 2026

Galaxea's open-source VLA repository

Python 563 40 Updated Feb 14, 2026

Official code of RDT 2

Python 753 47 Updated Feb 7, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,933 1,725 Updated Jan 30, 2026

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,567 97 Updated Jan 6, 2026

[CoRL 2025] ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training

Python 186 15 Updated Nov 13, 2025

Code for RSS 2025 paper "Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies"

Python 40 6 Updated Jun 18, 2025

Visual Imitation Enables Contextual Humanoid Control. CoRL 2025, Best Student Paper Award.

Python 768 57 Updated Nov 25, 2025

Cameras as Relative Positional Encoding

Python 701 11 Updated Dec 18, 2025

[CoRL 2025] RISE-2: A Generalizable Imitation Learning Policy

Python 61 1 Updated Nov 29, 2025

[NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation

Python 186 13 Updated Dec 2, 2025

🦾 A Dual-System VLA with System2 Thinking

Python 139 3 Updated Aug 21, 2025

[CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation

Python 718 63 Updated Sep 14, 2025

Universal Monocular Metric Depth Estimation

Python 1,166 109 Updated May 18, 2025
Python 175 14 Updated Nov 27, 2025

[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"

Python 240 9 Updated Dec 16, 2025

DreamGen: Nvidia GEAR Lab's initiative to solve the robotics data problem using world models

Jupyter Notebook 525 52 Updated Oct 24, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 6,630 1,110 Updated Apr 8, 2026

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,912 128 Updated Apr 4, 2026
Jupyter Notebook 90 4 Updated Sep 23, 2025

DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation

C 197 24 Updated Oct 2, 2025

PyTorch implementation (unofficial) of the paper "Mean Flows for One-step Generative Modeling" by Geng et al.

Python 1,115 64 Updated Dec 17, 2025

Attention mappers and visualisation for transformer-based Physical AI policies

Python 152 19 Updated Jan 23, 2026

[ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"

C++ 119 16 Updated May 16, 2025

[ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning

Python 45 2 Updated Jan 5, 2025
Python 46 9 Updated Apr 2, 2025