Skip to content
View Yioutpi's full-sized avatar

Block or report Yioutpi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
160 stars written in Python
Clear filter

starVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 385 18 Updated Nov 6, 2025

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 381 33 Updated Apr 5, 2025

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

Python 378 27 Updated Jul 24, 2025

Code for "AutoRecon: Automated 3D Object Discovery and Reconstruction" CVPR 2023 (Highlight)

Python 364 16 Updated Feb 18, 2025

The Most Faithful Implementation of Segment Anything (SAM) in 3D

Python 348 16 Updated Sep 11, 2024

[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World

Python 340 17 Updated Oct 21, 2025

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 322 20 Updated Aug 6, 2025

The Simulation Framework from AgiBot

Python 319 28 Updated Sep 18, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Python 310 9 Updated Oct 3, 2025

Galaxea's first VLA release

Python 309 16 Updated Oct 23, 2025

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Python 308 13 Updated Jul 17, 2024

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 285 15 Updated Apr 22, 2024

Open source library for Single Object Tracking in point clouds.

Python 276 39 Updated Oct 9, 2023

GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data

Python 269 7 Updated Jul 26, 2025

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Python 268 4 Updated Mar 19, 2025
Python 255 45 Updated Oct 18, 2022

[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation

Python 252 11 Updated Jul 8, 2025

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Python 246 17 Updated Oct 27, 2025

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 233 10 Updated Oct 30, 2025

[NeurIPS 2023] PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Python 233 23 Updated Jul 1, 2024

Codebase for Automated Creation of Digital Cousins for Robust Policy Learning

Python 232 20 Updated Mar 31, 2025

Full reimplementation of siamese rpn, has 0.24 eao on vot2017.

Python 226 41 Updated Sep 9, 2021

[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"

Python 210 10 Updated Nov 6, 2025

Entity Alignment toolkit (EAkit), a lightweight, easy-to-use and highly extensible PyTorch implementation of many entity alignment algorithms.

Python 207 24 Updated Oct 24, 2022

[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Python 200 8 Updated Jun 30, 2025

Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)

Python 198 11 Updated Oct 20, 2025

[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

Python 197 9 Updated Oct 19, 2024

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

Python 194 35 Updated Jul 25, 2024

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022) & TCTrack++ (TPAMI)

Python 194 39 Updated Aug 29, 2023

Official implementation for the CVPR2021 paper Alpha-Refine

Python 192 30 Updated Oct 3, 2023