Skip to content
View Yioutpi's full-sized avatar

Block or report Yioutpi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
161 stars written in Python
Clear filter

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 821 49 Updated Nov 6, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 814 93 Updated Sep 9, 2025

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Python 799 199 Updated Feb 17, 2025

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 741 92 Updated Sep 8, 2025

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 724 98 Updated Oct 29, 2025

Unofficial ROS2 SDK support for Unitree GO2 AIR/PRO/EDU

Python 719 158 Updated Oct 31, 2025

[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking

Python 697 149 Updated Apr 13, 2024

RoboBrain 2.0: Advanced version of RoboBrain. See Better. Think Harder. Do Smarter. 🎉🎉🎉

Python 681 57 Updated Sep 30, 2025

Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.

Python 680 126 Updated Oct 29, 2023

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 661 38 Updated Oct 22, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 638 47 Updated Jun 13, 2025

🎁 A collection of utilities for LeRobot.

Python 627 52 Updated Oct 30, 2025

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Python 623 46 Updated Feb 13, 2024

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Python 615 42 Updated Jan 17, 2024

Transformer Tracking (CVPR2021)

Python 610 106 Updated Jul 1, 2023

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 609 72 Updated Jan 7, 2025

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 590 21 Updated Oct 29, 2024

Building General-Purpose Robots Based on Embodied Foundation Model

Python 587 37 Updated Nov 7, 2025

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python 580 27 Updated May 7, 2025

Official Python toolkit for generic object tracking benchmark GOT-10k and beyond

Python 579 95 Updated Oct 3, 2023

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 565 32 Updated Jun 23, 2025

[ECCV 2022] Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Python 536 74 Updated Aug 3, 2023

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 522 6 Updated Oct 26, 2025

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Python 502 71 Updated Feb 28, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 465 40 Updated Apr 20, 2025

Differentiable IoU of rotated bounding boxes using Pytorch

Python 437 65 Updated Jul 26, 2022

MDNet PyTorch implementation

Python 433 151 Updated Nov 1, 2019

[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Python 430 29 Updated Apr 27, 2024

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 420 21 Updated Nov 6, 2025
Python 408 20 Updated Jan 24, 2025