160 stars written in Python

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 820 49 Updated Nov 6, 2025

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 807 92 Updated Sep 9, 2025

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Python 797 199 Updated Feb 17, 2025

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 737 91 Updated Sep 8, 2025

Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.

Python 723 98 Updated Oct 29, 2025

Unofficial ROS2 SDK support for Unitree GO2 AIR/PRO/EDU

Python 717 157 Updated Oct 31, 2025

[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking

Python 696 149 Updated Apr 13, 2024

Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.

Python 679 126 Updated Oct 29, 2023

RoboBrain 2.0: Advanced version of RoboBrain. See Better. Think Harder. Do Smarter. 🎉🎉🎉

Python 677 57 Updated Sep 30, 2025

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Python 660 38 Updated Oct 22, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 638 47 Updated Jun 13, 2025

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Python 622 46 Updated Feb 13, 2024

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Python 615 42 Updated Jan 17, 2024

🎁 A collection of utilities for LeRobot.

Python 613 52 Updated Oct 30, 2025

Transformer Tracking (CVPR2021)

Python 609 106 Updated Jul 1, 2023

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 608 72 Updated Jan 7, 2025

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 589 21 Updated Oct 29, 2024

Building General-Purpose Robots Based on Embodied Foundation Model

Python 581 38 Updated Nov 4, 2025

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python 580 27 Updated May 7, 2025

Official Python toolkit for generic object tracking benchmark GOT-10k and beyond

Python 579 95 Updated Oct 3, 2023

🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.

Python 564 33 Updated Jun 23, 2025

[ECCV 2022] Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Python 536 74 Updated Aug 3, 2023

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 523 5 Updated Oct 26, 2025

[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention

Python 501 71 Updated Feb 28, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 465 40 Updated Apr 20, 2025

Differentiable IoU of rotated bounding boxes using PyTorch

Python 437 65 Updated Jul 26, 2022

MDNet PyTorch implementation

Python 433 151 Updated Nov 1, 2019

[CVPR'24 Highlight] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Python 430 29 Updated Apr 27, 2024
Python 405 20 Updated Jan 24, 2025

PyViz3D is a web-based visualizer for 3D objects and point clouds.

Python 391 28 Updated Dec 19, 2024