Yioutpi

Fangjing Wang Yioutpi

20 followers · 26 following

SUSTech
Shenzhen

Stars

160 stars written in Python

Clear filter

starVLA / starVLA

starVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 385 18 Updated Nov 6, 2025

MarSaKi / ETPNav

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 381 33 Updated Apr 5, 2025

hovsg / HOV-SG

[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"

Python 378 27 Updated Jul 24, 2025

zju3dv / AutoRecon

Code for "AutoRecon: Automated 3D Object Discovery and Reconstruction" CVPR 2023 (Highlight)

Python 364 16 Updated Feb 18, 2025

ZiyuGuo99 / SAM2Point

The Most Faithful Implementation of Segment Anything (SAM) in 3D

Python 348 16 Updated Sep 11, 2024

ZCMax / LLaVA-3D

[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World

Python 340 17 Updated Oct 21, 2025

OpenMOSS / VLABench

Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.

Python 322 20 Updated Aug 6, 2025

AgibotTech / genie_sim

The Simulation Framework from AgiBot

Python 319 28 Updated Sep 18, 2025

PKU-HMI-Lab / Hybrid-VLA

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Python 310 9 Updated Oct 3, 2025

OpenGalaxea / G0

Galaxea's first VLA release

Python 309 16 Updated Oct 23, 2025

Open3DA / LL3DA

[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.

Python 308 13 Updated Jul 17, 2024

bytedance / GR-1

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 285 15 Updated Apr 22, 2024

Ghostish / Open3DSOT

Open source library for Single Object Tracking in point clouds.

Python 276 39 Updated Oct 9, 2023

PKU-EPIC / GraspVLA

GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data

Python 269 7 Updated Jul 26, 2025

scene-verse / SceneVerse

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Python 268 4 Updated Mar 19, 2025

LitingLin / SwinTrack

Python 255 45 Updated Oct 18, 2022

InternRobotics / Seer

[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation

Python 252 11 Updated Jul 8, 2025

alibaba-damo-academy / RynnVLA-001

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Python 246 17 Updated Oct 27, 2025

InternRobotics / InternVLA-M1

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 233 10 Updated Oct 30, 2025

CGuangyan-BIT / PointGPT

[NeurIPS 2023] PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Python 233 23 Updated Jul 1, 2024

cremebrule / digital-cousins

Codebase for Automated Creation of Digital Cousins for Robust Policy Learning

Python 232 20 Updated Mar 31, 2025

HelloRicky123 / Siamese-RPN

Full reimplementation of siamese rpn, has 0.24 eao on vot2017.

Python 226 41 Updated Sep 9, 2021

2toinf / UniAct

[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"

Python 210 10 Updated Nov 6, 2025

THU-KEG / EAkit

Entity Alignment toolkit (EAkit), a lightweight, easy-to-use and highly extensible PyTorch implementation of many entity alignment algorithms.

Python 207 24 Updated Oct 24, 2022

qizekun / SoFar

[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Python 200 8 Updated Jun 30, 2025

ZzZZCHS / Chat-Scene

Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)

Python 198 11 Updated Oct 20, 2025

Pointcept / OpenIns3D

[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

Python 197 9 Updated Oct 19, 2024

HaozheQi / P2B

P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds

Python 194 35 Updated Jul 25, 2024

vision4robotics / TCTrack

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022) & TCTrack++ (TPAMI)

Python 194 39 Updated Aug 29, 2023

MasterBin-IIAU / AlphaRefine

Official implementation for the CVPR2021 paper Alpha-Refine

Python 192 30 Updated Oct 3, 2023

Previous Next

Provide feedback

Saved searches

Use saved searches to filter your results more quickly