The implementation of our ICRA 2024 submission "Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction"

Python 62 2 Updated Mar 11, 2024

hustvl / MapTR

[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

Python 1,519 244 Updated Mar 3, 2025

zhangzaibin / spagent

SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.

Python 191 30 Updated Jun 17, 2026

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,278 2,869 Updated Mar 5, 2026

bethgelab / supersanity

A critical analysis of the Cambrian-S model and VSI-Super benchmarks

Python 15 Updated Nov 20, 2025

wzzheng / StreamVGGT

[ICLR 2026] Streaming 4D Visual Geometry Transformer

Python 929 48 Updated Oct 27, 2025

DLR-RM / BlenderProc

A procedural Blender pipeline for photorealistic training image generation

Python 3,596 511 Updated Jan 20, 2026

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,407 1,494 Updated May 19, 2026

nv-tlabs / PartField

[ICCV 2025] PartField: Learning 3D Feature Fields for Part Segmentation and Beyond

Python 431 39 Updated Jun 2, 2026

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 6,962 826 Updated Jun 2, 2026

Visual-AI / 3DRS

[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding

Python 158 Updated Dec 9, 2025

embodied-generalist / embodied-generalist

[ICML 2024] LEO: An Embodied Generalist Agent in 3D World

Python 485 42 Updated Apr 20, 2025

zhangquanchen / 3DThinker

[CVPR 2026] Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Python 237 7 Updated May 7, 2026

LaVi-Lab / VG-LLM

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 240 8 Updated Nov 28, 2025

mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM

A paper list for spatial reasoning

755 42 Updated Jan 19, 2026

XiaomiMiMo / MiMo-Embodied

MiMo-Embodied

Python 389 17 Updated Apr 15, 2026

Reagan1311 / Image-Video-Interaction-Generation

Collection of papers on human-object-interaction generation

1 Updated Nov 15, 2025

iLearn-Lab / ICLR25-3D_ADLLM

[ICLR 2025] Official Implementation for 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Python 18 Updated Apr 7, 2026

eat-slim / PointNeXt_pure_python

Python 45 5 Updated Dec 29, 2022

facebookresearch / Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,385 514 Updated Jul 29, 2024

hq-King / Affordance-R1

code for affordance-r1

Python 73 3 Updated May 11, 2026

HeegerGao / VLA-OS

Official Code For VLA-OS.

Python 143 8 Updated Jun 25, 2025

hustvl / EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 502 25 Updated Mar 17, 2025

Yingbo Tang tyb197

Stars

yuantianyuan01 / FastWAM

rail-berkeley / crossformer

Jerry2398 / BabyHappyForest

2toinf / X-VLA

starVLA / starVLA

dexmal / dexbotic

Robbyant / lingbot-vla

xjtu-cs-gao / SatforHDMap