Skip to content
View tyb197's full-sized avatar
  • Institute of Automation,Chinese Academy of Sciences
  • Beijing

Highlights

  • Pro

Block or report tyb197

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?

Python 996 105 Updated Apr 3, 2026
Python 283 37 Updated Aug 26, 2024
Python 1 1 Updated Mar 7, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 673 64 Updated Jun 10, 2026

StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing

Python 2,875 360 Updated Jun 18, 2026

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 1,221 173 Updated Jun 18, 2026

A Pragmatic VLA Foundation Model

Python 1,475 154 Updated Jun 11, 2026

The implementation of our ICRA2024 submission manuscript paper "Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction"

Python 62 2 Updated Mar 11, 2024

[ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

Python 1,519 244 Updated Mar 3, 2025

SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.

Python 191 30 Updated Jun 17, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,278 2,869 Updated Mar 5, 2026

A critical analysis of the Cambrian-S model and VSI-Super benchmarks

Python 15 Updated Nov 20, 2025

[ICLR 2026] Streaming 4D Visual Geometry Transformer

Python 929 48 Updated Oct 27, 2025

A procedural Blender pipeline for photorealistic training image generation

Python 3,596 511 Updated Jan 20, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,407 1,494 Updated May 19, 2026

[ICCV 2025] PartField: Learning 3D Feature Fields for Part Segmentation and Beyond

Python 431 39 Updated Jun 2, 2026

SAM 3D Objects

Python 6,962 826 Updated Jun 2, 2026

[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding

Python 158 Updated Dec 9, 2025

[ICML 2024] LEO: An Embodied Generalist Agent in 3D World

Python 485 42 Updated Apr 20, 2025

[CVPR 2026] Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Python 237 7 Updated May 7, 2026

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 240 8 Updated Nov 28, 2025

A paper list for spatial reasoning

755 42 Updated Jan 19, 2026

MiMo-Embodied

Python 389 17 Updated Apr 15, 2026

Collection of papers on human-object-interaction generation

1 Updated Nov 15, 2025

[ICLR 2025] Official Implementation for 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds]

Python 18 Updated Apr 7, 2026

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 3,385 514 Updated Jul 29, 2024

code for affordance-r1

Python 73 3 Updated May 11, 2026

Official Code For VLA-OS.

Python 143 8 Updated Jun 25, 2025

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 502 25 Updated Mar 17, 2025
Next