Enshen Zhou Zhoues

😎

keep doing research!

PhD student @ BUAA. Passionate about Embodied AI and Agents.

130 followers · 36 following

Beihang University
Shenzhen, China
https://zhoues.github.io/

Lists (16)

💸 3D Assert

🌐 3D Vision

😃 Agent

⭐ awesome-paper-list

29 repositories

🦾 Bimanual Manipulation

🤗 Embodied AI

90 repositories

🥇 Foundation Model

15 repositories

🎨 Image Generation

🤖 Minecraft Agent

Agent in Minecraft

😆 MLLM & LLM

19 repositories

🚀 Navigation

🤯 Reasoning

🎇 Spatial Intelligence

25 repositories

🔧 Useful Tool

💯 VLM for Robotics

🎥 World Model

16 repositories

Stars

atfortes / Awesome-LLM-Reasoning

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,488 stars · 202 forks · Updated May 7, 2025

Osilly / Awesome-Interleaving-Reasoning

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

223 stars · 10 forks · Updated Oct 17, 2025

yukangcao / Awesome-4D-Spatial-Intelligence

A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)

399 stars · 21 forks · Updated Dec 10, 2025

ByteDance-Seed / Depth-Anything-3

Depth Anything 3

Python · 3,669 stars · 318 forks · Updated Dec 12, 2025

Zhoues / RoboTracer

Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"

23 stars · 1 fork · Updated Dec 23, 2025

Any-4D / Any4D

Any4D: Unified Feed-Forward Metric 4D Reconstruction

Python · 185 stars · 5 forks · Updated Dec 12, 2025

HHYHRHY / MM-ACT

MM-ACT: Learn from Multimodal Parallel Generation to Act

Python · 83 stars · 4 forks · Updated Dec 19, 2025

IGL-HKUST / TrackingWorld

[NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

Jupyter Notebook · 146 stars · 11 forks · Updated Dec 19, 2025

KangLiao929 / Puffin

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python · 372 stars · 12 forks · Updated Nov 25, 2025

zhangzef / COOPER

The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.

Python · 16 stars · Updated Dec 19, 2025

InternRobotics / G2VLM

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python · 232 stars · 4 forks · Updated Nov 27, 2025

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python · 351 stars · 42 forks · Updated Dec 1, 2025

humanoid-vstar / hstar

Thinking in 360°: Humanoid Visual Search in the Wild

Python · 84 stars · Updated Dec 5, 2025

KovenYu / WonderWorld

Code release for https://kovenyu.com/WonderWorld/

Python · 695 stars · 34 forks · Updated Apr 14, 2025

open-gigaai / giga-world-0

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python · 802 stars · 65 forks · Updated Dec 3, 2025

lifuguan / IGGT_official

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python · 306 stars · 10 forks · Updated Dec 1, 2025

PeiwenSun2000 / SpaceVista

The official repo for SpaceVista: All-Scale Visual Spatial Reasoning from mm to km.

Python · 37 stars · 1 fork · Updated Oct 13, 2025

WU-CVGL / SIU3R

[NeurIPS 2025 Spotlight] Official implementation of SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Python · 152 stars · 5 forks · Updated Sep 25, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python · 2,565 stars · 160 forks · Updated Dec 18, 2025

PicoTrex / Awesome-Nano-Banana-images

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

19,051 stars · 1,985 forks · Updated Dec 12, 2025

pickxiguapi / Embodied-R1

Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"

Python · 115 stars · 3 forks · Updated Aug 21, 2025

nv-tlabs / vipe

ViPE: Video Pose Engine for Geometric 3D Perception

Python · 1,574 stars · 120 forks · Updated Dec 9, 2025

allenai / molmoact

Official Repository for MolmoAct

Python · 276 stars · 30 forks · Updated Dec 11, 2025

Dantong88 / LLARVA

Python · 60 stars · 3 forks · Updated Dec 14, 2024

declare-lab / Emma-X

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Python · 79 stars · 7 forks · Updated May 17, 2025

0-693 / VLA-Embodied-Intelligence-Learning

Self-study introduction to embodied intelligence

21 stars · 2 forks · Updated Apr 21, 2025

Large-Trajectory-Model / ATM

Official codebase for "Any-point Trajectory Modeling for Policy Learning"

Python · 267 stars · 32 forks · Updated Jun 19, 2025

henry123-boy / SpaTrackerV2

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python · 868 stars · 39 forks · Updated Sep 26, 2025

A-embodied / A0

Python · 68 stars · 4 forks · Updated Nov 21, 2025

Songwxuan / Embodied-AI-Paper-TopConf

[Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).

442 stars · 11 forks · Updated Dec 1, 2025