Skip to content
View Zhoues's full-sized avatar
😎
keep doing research!
😎
keep doing research!

Organizations

@camel-ai

Block or report Zhoues

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,488 202 Updated May 7, 2025

Interleaving Reasoning: Next-Generation Reasoning Systems for AGI

223 10 Updated Oct 17, 2025

A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)

399 21 Updated Dec 10, 2025

Depth Anything 3

Python 3,669 318 Updated Dec 12, 2025

Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"

23 1 Updated Dec 23, 2025

Any4D: Unified Feed-Forward Metric 4D Reconstruction

Python 185 5 Updated Dec 12, 2025

MM-ACT: Learn from Multimodal Parallel Generation to Act

Python 83 4 Updated Dec 19, 2025

[NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels

Jupyter Notebook 146 11 Updated Dec 19, 2025

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 372 12 Updated Nov 25, 2025

The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.

Python 16 Updated Dec 19, 2025

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 232 4 Updated Nov 27, 2025

Training VLM agents with multi-turn reinforcement learning

Python 351 42 Updated Dec 1, 2025

Thinking in 360°: Humanoid Visual Search in the Wild

Python 84 Updated Dec 5, 2025

Code release for https://kovenyu.com/WonderWorld/

Python 695 34 Updated Apr 14, 2025

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 802 65 Updated Dec 3, 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 306 10 Updated Dec 1, 2025

The official repo for SpaceVista: All-Scale Visual Spatial Reasoning from mm to km.

Python 37 1 Updated Oct 13, 2025

[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment

Python 152 5 Updated Sep 25, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,565 160 Updated Dec 18, 2025

A curated collection of fun and creative examples generated with Nano Banana & Nano Banana Pro🍌, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the commu…

19,051 1,985 Updated Dec 12, 2025

Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"

Python 115 3 Updated Aug 21, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,574 120 Updated Dec 9, 2025

Official Repository for MolmoAct

Python 276 30 Updated Dec 11, 2025
Python 60 3 Updated Dec 14, 2024

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Python 79 7 Updated May 17, 2025

具身智能入门自学

21 2 Updated Apr 21, 2025

Official codebase for "Any-point Trajectory Modeling for Policy Learning"

Python 267 32 Updated Jun 19, 2025

[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy

Python 868 39 Updated Sep 26, 2025
Python 68 4 Updated Nov 21, 2025

[Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).

442 11 Updated Dec 1, 2025
Next