Stars
[CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"
RetinaFace: Deep Face Detection Library for Python
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
Embodied Reasoning Question Answer (ERQA) Benchmark
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
This repository provides a valuable reference for researchers in the field of multimodality — start your exploration of RL-based reasoning MLLMs here!
SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
openvla / openvla
Forked from TRI-ML/prismatic-vlms. OpenVLA: An open-source vision-language-action model for robotic manipulation.
MichalZawalski / embodied-CoT
Forked from openvla/openvla. Embodied Chain of Thought: A robotic policy that reasons to solve tasks.
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks
RoboBrain 2.0: Advanced version of RoboBrain. See Better. Think Harder. Do Smarter.
[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!
RoboOS: A Universal Embodied Operating System for Cross-Embodied and Multi-Robot Collaboration
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
State-of-the-art 2D and 3D Face Analysis Project
[RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
PyTorch implementation of paper "ARTrack" and "ARTrackV2"
[AAAI 2024] Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
BoxMOT: Pluggable SOTA multi-object tracking modules for segmentation, object detection, and pose estimation models
Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work in progress; join us!
Official repo and evaluation implementation of VSI-Bench