Stars
[ICLR 2025] NextBestPath: Efficient 3D Mapping of Unseen Environments
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos
Enjoy the magic of Diffusion models!
Collection of Composed Image Retrieval (CIR) papers.
HORT: Monocular Hand-held Objects Reconstruction with Transformers, ICCV 2025
A curated list of awesome model based RL resources (continually updated)
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Tactile Sensing • Simulation • Representation • Manipulation • RL/IL/VLA • Open Source
An open-source library for GPU-accelerated robot learning and sim-to-real transfer.
A generative world for general-purpose robotics & embodied AI learning.
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
[NeurIPS 2023] Scalable 3D Captioning with Pretrained Models
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."
A modern, highly customizable, responsive Jekyll theme for documentation with built-in search.
Python tools for working with the habitat-sim environment.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).
Unified framework for robot learning built on NVIDIA Isaac Sim
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.