Lists (27)
3DGS
Agent
AIGC
Animation
Calibration
Concept
DIBR
DigitalHuman
Fusion
GPT
ImageTask2D
Library
LLM
LocoManip
MeshProcess
NERF
ObjectGeneration
Reconstruction
Render
Robot
SceneGen
Survey
Tools
VideoGen
VideoInterpolation
VLA
WorldModel
Starred repositories
Code for kai0, including training, inference and data collection.
[IROS 2025] Generalizable Humanoid Manipulation with 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
[CoRL 2025] TWIST: Teleoperated Whole-Body Imitation System
Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0
Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".
[L4DC 2026] "FALCON: Learning Force-Adaptive Humanoid Loco-Manipulation"
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Tensor's VLA Training Infrastructure for Real-World Robotics in PyTorch
[arXiv 2025] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System
HoloMotion: A Foundation Model for Whole-Body Humanoid Control
A Paper List for Humanoid Robot Learning.
[ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control
Software stack for loco-manipulation experiments across multiple humanoid platforms, with primary support for the Unitree G1. This repository provides whole-body control policies, a teleoperation s…
Builder and index for PyTorch packages
[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos
Spirit-v1.5: A Robotic Foundation Model by Spirit AI
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
A unified inference and post-training framework for accelerated video generation.
[CVPR 2025] From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Official repo for vidar and vidarc: video foundation model for robotics.
Let's make video diffusion practical!
Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
NVIDIA GEAR Lab's initiative to solve the robotics data problem using world models
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
Wan: Open and Advanced Large-Scale Video Generative Models
Official code implementation of "Mitty: Diffusion-based Human-to-Robot Video Generation"