Lists (1)
Sort Name ascending (A-Z)
Starred repositories
[CVPR 2025] Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
Official code of Motus: A Unified Latent Action World Model
Pydantic media reference for images and video frames (with timestamp support) from data URIs, HTTP URLs, file URIs, and local paths. Features lazy loading and optimized batch video decoding.
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Source Code for MIRROR: Visual Motion Imitation via Real-time Retargeting and Teleoperation on Humanoid Robots with Parallel Differential Inverse Kinematics
[CoRL 2025] TWIST: Teleoperated Whole-Body Imitation System
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
[ICRA 2026] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.
Windows kernel-mode driver emulating well-known USB game controllers.
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]
[CVPR 2026] Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
Solve Visual Understanding with Reinforced VLMs
StableWorld: Towards Stable and Consistent Long Interactive Video Generation
Official implementation of Learning Athletic Humanoid Tennis Skills from Imperfect Human Motion Data
A Vision-Language Model for Spatial Affordance Prediction in Robotics
[CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.
Official implementation of OpenTrack.
Code for kai0, including training, inference and data collection.
We release Evo-RL, the opensource real-world offline RL on So-101 and AgileX PiPER for easier reproduction.
The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.