-
Huazhong University of Science and Technology
- Wuhan
-
18:30
(UTC -12:00)
Stars
PyTorch code and models for VJEPA2 self-supervised learning from video.
OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation
[RSS 2026] HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model
CoRL 2025 TA-VLA: Elucidating the Design Space of Torque-aware Vision-Language-Action Models
[RSS 2025] PIN-WM : Learning Physics-INformed World Models for Non-Prehensile Manipulation
Bridging Large Vision-Language Models and End-to-End Autonomous Driving
cursi36 / robointer
Forked from InternRobotics/RoboInter[ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
利用HuggingFace的官方下载工具从镜像网站进行高速下载。
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos (ICML 2026)
[ICRA 2026] VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
Codebase for PLATO: Planning with LLMs and Affordances for Tool Use
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
[CVPR 2026] UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos
[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[ICLR 2026] RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation
Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.
Official code for "One-Shot Manipulation Strategy Learning by Making Contact Analogies".
moojink / openvla-oft
Forked from openvla/openvlaFine-Tuning Vision-Language-Action Models: Optimizing Speed and Success
Benchmarking Knowledge Transfer in Lifelong Robot Learning