Skip to content
View Master7Sword's full-sized avatar
  • Sun Yat-Sen University
  • Guangzhou

Block or report Master7Sword

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"

Python 242 11 Updated Nov 6, 2025

GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Python 2,542 198 Updated Mar 10, 2026

GigaWorld-Policy: An Efficient Action-Centered World–Action Model

Python 1,286 100 Updated Apr 20, 2026

GigaWorld-0: World Models as Data Engine to Empower Embodied AI

Python 1,593 131 Updated Dec 3, 2025

The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)

Python 128 4 Updated Nov 15, 2025

LAP: Language-Action Pre-Training Enables Zero-Shot Cross Embodiment Transfer

Python 147 19 Updated May 20, 2026

[ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"

C++ 672 64 Updated Jun 10, 2026

ActionCodec: What Makes for Good Action Tokenizers

Python 50 1 Updated Mar 1, 2026

One framework to evaluate any VLA model on any robot simulation benchmark.

Python 373 34 Updated Jun 15, 2026

A opensource, torch-like api framework for dynamic multi-agent workflow.

Python 543 10 Updated May 30, 2026

[ICML 2026] LaST​$_0$​: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model

Python 78 6 Updated Apr 30, 2026

A test-time method for determining the execution horizon for flow-matching VLAs

Python 14 Updated Mar 17, 2026

Dexbotic: Open-Source Vision-Language-Action Toolbox

Python 1,219 173 Updated Jun 12, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 19,399 1,789 Updated Jan 30, 2026

RoboChallenge Inference example code

Python 146 9 Updated Jun 10, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,372 1,491 Updated May 19, 2026

Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"

Python 233 13 Updated May 30, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 1,483 106 Updated Jan 31, 2025

Tensor's VLA Training Infrastructure for Real-World Robotics in PyTorch

Python 173 23 Updated Jun 16, 2026

[ICLR 2026🔥] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Python 150 7 Updated May 19, 2026

Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models.

Python 347 27 Updated Jan 21, 2026

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 3,803 531 Updated Jun 16, 2026

Paper Survey for Visual Language Action

82 3 Updated Feb 3, 2026

[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI

2,083 142 Updated Jun 10, 2026

RoboTwin 2.0 Offical Repo

Python 2,451 400 Updated May 23, 2026

The code for controlling the Piper robotic arm

Python 343 76 Updated May 22, 2026

RealSense SDK

C++ 8,830 5,013 Updated Jun 16, 2026

Running VLA at 30Hz frame rate and 480Hz trajectory frequency

Python 575 41 Updated Feb 10, 2026
Next