piaofu110

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Python 384 26 Updated May 2, 2026
Python 137 18 Updated May 4, 2026

ACMMM2024: Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement

Python 137 7 Updated May 20, 2025

CVPR 2025 | Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond

Python 108 8 Updated Nov 25, 2025

Repository for "Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines"

Python 225 23 Updated Jun 1, 2026

CVPR 2025 - V2X-R: Cooperative LiDAR-4D Radar Fusion for 3D Object Detection with Denoising Diffusion

Python 160 12 Updated Nov 26, 2025

AAAI2025 Oral - L4DR: LiDAR-4DRadar Fusion for Weather-Robust 3D Object Detection

Python 74 13 Updated Mar 10, 2026

ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration

Python 33,486 4,173 Updated May 27, 2026

Real-Time VLAs via Future-state-aware Asynchronous Inference.

Python 417 32 Updated Apr 22, 2026

siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems

Python 360 27 Updated Jan 30, 2026

[CVPR'2026] "MM-ACT: Learn from Multimodal Parallel Generation to Act"

Python 108 5 Updated Mar 13, 2026

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 416 24 Updated Feb 11, 2026

[Paper][EMNLP 2025] RTQA: Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models

Python 17 1 Updated Jan 29, 2026

MrlX: A Multi-Agent Reinforcement Learning Framework

Python 211 12 Updated Jan 19, 2026

[ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs

Python 52 3 Updated Apr 17, 2026

NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards

Python 106 9 Updated Jan 11, 2026

Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"

Python 211 8 Updated Nov 13, 2024

Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"

Python 233 13 Updated May 30, 2025

[ICCV 2025] VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving

10 2 Updated Jul 21, 2025

Joycon-Robotics: Low-Cost, Convenient Teleoperation for One- and Two-Arm Robots

Jupyter Notebook 231 31 Updated Apr 12, 2026

U-Arm: Lerobot-Everything-Cross-Embodiment-Teleoperation

Python 272 30 Updated May 11, 2026

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment

Python 295 31 Updated May 12, 2026

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving (ICLR 2026)

Python 377 29 Updated Feb 11, 2026

[ICLR 2026] Unified Vision-Language-Action Model

Python 306 22 Updated Oct 15, 2025

🔥 The first open-sourced diffusion vision-language-action model. [ICLR 2026]

Python 181 9 Updated Mar 12, 2026

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Python 351 13 Updated Oct 3, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,863 112 Updated Mar 18, 2025

[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Python 1,732 113 Updated Jan 6, 2026