Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 873 95 Updated Apr 13, 2026

Official implementation of VLAA-GUI series

Python 31 1 Updated Apr 27, 2026

A Universal Platform for Training and Evaluation of Mobile Interaction

Python 63 6 Updated Sep 24, 2025

Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

Python 32 Updated Dec 6, 2024
Python 305 56 Updated May 27, 2026
11 Updated Feb 7, 2026

[NeurIPS 2025 Spotlight] OpenCUA: Open Foundations for Computer-Use Agents

Python 784 103 Updated May 25, 2026

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,950 204 Updated May 21, 2025

The official repository of Qwen-VLA

614 24 Updated May 29, 2026
Python 120 9 Updated Jun 17, 2026

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 36,862 3,709 Updated Jun 18, 2026

Ideogram 4: Open image model at the forefront of design

Python 2,132 212 Updated Jun 4, 2026

A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

Python 12,564 1,426 Updated Aug 20, 2024

Native and Compact Structured Latents for 3D Generation

Python 8,400 1,024 Updated Jun 5, 2026

Benchmarking Agentic Procedural 3D Modeling Via Code

Python 47 3 Updated Jun 2, 2026

[CVPR 2026] VGGDrive: Empowering Vision-Language Models with Cross-View Geometric Grounding for Autonomous Driving

Python 99 5 Updated May 8, 2026

Code for "Improving Robotic Manipulation with Efficient Geometry-Aware Vision Encoder"

Python 26 Updated Oct 14, 2025
Python 69 6 Updated May 21, 2026
Jupyter Notebook 26 6 Updated Feb 12, 2026

A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models

Python 609 10 Updated Jun 15, 2026

A curated collection of resources, tools, and frameworks for developing GUI Agents.

433 28 Updated Jun 2, 2026

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos (ICML 2026)

Python 48 Updated May 4, 2026

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,842 1,109 Updated Nov 1, 2024

Scalable pipeline for synthesizing verifiable RLVR training data for computer-use agents

Python 159 11 Updated May 26, 2026