ginreedcho

Follow

肖金苇 ginreedcho

Follow

a student at University of Chinese Academy of Sciences

1 follower · 2 following

Highlights

Pro

Stars

52 stars written in Python

DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 378 34 Updated Feb 22, 2025

microsoft / GUI-Actor

[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Python 351 40 Updated Oct 29, 2025

likaixin2000 / ScreenSpot-Pro-GUI-Grounding

GUI Grounding for Professional High-Resolution Computer Use

Python 277 30 Updated Oct 27, 2025

LeapLabTHU / ExpeL

Python 176 19 Updated Dec 20, 2024

xueleichen / PSNR-SSIM-UCIQE-UIQM-Python

Python code for several metrics: PSNR, SSIM, UCIQE and UIQM

Python 170 17 Updated Mar 5, 2023

KANABOON1 / MemGen

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python 155 13 Updated Nov 1, 2025

dvlab-research / ARPO

Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

Python 134 8 Updated May 29, 2025

showlab / WorldGUI

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Python 99 8 Updated Jul 27, 2025

MBZUAI-LLM / web2code

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 92 8 Updated Oct 23, 2024

OSU-NLP-Group / WebDreamer

[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"

Python 89 6 Updated Oct 5, 2025

YXB-NKU / SE-GUI

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 75 4 Updated Oct 21, 2025

InfiXAI / InfiGUI-R1

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 60 3 Updated May 23, 2025

harpreetsahota204 / gui_agent_research_landscape

An awesome repository that maps the current landscape of GUI/OS Agent research

Python 48 4 Updated Aug 18, 2025

amazon-science / AgentOccam

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Python 45 7 Updated Jan 28, 2025

alibaba / UI-Ins

Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Python 36 2 Updated Nov 6, 2025

liujiawei2333 / PSNR-SSIM-batch-image

This is a script for batch evaluation of psnr and ssim indicators of reconstructed images. It is suitable for image compression, image restoration, super-resolution reconstruction, image denoising …

Python 33 3 Updated Oct 23, 2019

WebChoreArena / WebChoreArena

Python 28 Updated Aug 17, 2025

penghao-wu / GUI_Reflection

Python 25 4 Updated Sep 19, 2025

SkyRiver-2000 / TRAD-Official

[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Python 20 2 Updated Mar 28, 2024

ChangWinde / PiCor

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 19 Updated Jul 26, 2025

meituan-longcat / R-HORIZON

R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Python 18 1 Updated Oct 21, 2025

SqueezeAILab / plan-and-act

[ICML 2025] Improving Planning of Agents for Long-Horizon Tasks

Python 11 Updated Oct 2, 2025