Skip to content
View ginreedcho's full-sized avatar

Highlights

  • Pro

Block or report ginreedcho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
52 stars written in Python
Clear filter

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 378 34 Updated Feb 22, 2025

[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Python 351 40 Updated Oct 29, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 277 30 Updated Oct 27, 2025
Python 176 19 Updated Dec 20, 2024

Python code for several metrics: PSNR, SSIM, UCIQE and UIQM

Python 170 17 Updated Mar 5, 2023

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python 155 13 Updated Nov 1, 2025

Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

Python 134 8 Updated May 29, 2025

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Python 99 8 Updated Jul 27, 2025

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 92 8 Updated Oct 23, 2024

[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"

Python 89 6 Updated Oct 5, 2025

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 75 4 Updated Oct 21, 2025

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 60 3 Updated May 23, 2025

An awesome repository that maps the current landscape of GUI/OS Agent research

Python 48 4 Updated Aug 18, 2025

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

Python 45 7 Updated Jan 28, 2025

Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Python 36 2 Updated Nov 6, 2025

This is a script for batch evaluation of psnr and ssim indicators of reconstructed images. It is suitable for image compression, image restoration, super-resolution reconstruction, image denoising …

Python 33 3 Updated Oct 23, 2019
Python 28 Updated Aug 17, 2025
Python 25 4 Updated Sep 19, 2025

[SIGIR 2024] TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Python 20 2 Updated Mar 28, 2024

[AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".

Python 19 Updated Jul 26, 2025

R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Python 18 1 Updated Oct 21, 2025

[ICML 2025] Improving Planning of Agents for Long-Horizon Tasks

Python 11 Updated Oct 2, 2025