Skip to content
View ginreedcho's full-sized avatar

Highlights

  • Pro

Block or report ginreedcho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Python 35 2 Updated Oct 29, 2025

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 388 75 Updated Nov 5, 2025

R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Python 17 1 Updated Oct 21, 2025

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python 153 12 Updated Nov 1, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,982 1,259 Updated Oct 27, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,925 1,291 Updated Nov 3, 2025

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 769 55 Updated Jul 31, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

1,988 111 Updated Nov 5, 2025

[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

Python 427 84 Updated Sep 6, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,120 11,042 Updated Nov 5, 2025

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Python 1,776 122 Updated Jul 10, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,134 2,428 Updated Nov 5, 2025
JavaScript 12 2 Updated Aug 18, 2025

[ICML 2025] Improving Planning of Agents for Long-Horizon Tasks

Python 11 Updated Oct 2, 2025

[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"

Python 89 6 Updated Oct 5, 2025

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 378 34 Updated Feb 22, 2025

The model, data and code for the visual GUI Agent SeeClick

HTML 433 24 Updated Jul 13, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 6,177 617 Updated Oct 31, 2025

Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

Python 134 8 Updated May 29, 2025

An awesome repository that maps the current landscape of GUI/OS Agent research

Python 48 4 Updated Aug 18, 2025

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,705 1,092 Updated Apr 30, 2025

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 22,034 2,642 Updated Jun 12, 2025

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

Jupyter Notebook 4,675 460 Updated Oct 13, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 21,706 2,541 Updated Oct 19, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 20,966 1,851 Updated Oct 17, 2025

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 92 8 Updated Oct 23, 2024
Python 8,655 514 Updated Oct 9, 2024

[NeurIPS 2025]"Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 75 4 Updated Oct 21, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 277 30 Updated Oct 27, 2025
Next