Gikiman
  • Tsinghua University
  • Beijing


verl-agent is an extension of veRL designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 1,847 175 Updated Feb 27, 2026

Agent S: an open agentic framework that uses computers like a human

Python 10,925 1,272 Updated Feb 21, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 7,657 722 Updated Apr 25, 2026

Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞

Python 11,746 1,354 Updated Apr 23, 2026

Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"

Python 41 5 Updated Apr 25, 2026

Memory_Driven_GUI_Agent

Python 8 1 Updated Apr 13, 2026

A curated collection of resources, tools, and frameworks for developing GUI Agents.

409 28 Updated Apr 16, 2026

Large language model review prompts

503 44 Updated Mar 19, 2026

[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding

Python 307 10 Updated Apr 15, 2026

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

Python 473 50 Updated Feb 27, 2026

Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

Python 158 11 Updated May 29, 2025

DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation

Python 88 6 Updated Feb 26, 2026

Solve Visual Understanding with Reinforced VLMs

Python 5,947 377 Updated Mar 12, 2026

[NeurIPS 2025] "Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning"

Python 102 6 Updated Oct 21, 2025

[AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning"

Python 152 12 Updated Nov 24, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 8,577 868 Updated Apr 14, 2026

[AAAI 2026] Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for Generalized GUI Agents

HTML 99 8 Updated Dec 1, 2025

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Python 117 9 Updated Jul 17, 2025

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 394 35 Updated Feb 22, 2025

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 243 19 Updated May 5, 2025

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 712 71 Updated Feb 15, 2026

Elevate your AI research writing, no more tedious polishing ✨

20,020 1,600 Updated Mar 25, 2026
Python 80 Updated Oct 14, 2025

The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"

Python 199 18 Updated Dec 25, 2025
Python 309 20 Updated Jan 3, 2026

Edit Banana: A framework for converting statistical formats into editable.

Python 5,089 338 Updated Apr 25, 2026

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.

Python 137 3 Updated Feb 10, 2026

GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.

Python 895 69 Updated Mar 20, 2026

Agent Skills as a Memory Layer

JavaScript 3,349 314 Updated Apr 21, 2026

A curated list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

830 65 Updated Mar 3, 2026