Xixi Wu WxxShirley

🤔

focus

157 followers · 53 following

The Chinese University of Hong Kong
Hong Kong
wxxshirley.github.io
@XixiWu1120

Achievements

x2 x2

Achievements

x2 x2

Lists (8)

Sort

Stars

WxxShirley / Agent-STAR

Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"

Python 11 Updated Mar 24, 2026

X-PLUG / MobileAgent

Mobile-Agent: The Powerful GUI Agent Family

Python 8,314 838 Updated Mar 26, 2026

UniPat-AI / UniScientist

UniScientist is designed to advance universal scientific research intelligence through a unified paradigm

Python 151 10 Updated Mar 14, 2026

Gen-Verse / Open-AgentRL

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

Python 412 49 Updated Feb 27, 2026

langfengQ / DrMAS

Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.

Python 124 8 Updated Feb 11, 2026

RLinf / RLinf

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,885 363 Updated Mar 26, 2026

Leey21 / awesome-ai-research-writing

Elevate your AI research writing, no more tedious polishing ✨

14,082 1,091 Updated Mar 25, 2026

THUDM / CaRR

This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".

Python 59 6 Updated Mar 14, 2026

imlrz / DeepResearch-Bench-II

DeepResearch Bench II (DRB2) is the follow-up to DeepResearch Bench, with a stronger focus on measuring the gap between deep research systems and human experts. It does so by decomposing expert-wri…

Python 37 2 Updated Feb 24, 2026

Alibaba-NLP / qqr

qqr is an RL training framework for open-ended agents.

Python 227 20 Updated Mar 25, 2026

UniPat-AI / BabyVision

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 203 7 Updated Jan 13, 2026

stepfun-ai / PaCoRe

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 334 14 Updated Feb 5, 2026

lukahhcm / Awesome_Environment_Scaling

Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.

64 3 Updated Jan 28, 2026