Stars
(Netflix 2025) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
Open-source code for the paper "Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions"
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowseComp and xBench.
[IEEE Intelligent Systems] Awesome-Graph-augmented-LLM-Agent (GLA)
Democratizing Reinforcement Learning for LLMs
A MemAgent framework that extrapolates to 3.5M-token contexts, along with a framework for RL training of any agent workflow.
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
[arXiv 2025] Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Code, data, and model for the paper "Learning from Peers in Reasoning Models"
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
[ICML 2025] Official implementation for paper "A Comprehensive Analysis on LLM-based Node Classification Algorithms"
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".
Open replication of DeepSeek R1 for text-to-graph extraction.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.