RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,398 262 Updated Feb 6, 2026

Elevate your AI research writing, no more tedious polishing ✨

3,645 299 Updated Feb 4, 2026

This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards".

Python 54 4 Updated Jan 12, 2026

DeepResearch Bench II

Python 14 Updated Feb 2, 2026

qqr is an RL training framework for open-ended agents.

Python 205 19 Updated Jan 21, 2026

We introduce BabyVision, a benchmark revealing the infancy of AI vision.

Python 175 6 Updated Jan 13, 2026

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 310 9 Updated Feb 5, 2026

Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to agent intelligence.

59 2 Updated Jan 28, 2026

Develop review and rebuttal agents for the OpenReview website

Python 6 Updated Dec 15, 2025

Public quant internship repository, maintained by NUFT but available for everyone.

OCaml 1,921 134 Updated Oct 19, 2025

[ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Python 44 Updated Jul 12, 2025

Official code for our paper: "SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models".

Python 6 1 Updated Nov 3, 2025

(ICLR'26 + Netflix) Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning

Python 36 4 Updated Nov 17, 2025

[ICLR 2026] VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Python 82 9 Updated Jan 31, 2026

Open source code for ICLR 2026 Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Python 219 33 Updated Jan 27, 2026

Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

Python 20 1 Updated Oct 8, 2025
Python 239 18 Updated Jan 3, 2026

The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"

Python 101 2 Updated Sep 29, 2025
Python 1,388 124 Updated Sep 12, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 551 35 Updated Nov 26, 2025

MiroFlow is an agent framework that enables tool-use agent tasks, featuring a reproducible GAIA score of 82.4%.

Python 2,428 248 Updated Jan 30, 2026

[IEEE Intelligent Systems] Awesome-Graph-augmented-LLM-Agent (GLA)

62 3 Updated Nov 17, 2025

The official code of ARPO & AEPO

Python 879 41 Updated Jan 28, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,078 500 Updated Feb 5, 2026

A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.

Python 881 58 Updated Jul 31, 2025

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25

Python 88 8 Updated Jun 16, 2025

[NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning

Python 115 13 Updated Dec 30, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 19,024 2,217 Updated Feb 6, 2026

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,832 3,054 Updated Jan 8, 2026