-
Rutgers University
- New Jersey, USA
- scholar.google.com/citations?user=e3s9h8MAAAAJ
- https://mowenyii.github.io
- in/wenyi-mo-133abb313
Highlights
- Pro
Stars
SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning
FORTSearcher: Synthesizing Hard-to-Shortcut Search Tasks for Deep Search Agents
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
repo for paper https://arxiv.org/abs/2504.13837
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
A version of verl to support diverse tool use [TMLR 2026]
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)
slime is an LLM post-training framework for RL Scaling.
Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihan Wang*, Jiateng Liu, Yangyi Chen, Lifan Yuan, Hao Peng and …
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
🤗 smolagents: a barebones library for agents that think in code.
OpenSeeker: A search agent with open-source data and models
Train transformer language models with reinforcement learning.
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework