StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Implementation for the paper "StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction".

In this work, we present Strategic Trajectory Abstraction (StraTA), a simple framework that introduces an explicit trajectory-level strategy into agentic reinforcement learning (RL). StraTA samples a compact strategy from the initial task state, conditions subsequent actions on that strategy, and trains strategy generation and action execution jointly with a hierarchical GRPO-style rollout design, further enhanced by diverse strategy rollout and critical self-judgment.
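To make the two-level rollout idea concrete, here is a minimal sketch of how group-relative advantages could be computed at both the strategy level and the action level. It is an illustration under stated assumptions, not the repository's actual training code: the function names `group_normalize` and `hierarchical_advantages`, the use of the mean rollout return as a strategy score, and the toy returns are all hypothetical.

```python
from statistics import mean, pstdev


def group_normalize(values, eps=1e-6):
    """GRPO-style normalization: subtract the group mean, divide by the group std."""
    mu = mean(values)
    sigma = pstdev(values)
    return [(v - mu) / (sigma + eps) for v in values]


def hierarchical_advantages(returns_by_strategy):
    """Compute strategy-level and action-level advantages (hypothetical helper).

    returns_by_strategy[i][j] is the episode return of rollout j that was
    conditioned on sampled strategy i.
    """
    # Strategy level (assumption): score each strategy by the mean return of
    # its rollouts, then normalize across the sampled strategies.
    strategy_scores = [mean(r) for r in returns_by_strategy]
    strategy_adv = group_normalize(strategy_scores)
    # Action level (assumption): normalize each rollout's return only against
    # the other rollouts that share the same strategy.
    action_adv = [group_normalize(r) for r in returns_by_strategy]
    return strategy_adv, action_adv


# Toy example: two strategies, three rollouts each.
returns = [[1.0, 0.0, 1.0], [0.0, 0.0, 0.0]]
s_adv, a_adv = hierarchical_advantages(returns)
```

In this toy case the first strategy receives a positive advantage and the second a negative one, while the rollouts under the second strategy all receive zero action-level advantage because they are indistinguishable within their group.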

📰 News

[2026/05/04] The preprint of our paper is available on arXiv, with code released on GitHub.

⚙️ Setup

Our implementation is based on the rLLM framework. Follow the tutorial to set up the framework.

🚀 Usage

Our implementation supports three environments based on the AgentGym codebase. First set up the environment and prepare the dataset. Refer to the following tutorials for usage:

ALFWorld: a text-based embodied household environment.
WebShop: a web-based online shopping environment.
SciWorld: a text-based scientific experimentation environment.

🤗 Models

The model checkpoints are available on HuggingFace. You can download them for evaluation.

📜 Citation

Please consider citing our paper if you find it helpful:

@article{xue2026strata,
  title={StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction},
  author={Xue, Xiangyuan and Zhou, Yifan and Wang, Zidong and Tang, Shengji and Torr, Philip and Ouyang, Wanli and Bai, Lei and Yin, Zhenfei},
  journal={arXiv preprint arXiv:2605.06642},
  year={2026}
}
