Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning

Overview

Note:

If you encounter any errors or issues, feel free to open an issue or submit a pull request.

Pentest-R1 is a two-stage reinforcement learning (RL) framework designed to substantially improve the reasoning capabilities of Large Language Models (LLMs) for autonomous penetration testing. It begins with offline RL on a curated dataset of over 500 real-world expert walkthroughs to instill core attack logic. Then, it applies online RL in an interactive Capture The Flag (CTF) environment, allowing the agent to learn robust error correction and adaptive strategies through direct environmental feedback.

Figure: The framework architecture of Pentest-R1, illustrating the two-stage training process.

Quick Start

Prerequisites

Ensure your environment meets the following requirements before proceeding:

Programming Language: Python 3.11.11
Containerization: Docker
Package Manager: Pip

Installation

Clone the Pentest-R1 repository:
```
git clone 
```
Navigate to the project directory:
```
cd Pentest-R1
```
Install the required Python dependencies:
```
pip install -r requirements.txt
```

Running the Training Framework

The training is divided into two main stages.

Stage 1: Offline Reinforcement Learning

This stage trains the base LLM on the curated dataset of expert walkthroughs to learn foundational penetration testing logic.

Run the following command to start Stage 1 training:

python grpo_stage1.py

Stage 2: Online Reinforcement Learning in Interactive Environments

This stage fine-tunes the agent from Stage 1 in a live, interactive CTF environment. This requires setting up the Intercode-CTF Docker environment first.

Build the CTF Environment: Navigate to the environment directory and build the Docker image.
```
cd train_ctf_env
docker build -t intercode-ctf .
cd .. 
```
Run Stage 2 Training: Once the environment is ready, start the online RL training. This script will interact with the Docker container to provide real-time feedback to the agent.
```
python grpo_multi_turn_stage2.py
```

Contact

If you have any questions or suggestions, please open an issue on GitHub. Contributions, discussions, and improvements are always welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
experiments/auto-pen-bench		experiments/auto-pen-bench
images		images
train_ctf_env		train_ctf_env
README.md		README.md
grpo_multi_turn_stage2.py		grpo_multi_turn_stage2.py
grpo_stage1.py		grpo_stage1.py
grpo_stage_1_mod.py		grpo_stage_1_mod.py
merge.py		merge.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning

Table of Contents

Overview

Quick Start

Prerequisites

Installation

Running the Training Framework

Stage 1: Offline Reinforcement Learning

Stage 2: Online Reinforcement Learning in Interactive Environments

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Pentest-R1: Towards Autonomous Penetration Testing Reasoning Optimized via Two-Stage Reinforcement Learning

Table of Contents

Overview

Quick Start

Prerequisites

Installation

Running the Training Framework

Stage 1: Offline Reinforcement Learning

Stage 2: Online Reinforcement Learning in Interactive Environments

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages