Our Paper: https://arxiv.org/abs/2506.11986
conda create -n schema-R1 python=3.11 && conda activate schema-R1 && pip install --upgrade pip

Next, install vLLM and FlashAttention:
pip install vllm==0.8.4
pip install setuptools && pip install flash-attn --no-build-isolation
# Recommended: manually download a prebuilt wheel from [flash-attn releases](https://github.com/Dao-AILab/flash-attention/releases) and install it
pip install swanlab==0.5.7

Tip: if vLLM fails to start, check the transformers version:
pip install transformers==4.51.3
pip install trl==0.18.0

Then change into the Schema-R1 root directory.
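Since the vLLM startup tip above hinges on exact package versions, a quick sanity check of the pins can catch mismatches before training. This is a minimal sketch: the version pins are taken from the install commands above, and `check_pins` is a hypothetical helper, not part of the repo.

```python
from importlib.metadata import PackageNotFoundError, version

# Version pins from the install steps above (flash-attn only needs to be present)
PINS = {"vllm": "0.8.4", "flash-attn": None, "swanlab": "0.5.7",
        "transformers": "4.51.3", "trl": "0.18.0"}

def check_pins(pins):
    """Return (package, installed, expected) for every pin that does not match.

    A pin of None only requires the package to be installed at all.
    """
    mismatches = []
    for pkg, expected in pins.items():
        try:
            installed = version(pkg)
        except PackageNotFoundError:
            installed = None
        if installed is None or (expected is not None and installed != expected):
            mismatches.append((pkg, installed, expected))
    return mismatches

if __name__ == "__main__":
    for pkg, installed, expected in check_pins(PINS):
        print(f"{pkg}: installed {installed!r}, expected {expected!r}")
```

Running it before `trl vllm-serve` prints one line per missing or mismatched package and nothing when the environment matches the pins.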
Step 1: Cold start:
# Run the cold-start SFT notebook, e.g. headlessly via nbconvert:
jupyter nbconvert --to notebook --execute src/SFT/grpo_SFT_spider_COT.ipynb
# SFT_reasoning_data.csv is generated by DeepSeek-R1

Step 2: GRPO training:
# Recommended: 3×A100 40G for Qwen2.5-0.5B and 3×A100 80G for Qwen2.5-1.5B
# Start the vLLM server
CUDA_VISIBLE_DEVICES=0 trl vllm-serve --model src/SFT_model
# Training
CUDA_VISIBLE_DEVICES=1,2 ACCELERATE_LOG_LEVEL=info \
accelerate launch --config_file test_order/zero2.yaml --num_processes 2 \
src/open_r1/grpo_schema.py --config test_order/config_schema_muti.yaml

All evaluation scripts are under Schema-R1/src/eval/
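GRPO training as configured above depends on task-specific reward functions. As an illustrative sketch only (the function names and reward scheme here are assumptions for exposition, not the repo's actual rewards in src/open_r1/), a schema-linking reward might combine an R1-style format check with set-level F1 over the predicted schema items:

```python
import re

def format_reward(completion: str) -> float:
    # R1-style format: reasoning inside <think>...</think> followed by an answer
    return 1.0 if re.search(r"<think>.*</think>\s*\S", completion, re.DOTALL) else 0.0

def schema_f1_reward(predicted: set, gold: set) -> float:
    # Set-level F1 between predicted and gold schema items (tables/columns)
    if not predicted or not gold:
        return 0.0
    tp = len(predicted & gold)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(gold)
    return 2 * precision * recall / (precision + recall)
```

A combined reward would typically be a weighted sum of the two, so the policy is pushed both to keep the reasoning format and to name the right tables and columns.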