Self-Alignment Optimization for Language Models

This repository contains the official implementation of Self-Alignment Optimization for Language Models (SAO). The project provides an end-to-end training pipeline that integrates advanced fine-tuning techniques for optimizing language model responses through self-alignment.

Setup and Installation

To set up the environment and install all dependencies, follow these steps:

Clone the repository:

git clone https://github.com/SJY8460/SAO.git
cd SAO

Create a conda environment:

Ensure you have Conda installed. Then, create a new environment with Python 3.8:
```
conda create -n sao_env python=3.8
conda activate sao_env
```
Install dependencies:

Essential dependencies are listed in requirements.txt. Install them using:
```
pip install -r requirements.txt
```

Dataset Generation Pipeline

The pipeline involves three major steps: prompt synthesis, response generation, and response scoring. Each step is implemented in a separate Python script.

Step 1: Generate Prompts

The first step involves synthesizing prompts based on a given template.

python code/vllm_synthesize_prompt.py --model_path <model_path> --template <template> --sample_size <sample_size> --output_path <output_path>

--model_path: Path to the model.
--template: Prompt template to be used.
--sample_size: Number of samples to use. Set to -1 to use the full dataset.
--output_path: Path to save the generated prompts.

Step 2: Generate Responses

Next, generate responses for the synthesized prompts using the model.

python code/vllm_synthesize_response.py --model_id <model_id> --input_file <input_file> --output_file <output_file> --num_responses 2

--model_id: Model identifier.
--input_file: Path to the file containing generated prompts.
--output_file: Path to save the synthesized responses.
--num_responses: Number of responses to generate per prompt.

Step 3: Score Responses

The final step is to score the generated responses using a specified prompt template.

python code/vllm_synthesize_score.py --model_id <model_id> --input_file_path <input_file_path> --output_dir <output_dir> --prompt_template_path <prompt_template_path>

--model_id: Model identifier.
--input_file_path: Path to the file containing the synthesized responses.
--output_dir: Directory to save the output scores.
--prompt_template_path: Path to the prompt template file used for scoring.

Run the Full Pipeline

You can directly utilize the provided shell script to run the entire end-to-end pipeline:

bash vllm_sao.sh

This script sequentially performs the three steps described above for each combination of model and template.

Output

The output files will be saved in the specified output_dir. Each model and template combination will have its own directory containing:

vllm_<template>_synthesis_prompt.jsonl: Contains the synthesized prompts.
vllm_<template>_synthesis_response.jsonl: Contains the generated responses.
vllm_<template>_synthesis_score.jsonl: Contains the scores for the generated responses.

Important Notes

Ensure that the output_dir specified in the script exists; otherwise, the script will create it for you.
If an output file already exists, the script will skip the corresponding process to save time.

Training

We utilize the LLaMA-Factory framework to fine-tune our model. Detailed configurations can be found in the train_config/ directory.

Evaluation

AlpacaEval 2.0 Benchmark

We utilize the alpaca_eval repository for evaluation. Local evaluator configurations, model configurations, and examples of run scripts are provided in the eval_config/alpaca_eval/ directory.

Arena-Hard

We utilize the arena-hard-auto repository for evaluation. Some configuration files and run scripts are listed in the eval_config/arena-hard/ directory.

MT-Bench

We utilize the FastChat repository for evaluation.

Downstream Tasks

We utilize the lm-evaluation-harness for evaluating downstream tasks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-Alignment Optimization for Language Models

Setup and Installation

Dataset Generation Pipeline

Step 1: Generate Prompts

Step 2: Generate Responses

Step 3: Score Responses

Run the Full Pipeline

Output

Important Notes

Training

Evaluation

AlpacaEval 2.0 Benchmark

Arena-Hard

MT-Bench

Downstream Tasks

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
code		code
data		data
eval_config		eval_config
train_config		train_config
README.md		README.md
requirements.txt		requirements.txt
vllm_sao.sh		vllm_sao.sh

Folders and files

Latest commit

History

Repository files navigation

Self-Alignment Optimization for Language Models

Setup and Installation

Dataset Generation Pipeline

Step 1: Generate Prompts

Step 2: Generate Responses

Step 3: Score Responses

Run the Full Pipeline

Output

Important Notes

Training

Evaluation

AlpacaEval 2.0 Benchmark

Arena-Hard

MT-Bench

Downstream Tasks

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages