My Project

This project follows the tutorial and uses the code from the posteriors Python library found here. I additionally tested the model on the BoolQ dataset. The results and the report can be found in the project paper here.

Bayesian Ensemble Language Model

We build a last-layer ensemble on top of Llama3 to perform uncertainty quantification. We fine-tune all the weights in the model's last attention layer and obtain a distribution over distributions by collecting 10 copies of those weights from 10 different training trajectories.
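To make "a distribution over distributions" concrete, here is a minimal sketch of one common way to turn the per-member predictive distributions of an ensemble into uncertainty scores. It is an illustration under stated assumptions, not the code used in this repository: the function names `entropy` and `decompose_uncertainty` are hypothetical, and the entropy-based split into total, expected (data) and disagreement (model) uncertainty is a standard choice that the project's own metrics may or may not follow.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of one discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def decompose_uncertainty(member_probs):
    """Split ensemble uncertainty into total, expected and disagreement parts.

    member_probs: list of distributions, one per ensemble member, each a
    list of class (or token) probabilities summing to 1. Hypothetical helper.
    """
    n_members = len(member_probs)
    n_classes = len(member_probs[0])
    # Average the members to get the ensemble's predictive distribution.
    mean_probs = [
        sum(member[c] for member in member_probs) / n_members
        for c in range(n_classes)
    ]
    total = entropy(mean_probs)  # entropy of the averaged prediction
    expected = sum(entropy(m) for m in member_probs) / n_members  # mean member entropy
    disagreement = total - expected  # mutual information between prediction and member
    return total, expected, disagreement


# Ten members that agree on a coin flip: uncertain, but no disagreement.
print(decompose_uncertainty([[0.5, 0.5]] * 10))
# Five confident "yes" members and five confident "no" members: pure disagreement.
print(decompose_uncertainty([[1.0, 0.0]] * 5 + [[0.0, 1.0]] * 5))
```

In both calls the total uncertainty is the same; only the second case attributes it to disagreement between members, which is the signal an ensemble adds over a single model.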

Installation

Download the TQA dataset (https://allenai.org/data/tqa):

wget https://ai2-public-datasets.s3.amazonaws.com/tqa/tqa_train_val_test.zip && unzip -q tqa_train_val_test.zip
Download the HuggingFace Llama3 weights here: https://huggingface.co/meta-llama/Meta-Llama-3-8B.

Running the Code

Training Instructions

Make sure you edit the config you use so that its paths are correct (e.g., you may need to change the dataset path value).

For training, run the following:

python train_ensemble.py --base configs/training/ensemble_bayes.yaml --devices 0,

If you have additional GPUs you can add them by specifying 0,1,2,3,... after the --devices flag.
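The trailing comma in `--devices 0,` suggests the value is read as a comma-separated list of GPU indices rather than as a device count. How train_ensemble.py actually parses the flag is not shown here (it may be handled by the training framework), so the following is only a sketch of that interpretation, with `parse_devices` as a hypothetical helper:

```python
def parse_devices(spec):
    """Turn a string such as "0," or "0,1,2,3" into a list of GPU indices.

    Hypothetical helper: empty fields (like the one after a trailing
    comma) are ignored, so "0," selects only GPU 0.
    """
    return [int(part) for part in spec.split(",") if part.strip()]


print(parse_devices("0,"))       # single GPU with index 0
print(parse_devices("0,1,2,3"))  # four GPUs
```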

Evaluation Instructions

Ensure that checkpoints_folder in your config points to a directory containing all the ensemble weights you would like to load.

To evaluate, run the following:

python run_eval.py --base configs/evaluation/eval_bayes_ensemble.yaml --output ensemble.pkl

To recreate plots and get metrics, run python plot.py.

Details

The training code does not need separate Llama3 model code; it works out of the box. For inference, however, we made modifications that you can find in llama3/bayesllama.py.

The statements referred to in the paper are found in llama3/data/statements.py.

