Model Fusion via Optimal Transport

Requirements

Install the Python Optimal Transport Library

pip install POT

Other than that, we also need PyTorch v1 or higher and NumPy. (Also, Python 3.6 +)

Before running, unzip the respective pretrained model zip file. Also, you need to unzip the cifar.zip file for some imports to work.

Sample commands of one-shot model fusion

For MNIST + MLPNet

python main.py --gpu-id 1 --model-name mlpnet --n-epochs 10 --save-result-file sample.csv \
--sweep-name exp_sample --exact --correction --ground-metric euclidean --weight-stats \
--activation-histograms --activation-mode raw --geom-ensemble-type acts --sweep-id 21 \
--act-num-samples 200 --ground-metric-normalize none --activation-seed 21 \
--prelu-acts --recheck-acc --load-models ./mnist_models --ckpt-type final \
--past-correction --not-squared --dist-normalize --print-distances --to-download

For CIFAR10 + VGG11

python main.py --gpu-id 1 --model-name vgg11_nobias --n-epochs 300 --save-result-file sample.csv \
--sweep-name exp_sample --correction --ground-metric euclidean --weight-stats \
--geom-ensemble-type wts --ground-metric-normalize none --sweep-id 90 --load-models ./cifar_models/ \
--ckpt-type best --dataset Cifar10 --ground-metric-eff --recheck-cifar --activation-seed 21 \
--prelu-acts --past-correction --not-squared --normalize-wts --exact

We also recommend that users play around with some of options or hyper-parameters above, as the commands listed here are not highly tuned. For instance, getting rid of the --normalize-wts flag and running the below command instead, results in a test accuracy of 86.51% instead of 85.98% on CIFAR10.

python main.py --gpu-id 1 --model-name vgg11_nobias --n-epochs 300 --save-result-file sample.csv \
--sweep-name exp_sample --correction --ground-metric euclidean --weight-stats \
--geom-ensemble-type wts --ground-metric-normalize none --sweep-id 90 --load-models ./cifar_models/ \
--ckpt-type best --dataset Cifar10 --ground-metric-eff --recheck-cifar --activation-seed 21 \
--prelu-acts --past-correction --not-squared --exact

For CIFAR10 + ResNet18

python main.py --gpu-id 1 --model-name resnet18_nobias_nobn --n-epochs 300 --save-result-file sample.csv \
--sweep-name exp_sample --exact --correction --ground-metric euclidean --weight-stats \
--activation-histograms --activation-mode raw --geom-ensemble-type acts --sweep-id 21 \
--act-num-samples 200 --ground-metric-normalize none --activation-seed 21 --prelu-acts --recheck-acc \
--load-models ./resnet_models/ --ckpt-type best --past-correction --not-squared  --dataset Cifar10 \
--handle-skips

The code and pretrained models correspond to the paper: Model Fusion via Optimal Transport. If you use any of the code or pretrained models for your research, please consider citing the paper as.

@article{singh2020model,
  title={Model fusion via optimal transport},
  author={Singh, Sidak Pal and Jaggi, Martin},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
baseline.py		baseline.py
check_accuracy.py		check_accuracy.py
cifar.zip		cifar.zip
cifar_models.zip		cifar_models.zip
compute_activations.py		compute_activations.py
data.py		data.py
distillation_big_only.py		distillation_big_only.py
ensemble_cifar_models.py		ensemble_cifar_models.py
fusion_camera_ready.png		fusion_camera_ready.png
fusion_camera_ready_compressed.png		fusion_camera_ready_compressed.png
ground_metric.py		ground_metric.py
main.py		main.py
mnist.py		mnist.py
mnist_models.zip		mnist_models.zip
model.py		model.py
parameters.py		parameters.py
partition.py		partition.py
resnet_models.zip		resnet_models.zip
routines.py		routines.py
split_main.py		split_main.py
train_cifar_models.py		train_cifar_models.py
utils.py		utils.py
wasserstein_ensemble.py		wasserstein_ensemble.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model Fusion via Optimal Transport

Requirements

Sample commands of one-shot model fusion

For MNIST + MLPNet

For CIFAR10 + VGG11

For CIFAR10 + ResNet18

About

Releases

Packages

Contributors 2

Languages

sidak/otfusion

Folders and files

Latest commit

History

Repository files navigation

Model Fusion via Optimal Transport

Requirements

Sample commands of one-shot model fusion

For MNIST + MLPNet

For CIFAR10 + VGG11

For CIFAR10 + ResNet18

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages