Official PyTorch codebase for Principal Masked Auto-Encoders (PMAE) presented in From Pixels to Components: Eigenvector Masking for Visual Representation Learning [arXiv].
PMAE introduces an alternative to pixel masking for visual representation learning: instead of masking patches of pixels, it masks principal components. This repository builds on the Masked Auto-Encoder (MAE, [arXiv]), a prominent baseline for Masked Image Modelling (MIM), and replaces the masking of pixel patches with the masking of principal components.
.
├── assets # assets for the README file
├── configs # directory in which all experiment '.yaml' configs are stored
├── scripts # bash scripts to launch training and evaluation
│ ├── train.sh # training script
│ └── eval.sh # evaluation script
├── src # the package
│ ├── plotting.py # plotting functions for tracking training
│ ├── utils.py # helper functions for model/optimizer initialization & checkpoint loading
│ ├── dataset # datasets, data loaders, ...
│ └── model # models, training loops, ...
├── tools # scripts to compute PCA prior to training
├── main.py # entrypoint for launching PMAE pretraining locally on your machine
└── requirements.txt # requirements file
Config files: Note that all experiment parameters are specified in config files (as opposed to command-line arguments). See the config/ directory for example config files.
In your environment of choice, install the necessary requirements:
pip install -r requirements.txt
Alternatively, install individual packages as follows:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install pandas numpy pillow scikit-learn scikit-image plotly kaleido matplotlib submitit hydra-core pytorch-lightning imageio medmnist wandb transformers
Create a config file that suits your machine:
cd ./config/user
cp abizeul_biomed.yaml myusername_mymachine.yaml
Adjust the paths in myusername_mymachine.yaml so they point to the directory where results should be stored and to the directory from which the data is fetched.
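Depending on your setup, the resulting user config might look roughly like the sketch below. The key names here are illustrative placeholders, not the repo's actual schema; check the template you copied (abizeul_biomed.yaml) for the real keys.

```yaml
# myusername_mymachine.yaml -- illustrative sketch; real key names come
# from the copied template (abizeul_biomed.yaml).
output_dir: /path/to/results    # where checkpoints, logs and plots are written
data_dir: /path/to/datasets     # where the datasets are fetched from
```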
Make sure to either compute or download the necessary assets for the dataset you plan to use with PMAE. These include the mean and standard deviation for image normalization, as well as the eigenvalues and eigenvectors. For each dataset, these assets are available on Zenodo; the links are listed below.
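If you prefer to compute these assets yourself, the scripts in tools/ are the reference. As a rough, self-contained sketch of what such a computation involves (toy random data standing in for flattened images; scikit-learn's PCA standing in for the repo's own tooling):

```python
# Illustrative sketch only: computing per-dataset assets (mean, std,
# eigenvectors, eigenvalue ratios) with scikit-learn. The repo's tools/
# scripts are the authoritative implementation.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
images = rng.standard_normal((256, 8 * 8 * 3))  # N toy "images", flattened

mean = images.mean(axis=0)  # per-dimension mean for image normalization
std = images.std(axis=0)    # per-dimension std for image normalization

pca = PCA()                              # full PCA over the flattened pixels
pca.fit((images - mean) / std)           # fit on normalized data

eigenvectors = pca.components_               # principal axes, one per row
eigenratio = pca.explained_variance_ratio_   # variance share per component
```

The eigenvectors define the components that PMAE masks, and the explained-variance ratios determine how much variance a given masking ratio removes.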
Once the files are downloaded and stored on your local machine, make sure to specify their paths in the dataset's config (data.mean.data.file, data.std.data.file, extradata.pcamodule.file and extradata.eigenratiomodule.file). See ImageNet's config as an example.
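For instance, the relevant part of a dataset config could look like the following sketch (the key paths are the ones listed above; the file names are placeholders for the downloaded assets):

```yaml
# Dataset config fragment -- file names are placeholders.
data:
  mean:
    data:
      file: /path/to/assets/imagenet_mean.pt
  std:
    data:
      file: /path/to/assets/imagenet_std.pt
extradata:
  pcamodule:
    file: /path/to/assets/imagenet_eigenvectors.pt
  eigenratiomodule:
    file: /path/to/assets/imagenet_eigenratio.pt
```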
To launch experiments, you can find training and evaluation scripts in the scripts directory. The following modifications should be made to the train.sh script to ensure smooth training on your local machine:
USER_MACHINE="myusername_mymachine" # the user running the experiment
EXPERIMENT="pmae_tiny_pc" # the experiment to run, defines the model, dataset and masking type
MASK=0.2 # the masking ratio to use, default: 0.2
Please find the whole set of pre-defined experiments to choose from in config/experiment. Note that train.sh includes a final evaluation of the representations using a linear probe.
Distributed Training: For distributed training, please use the train_distributed.sh script instead and adjust the number of GPUs according to your own resources. Note that our code uses PyTorch Lightning for distributed training.
Baselines: To run the MAE baseline in place of PMAE, set EXPERIMENT to mae_tiny or any other experiment whose name starts with mae.
Random Masking: To run PMAE with randomized masking ratios as presented in the paper [arXiv], set EXPERIMENT to pmae_tiny_pcsampling or any other experiment whose name contains pcsampling.
To evaluate a checkpoint, use the evaluation script in the scripts directory; it covers linear-probe, MLP-probe, k-nearest-neighbor, and fine-tuning evaluation. The following modifications should be made to the eval.sh script to ensure a smooth evaluation on your local machine:
USER_MACHINE="myusername_mymachine" # the user running the experiment
EXPERIMENT="pmae_tiny_pc" # the experiment to run, defines the model, dataset and masking type
EPOCH=800 # the epoch to be evaluated
MASK=0.2 # the masking ratio to use, default: 0.2
Additionally, ensure the path to the checkpoint you want to evaluate is correctly set in your user configuration file. For reference, see config/user/abizeul_euler.yaml. The specified checkpoint (defined by its path and epoch) will then be evaluated.