Snakemake workflow for testing μ-PBWT against Durbin's PBWT and Syllable-PBWT on 1000 Genome Project (1KGP) phase 3 data. 1KGP data are available at this link.
Snakemake need to be already installed, for example via conda:
conda create -c conda-forge -c bioconda -n snakemake snakemakecd muPBWT-1KGP-workflow
snakemake --cores <num_cores> --use-conda --resources load=100The option --resources load=100 option will avoid using too much RAM for Durbin Algorithm 5 (about 500gb is still needed), running only one job at a time for the rule runPbwtIndexed(thanks Jan Schreiber).
The pipeline will generatecd some results:
- in
results/datasome useful CSV files - in
results/plotssome plots in PDF format - in
results/tablessome tables in LaTeX syntax