Skip to content

DESEq error on MonoallelicExpression_pipeline_MAE_deseq_mae_R module using demo dataset #601

@Roshanshafiha

Description

@Roshanshafiha

Hi, I am running the whole DROP pipeline (version-1.4.0) on the test data and I am getting the below error, may I know how can I fix this?

Image

Below is my config file:



projectTitle: Detection of RNA Outlier Pipeline
htmlOutputPath: Output/html
indexWithFolderName: true
root: Output
sampleAnnotation: Data/sample_annotation_relative.tsv
geneAnnotation:
  v29: Data/gencode_annotation_trunc.gtf
genomeAssembly: hg19
genome: 
  ncbi: Data/chr21_ncbi.fa
  ucsc: Data/chr21.fa
hpoFile: /projects/pipelines/drop_demo/hpo_genes.tsv.gz
random_seed: true
exportCounts:
  geneAnnotations:
  - v29
  excludeGroups:
    - mae
    - outrider_external
    - fraser_external

aberrantExpression:
    run: true
    groups:
      - outrider
      - outrider_external
    fpkmCutoff: 1
    implementation: autoencoder
    padjCutoff: 1
    zScoreCutoff: 0
    genesToTest: "Data/genes_to_test.yaml"
    maxTestedDimensionProportion: 3
    yieldSize: 2000000

aberrantSplicing:
    run: true
    groups:
      - fraser
      - fraser_external
    recount: true
    longRead: false
    keepNonStandardChrs: false
    filter: false
    minExpressionInOneSample: 20
    quantileMinExpression: 10 
    quantileForFiltering: 0.95 
    minDeltaPsi: 0.05
    implementation: PCA
    padjCutoff: 1 
    maxTestedDimensionProportion: 6
    genesToTest: "Data/genes_to_test.yaml"
    ### FRASER1 configuration for demo dataset
    FRASER_version: "FRASER" 
    deltaPsiCutoff : 0.05 

mae:
    run: true
    groups:
      - mae
    gatkIgnoreHeaderCheck: true
    padjCutoff: .5
    allelicRatioCutoff: 0.7
    addAF: false
    maxAF: .001
    maxVarFreqCohort: 1
    # VCF-BAM matching
    qcVcf: Data/qc_vcf_1000G.vcf.gz
    qcGroups:
      - mae
    dnaRnaMatchCutoff: 0.85

rnaVariantCalling:
    run: true
    groups:
      - batch_0
      - batch_1
    highQualityVCFs:
      - Data/high_confidence_snps.vcf.gz
      - Data/high_confidence_indels.vcf.gz
    dbSNP: Data/dbSNP_chr21.vcf.gz
    repeat_mask: Data/repeat_mask_chr21.bed
    createSingleVCF: true
    addAF: false
    maxAF: 0.001
    maxVarFreqCohort: 1
    hcArgs: null
    minAlt: 3
    yieldSize: 100000

tools:
  gatkCmd: gatk
  bcftoolsCmd: bcftools
  samtoolsCmd: samtools

I have downloaded the demo data from the following script download_data.sh :

#!/bin/bash
set -e

# get data
resource_url="https://github.com/gagneurlab/drop_demo_data/archive/refs/heads/main.zip"
tmpdir="$(dirname "$(mktemp)")"
wget -nc -P $tmpdir $resource_url

# if the directory Data does not exist
if [ ! -d "Data" ]; then
        unzip "$tmpdir/main.zip" 
        mv drop_demo_data-main/Data Data
        rm -rf drop_demo_data-main
else
    echo "Data directory not empty, is not updated"
fi

# unzip fasta
cd ./Data
if [ ! -f "chr21.fa" ]; then gunzip chr21.fa.gz; fi

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions