Skip to content

Is fastq file converted from "Dorado" unsorted bam by "samtools fastq" suitable for m6anet #196

@Tesdhi

Description

@Tesdhi

Hi Matthias (tagging you here @mocherry),

  1. You can pass the --emit-fastq flag to dorado basecaller, which would emit a fastq file, this is sufficient for downstream running nanopolish and m6anet

  2. You can use minimap2 and samtools to get a sorted.bam file:


minimap2 -ax map-ont -uf -t 3 --secondary=no <MMI> <PATH/TO/FASTQ.GZ> > <PATH/TO/SAM> 2>> <PATH/TO/SAM_LOG>

samtools view -Sb <PATH/TO/SAM> > <PATH/TO/BAM>

samtools sort <PATH/TO/BAM> -o <PATH/TO/SORTED.BAM> 

samtools index <PATH/TO/BAM>

  1. You can then use the fastq file and the fast5 files (or convert the pod5 files to fast5 files with pod5 convert to_fast5 and run nanopolish index

  2. Then, you can run nanopolish eventalign with the fast5, fastq, and sorted.bam, which will give you an eventalign.txt file to input to m6anet dataprep.

Not sure whether you are open to using command line, but you can check out the nf-core/nanoseq, which does all the steps for you.

Thanks!

Best wishes,

Yuk Kei

Originally posted by @yuukiiwa in #155

Hello Yuk Kei!

For the step1 you mentioned about getting the "fastq" file by "dorado",

I am wondering whether it is an alternative way to use "samtools fastq" to convert the unsorted bam file (produced by "dorado basecaller") to "fastq" file as the input "fastq" for "nanopolish eventalign"?

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions