Hi Matthias (tagging you here @mocherry),
-
You can pass the --emit-fastq flag to dorado basecaller, which would emit a fastq file, this is sufficient for downstream running nanopolish and m6anet
-
You can use minimap2 and samtools to get a sorted.bam file:
minimap2 -ax map-ont -uf -t 3 --secondary=no <MMI> <PATH/TO/FASTQ.GZ> > <PATH/TO/SAM> 2>> <PATH/TO/SAM_LOG>
samtools view -Sb <PATH/TO/SAM> > <PATH/TO/BAM>
samtools sort <PATH/TO/BAM> -o <PATH/TO/SORTED.BAM>
samtools index <PATH/TO/BAM>
-
You can then use the fastq file and the fast5 files (or convert the pod5 files to fast5 files with pod5 convert to_fast5 and run nanopolish index
-
Then, you can run nanopolish eventalign with the fast5, fastq, and sorted.bam, which will give you an eventalign.txt file to input to m6anet dataprep.
Not sure whether you are open to using command line, but you can check out the nf-core/nanoseq, which does all the steps for you.
Thanks!
Best wishes,
Yuk Kei
Originally posted by @yuukiiwa in #155
Hello Yuk Kei!
For the step1 you mentioned about getting the "fastq" file by "dorado",
I am wondering whether it is an alternative way to use "samtools fastq" to convert the unsorted bam file (produced by "dorado basecaller") to "fastq" file as the input "fastq" for "nanopolish eventalign"?
Thank you!
Originally posted by @yuukiiwa in #155
Hello Yuk Kei!
For the step1 you mentioned about getting the "fastq" file by "dorado",
I am wondering whether it is an alternative way to use "samtools fastq" to convert the unsorted bam file (produced by "dorado basecaller") to "fastq" file as the input "fastq" for "nanopolish eventalign"?
Thank you!