Skip to content

linyz/remora

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

  1. Make STAR index and other refs:

    • (optional for DE analysis) concat deaminases sequence with GRCh38
    • (optional for DE analysis) concat deaminases gtf annotation with gencode v40 basic gtf
    • samtools faidx
    • picard CreateSequenceDictionary
    • make chromosome bed files (for run mpileup in parallel) ./chrBed, chromosome length from GRCh38
  2. Run run_STAR.sh. I put each pair of paired-end fastq.gz files in a subfolder

    for dir in ./*/
    do
    cd $dir
    echo $PWD
    qsub -cwd -pe smp 4 -l h_rt=336:00:00 -l mem_free=15G ../run_STAR.sh 
    cd ..
    done
    
    • STAR alignment: need to trim 3' illumina universal adapter
    • samtools sorting etc
    • picard MarkDuplicate
    • gatk SplitNCigarReads
    • split by read orientation
  3. run bcftools mpileup for variants calling. I split up files by chromosome and strand for parallel computing.

  4. Custom R script (vcf_preprocess.R) to reformat vcf files, convert DNA alt to RNA alt, and basic filtering

    • If refs have extra chromosomes, remove ##contig lines to only keep chr1-23, X, Y, M;
  5. concat all [strand_chr].vcf files for downstream editing site calling. (so multiple test correction is done tx-wide)

  6. Edit_call_cmh.R

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors