GitHub - linyz/remora

Make STAR index and other refs:
- (optional for DE analysis) concat deaminases sequence with GRCh38
- (optional for DE analysis) concat deaminases gtf annotation with gencode v40 basic gtf
- samtools faidx
- picard CreateSequenceDictionary
- make chromosome bed files (for run mpileup in parallel) ./chrBed, chromosome length from GRCh38
Run run_STAR.sh. I put each pair of paired-end fastq.gz files in a subfolder
```
for dir in ./*/
do
cd $dir
echo $PWD
qsub -cwd -pe smp 4 -l h_rt=336:00:00 -l mem_free=15G ../run_STAR.sh 
cd ..
done
```
- STAR alignment: need to trim 3' illumina universal adapter
- samtools sorting etc
- picard MarkDuplicate
- gatk SplitNCigarReads
- split by read orientation
run bcftools mpileup for variants calling. I split up files by chromosome and strand for parallel computing.
Custom R script (vcf_preprocess.R) to reformat vcf files, convert DNA alt to RNA alt, and basic filtering
- If refs have extra chromosomes, remove ##contig lines to only keep chr1-23, X, Y, M;
concat all [strand_chr].vcf files for downstream editing site calling. (so multiple test correction is done tx-wide)
Edit_call_cmh.R

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
chrBed		chrBed
Edit_call_cmh.R		Edit_call_cmh.R
bcftoolsf_multi_PUM1-rABE_chr.sh		bcftoolsf_multi_PUM1-rABE_chr.sh
readme.MD		readme.MD
run_STAR.sh		run_STAR.sh
vcf_preprocess1.R		vcf_preprocess1.R

Provide feedback