end-to-end robust and comprehensive ORF prediction pipeline
The Hiller Lab at the Senckenberg Research Institute
orf .
pipeline .
us
genome ─────────────────────────────────────────────
ORFs ATG════TAA ATG══════════TGA ATG══TAG
translationAi █████░░░░ ████████░░░░ ██░░░░░
RNASamba ████░░░░░ ███████░░░░░ █░░░░░░
TRANSAID ██████░░░ █████████░░░ ██░░░░░
Netstart2 ███░░░░░░ ██████░░░░░ █░░░░░░
BLAST ███████░░ ████████░░░ ░░░░░░░
features └──────┘ └──────────┘ └─────┘
│ │ │
▼ ▼ ▼
GBoost: + GBoost: + GBoost: −
Note
Requirements: Nextflow ≥ 25.04.6, Docker or Apptainer, Java.
git clone https://github.com/hillerlab/xorf.git
cd xorfEdit params.json (set regions, sequence, database), then:
# Docker
nextflow run main.nf -params-file params.json -profile docker
# Apptainer / Singularity
nextflow run main.nf -params-file params.json -profile apptainerSmoke test:
nextflow run main.nf -profile test,apptainerNote
You can also specify these options directly in params.json.
A helper sh script is provided to run the pipeline on a SLURM cluster. See details below.
Click to expand
Edit the path variables at the top of assets/hpc/xorf.sh (cache dir, container image, manifest path), then submit:
sbatch --array=1-<N> xorf.shEach array task spawns one Nextflow head job that submits all compute as child SLURM jobs.
PREDICT_ORFS run as SLURM job arrays. Partition routing, array sizes, and resource tiers are documented inline in nextflow.config — edit there to match your cluster.
results/
├── 00_concat/ *bed
├──── 00_concat/raw/ *bed
├── 01_duplicates/ *bed
├── 02_results/ *bed
└── pipeline_info/ timeline, trace, DAG
| File | What |
|---|---|
params.json |
Genome paths, alignment settings, checkpoints — per run |
nextflow.config |
Compute resources, profiles, container, SLURM — rarely |