Skip to content

xmao/kobas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KOBAS — KEGG Orthology Based Annotation System

What's KOBAS?

KEGG Orthology Based Annotation System (KOBAS) is a Python application for bioinformatics. KOBAS assigns KEGG Orthology (KO) terms to query sequences via BLAST similarity search, and discovers enriched pathways among the annotation results by frequency analysis or statistical significance testing.

Installation

# Using pixi (recommended)
pixi install
pixi run test

# Or using pip
pip install -e .[dev]

Requires Python >= 3.10. See docs/install.txt for detailed instructions.

Usage

KOBAS provides a unified kobas command with subcommands for each task.

Sequence Annotation

Annotate a protein FASTA file with KO terms:

kobas annotate input.fasta

For nucleotide sequences:

kobas annotate -p blastx input.fasta

Pathway Discovery

Find the most frequent pathways among annotations:

kobas pathfind annotation.txt

Find statistically enriched pathways against a background organism (KEGG three-letter abbreviation) or file:

kobas pathfind -k h annotation.txt hsa
kobas pathfind -k b annotation.txt background.txt

Statistical methods:

Flag Method
-k b Binomial test
-k c Chi-square test
-k h Hypergeometric test (default)

Utility Commands

Filter or translate annotation files:

kobas filter-annot [-e EVALUE] [-r RANK] [-s SIEVE] annot_file

Filter or translate FASTA files:

kobas filter-fasta [-s SIEVE] [-t TRANSLATE] fasta_file

Initialize KOBAS data:

kobas init -s /path/to/kobas-data

Run <subcommand> -h for full options.

Contact

About

KOBAS: a command line tool for identifying significant pathways from genomic data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors