GitHub - AuReMe/prolipipe: The prolific pipeline covers genome annotation and metabolic pathway reconstruction. Starting from .fasta genomes, it determines whether bacterial strains are theoretically able to produce metabolites

Prolipipe: large-scale assessment of bacterial metabolic profiles, focusing on specific pathways.

Prolipipe assesses the capacity to synthesize or degrade specific compounds across a large set of bacterial metabolic networks, and screens those networks accordingly.

License

This workflow is licensed under the GNU GPL-3.0-or-later, see the LICENSE file for details.

Installation

Dependencies

Prolipipe relies on outputs from the "AuFAMe" package. It runs with Python >= 3.8.

These Python packages are needed:

pandas

numpy

plotly

glob2

jupyter

papermill

padmet

Prolipipe also needs Quarto (version > 1.7) to generate the interactive report.
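Before running the pipeline, it can help to check that the environment matches the requirements listed above. The following is a minimal sketch, not part of Prolipipe: the helper name `check_environment` is hypothetical, each package is assumed to be importable under the name listed above, and the Quarto executable is assumed to be called `quarto` on the PATH.

```python
import importlib.util
import shutil
import sys

# Packages listed in the Dependencies section; import names are assumed
# to match the package names.
REQUIRED_PACKAGES = ["pandas", "numpy", "plotly", "glob2", "jupyter", "papermill", "padmet"]


def check_environment(packages=REQUIRED_PACKAGES, min_python=(3, 8)):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    if sys.version_info < min_python:
        problems.append("Python >= %d.%d is required" % min_python)
    for name in packages:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is None:
            problems.append("missing Python package: " + name)
    # Only checks that Quarto is present, not that its version is > 1.7.
    if shutil.which("quarto") is None:
        problems.append("Quarto executable not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

The sketch only reports what is missing; it does not install anything and does not verify package or Quarto versions.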

Conda

Prolipipe is available on the conda channel "fermentsdufutur" and can be installed with:

conda create -n prolipipe
conda activate prolipipe
conda install -c fermentsdufutur prolipipe

Preparation of taxfile

Prolipipe and AuFAMe both rely on metadata about the genomes: AuFAMe uses a taxonomic ID to build accurate GSMs, while Prolipipe uses the species name and assembly-level category to cluster genomes during its analyses. A TSV file with the following columns is compatible with both:

"Species": space-separated species name (used to categorize genomes in reports and heatmaps)

"Taxon_id": exact taxID of the species; can be set to "2" (Bacteria) when no specific taxID is used

"Filename": exact name of the genome file, without file extension

"Strain": space-separated strain name

"Status": assembly level, used as a second way to categorize genomes by assembly quality; one of "Complete", "Chromosome", "Scaffold" or "Contig"
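The column layout above can be checked before launching an analysis. The following is a minimal sketch using only the standard library; the function name `validate_taxfile` and the example row (including the file name `example_genome_1`) are made up for illustration, and the checks reflect only the column descriptions above, not Prolipipe's own parsing. Whether column order matters is not stated, so only the presence of each column is checked.

```python
import csv
import io

EXPECTED_COLUMNS = ["Species", "Taxon_id", "Filename", "Strain", "Status"]
ALLOWED_STATUS = {"Complete", "Chromosome", "Scaffold", "Contig"}


def validate_taxfile(handle):
    """Return a list of problems found in a tab-separated taxfile."""
    reader = csv.DictReader(handle, delimiter="\t")
    header = reader.fieldnames or []
    missing = [column for column in EXPECTED_COLUMNS if column not in header]
    if missing:
        return ["missing column(s): " + ", ".join(missing)]
    problems = []
    # Line 1 is the header, so data rows start at line 2.
    for line_number, row in enumerate(reader, start=2):
        if not (row["Taxon_id"] or "").isdigit():
            problems.append("line %d: Taxon_id is not an integer" % line_number)
        if not (row["Filename"] or ""):
            problems.append("line %d: Filename is empty" % line_number)
        if row["Status"] not in ALLOWED_STATUS:
            problems.append("line %d: unexpected Status %r" % (line_number, row["Status"]))
    return problems


# Illustrative taxfile content; the file name and strain are placeholders.
EXAMPLE = (
    "Species\tTaxon_id\tFilename\tStrain\tStatus\n"
    "Escherichia coli\t562\texample_genome_1\tK-12\tComplete\n"
)

if __name__ == "__main__":
    print(validate_taxfile(io.StringIO(EXAMPLE)))
```

For a file on disk, the same function can be called with `open(path, newline="")` in place of the `io.StringIO` object.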

Usage

To run Prolipipe with padmet files from AuFAMe as input, generating TSV files and an interactive Quarto report:

prolipipe -pad DIRECTORY --tax TAXFILE --pwy PWY_FOLD

To generate TSV files without the Quarto report:

prolipipe -pad DIRECTORY --tax TAXFILE --pwy PWY_FOLD --no-report

To run Prolipipe with TSV files from AuFAMe as input and generate an interactive Quarto report:

prolipipe -i DIRECTORY --tax TAXFILE --pwy PWY_FOLD

To regenerate the Quarto report from TSV files created by Prolipipe:

prolipipe-report -i DIRECTORY -d OUT_DIRECTORY
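When Prolipipe is launched from a script, the `prolipipe` invocations above can be assembled programmatically. The following is a minimal sketch that uses only the flags shown in this section; the function name `build_prolipipe_command` and the example paths are hypothetical, and the command is only built, not executed.

```python
import shlex


def build_prolipipe_command(input_dir, taxfile, pwy_fold, padmet_input=True, report=True):
    """Assemble the prolipipe argument list from the flags shown in the Usage section."""
    # -pad takes a directory of padmet files, -i a directory of TSV files.
    input_flag = "-pad" if padmet_input else "-i"
    command = ["prolipipe", input_flag, input_dir, "--tax", taxfile, "--pwy", pwy_fold]
    if not report:
        # The Usage section only shows --no-report together with -pad;
        # combining it with -i is an assumption.
        command.append("--no-report")
    return command


if __name__ == "__main__":
    # Placeholder paths for illustration only.
    command = build_prolipipe_command("padmet_dir", "taxfile.tsv", "pathways", report=False)
    print(shlex.join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)`; keeping the arguments as a list avoids shell quoting issues with paths that contain spaces.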

