Tags: nhoffman/ya16sdb
ANI annotations, dependency updates and pipeline cleanup (#43)

* renaming this to download.py
* Handles accession2taxid now instead of global list of ncbi accessions
* Re-structuring pipeline for clarity
* bin/ya16sdb.py holds some common code
* Remove sequences with unknown tax_ids after looking for features
* adding an easier to manage cache folder
* Fixing set difference logic
* removing feather-format and locking in dependency versions
* moving SEQ_INFO_COLS into ya16sdb.py script
* Correctly appending previous records to records cache
* Adding ANI tax check annotations
* 0.8 update notes
* Not needed anymore
* Explicit merge(on="...") for clarity and some additional comments
* need biopython and bumping sqlalchemy version
* Using headers in ani and asm files (inspired from dhoogest wgs extraction code) and prioritizing assembly_genbank accessions found in asm file
* 0.7 release