Skip to content

Cuttlefish3 segfaults randomly on a standard E.coli pangenome #52

@sebschmi

Description

@sebschmi

Dataset:

3681 E. Coli genomes used in e.g. the Eulertigs paper. One genome has since been deleted from NCBI and was hence not included.

Command:

bin/cuttlefish build -m 128 -s references.fa -k 27 -t 128 --path-cover -o unitigs -w workdir --ref -c 1

Additional Info:

The same command works with -k 63, or without --path-cover.

Log:

Constructing the compacted reference de Bruijn graph for k = 27.
Edge frequency cutoff: 1.
Partitioned 18333 MB of uncompressed data.
Number of processed chunks: 2413.
Total size of chunks: 19224055383.
Number of records: 745406.
Number of super (k - 1)-mers: 2359174334.
Total length of the weak super k-mers:  82590357521.
Total length of the super (k - 1)-mers: 77873499665.
Total work in parse: 24.7938s.
Total work in processing records: 492.486s.
Max work in processing records: 492.486s.
Total atlas size in bytes: 47183486680.
Sequence splitting into subgraphs completed. Time taken: 529.71 seconds.
Solved 16384 subgraphs.16064 subgraphs.16067 subgraphs.16067 subgraphs.
Total work in graph construction: 1383.89 (s).
Total work in graph contraction:  120.739 (s).
Total work in bucket removal:     94.8811 (s).
Maximum k-mer count in bucket: 5569711.
Minimum k-mer count in bucket: 412034.
Sum graph size: 181111067.
Largest graph size: 100994.
Smallest graph size: 5275.
Sum label size: 940099714.
Bytes in super k-mer buckets: 47183486680.
Bytes in compressed super k-mer buckets: 22227158678.
lm-tig count: 29192097.
Trivial maximal unitig count: 3441242.
Trivial ICC count: 0.
Subgraphs construction and contraction completed. Time taken: 6.35336 seconds.
Edge-matrix size: 25750855
Phantom edge upper-bound: 5875
Expecting at most 5648246 more non-DCC maximal unitigs
Hash table capacity during contraction: 1048576.
Part: 10
Formed 5648254 meta-vertices.
Found 8 ICCs.
Found 5875 phantoms.
Map clearing time: 0.150711.
Non-diagonal edges contraction time: 1.73894.
Diagonal-chain computation time: 0.439419.
Diagonal-chain contraction time: 0.343218.
Filtering in false-phantom edges time: 0.138168.
Discontinuity-graph contraction completed. Time taken: 3.52024 seconds.
Hash table capacity during expansion: 1048576.
Part: 64
Map clearing time: 0.191931.
Path-info load time: 0.866394.
Non-diagonal blocks expansion time: 1.21174.
Diagonal block expansion time: 0.199794.
Special case time: 1.73619.
Deletion time: 0.117828.
Expansion of contracted graph completed. Time taken: 4.8203 seconds.
Sum edge-bucket size: 25756730
Maximum edge-bucket size: 34894
Found 25756730 edges.
Time taken in mapping: 1.24098s.
Maximum maximal unitig bucket size:   33801
Maximum label length in mtig-buckets: 1124388
Peak-RAM before collation: 36.1891
Command terminated by signal 11

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions