Lists (15)
Sort Name ascending (A-Z)
Stars
OpenRefine is a free, open source power tool for working with messy data and improving it
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
DROID (Digital Record and Object Identification)
The Bagger application packages data files according to the BagIt specification.
ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and de…
NARA File Analyzer and Metadata Harvester
Collection of tools for processing archival artifacts