Lists (15)
Sort Name ascending (A-Z)
Stars
HTRflow is the underlying engine for our HTR-pipeline
The Sleuth Kit® (TSK) is a library and collection of command line digital forensics tools that allow you to investigate volume and file system data. The library can be incorporated into larger digi…
This is the development tree. Production downloads are at:
🍃 JavaScript library for mobile-friendly interactive maps 🇺🇦
Tesseract Open Source OCR Engine (main repository)
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
brozzler - distributed browser-based web crawler
An ArchivesSpace plugin to provide harmful content warnings
List of my most used commands and shortcuts in the terminal for Mac
DROID (Digital Record and Object Identification)
Library of Congress Reconciliation Service for OpenRefine (LCNAF, LCSH)
Open Source Data Science Resources.
List of Data Science Cheatsheets to rule the world
Toturials coming with the "data science roadmap" picture.
📝 An awesome Data Science repository to learn and apply for real world problems.
📚 Playground and cheatsheet for learning Python. Collection of Python scripts that are split by topics and contain code examples with explanations.
The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than 100 days. Follow your own pace. These vide…
Roadmap to becoming an Artificial Intelligence Expert in 2022
Open ONI (Open Online Newspaper Initiative) Django web app
This software project is no longer being actively developed at the Library of Congress. Consider using the Open-ONI (https://github.com/open-oni) fork of the chronam software. Project mailing list:…
CCA Digital Archives Processing Manual
ePADD is a software package developed by Stanford University's Special Collections & University Archives that supports archival processes around the appraisal, ingest, processing, discovery, and de…