-
alto-tools Public
Python tools for performing various operations on ALTO XML files
-
ocr-conversion Public
Conversions between various OCR formats
-
eynollah Public
Forked from qurator-spk/eynollah
Document Layout Analysis
-
alto-ocr-confidence Public archive
calculate OCR confidence per page in ALTO
-
ner-corpora Public
Forked from EuropeanaNewspapers/ner-corpora
Named Entity Recognition corpus for (historical) Dutch, French, German
5 Updated Apr 5, 2023
-
ocr-gt Public
OCR & Ground Truth Resources
-
hip21_ocrevaluation Public
A Survey of OCR Evaluation Tools and Metrics (HIP'21)
-
-
Fast classification of newspaper pages using fastai
-
bbz-ocr-train Public
Forked from AniketGurav/bbz-ocr-train
Ground truth line annotations for the Berliner Börsen-Zeitung
1 Updated Jun 3, 2020
-
interoperability-framework Public
Forked from impactcentre/interoperability-framework
Interoperability layer supporting the loose coupling of software components developed during the IMPACT project
Java Updated Apr 28, 2018
-
deep-wittgenstein Public
Forked from stefan-it/deep-wittgenstein
Classification of Wittgenstein's remarks
Python GNU Affero General Public License v3.0 Updated Mar 5, 2018
-
EN-data_mining Public
Forked from altomator/EN-data_mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
HTML Updated Nov 7, 2017
-
stringmetric Public
Forked from rockymadden/stringmetric
🎯 String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Re…
Scala Updated Jul 28, 2017
-
warcbase Public
Forked from lintool/warcbase
Warcbase is an open-source platform for managing and analyzing web archives
-
ner-app Public
Forked from EuropeanaNewspapers/ner-app
Named Entity Recognition tool for Europeana Newspapers
Java Other Updated Feb 16, 2017
-
altoedit-2.0 Public
Forked from renevanderark/altoedit-2.0
Edit the ALTO directly in the XML
JavaScript Updated Apr 28, 2015
-
alto-editor Public
Forked from KBNLresearch/alto-editor
Browser-based post-correction tool for ALTO XML files
JavaScript Updated Sep 20, 2013
-
scape-tavernahadoop-demonstrator Public
Forked from openpreserve/scape-tavernahadoop-demonstrator
SCAPE demonstrator project for Taverna and Hadoop
Java Updated Aug 30, 2013
-
hack4europe Public
Forked from KBNLresearch/hack4europe
JavaScript-based portal for searching Europeana collections and creating enrichments on the metadata
JavaScript Apache License 2.0 Updated Aug 21, 2012
-
web-wf-design Public archive
Forked from http://code.google.com/p/taverna/source/browse/portal/web-wf-design/trunk/web-wf-design/ for experimenting