Project Developer / Digitizer / XML Enthusiast
Highlights
- Pro
Stars
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
CrazyCrud / page-xml-draw
Forked from VRI-UFPR/page-xml-drawA powerful CLI tool for visualization and encoding of PAGE-XML files
UB-Mannheim / ocr-model-metadata
Forked from OCR-D/gt-metadataMetadata tool for ocr models
Create a teiCorpus-file from a collection of TEI documents
This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation was created in the context of the OCR-BW project.
tboenig / gt-guidelines
Forked from kba/gt-guidelinesOCR-D guidelines for Ground Truth production
Ground truth for digitized publications of UB Tübingen