Organizations

@DM2E @OCR-D

Starred repositories

Research artefact for the paper ‘“Works on My Machine”: A Case Study of Replicability Challenges in Computational Humanities Research’ at CHR 2025

Jupyter Notebook 3 Updated Dec 9, 2025

Code for the paper "UVDoc: Neural Grid-based Document Unwarping"

C++ 193 35 Updated Jul 28, 2024

Tutorial for using the Datasette instance provided by IDM 4

R 4 Updated Jan 28, 2026

Web app to upload and display multiple PageXML files

Python 2 1 Updated Jan 12, 2026

A Python Interpreter written in Rust

Rust 21,792 1,399 Updated Feb 15, 2026

OCR-D integration for PaddleOCR

Python 2 Updated Nov 3, 2025

Rust bindings for the C++ api of PyTorch.

Rust 5,281 415 Updated Jan 22, 2026

Contexts Optical Compression

Python 22,464 2,059 Updated Jan 27, 2026

OCR-D processor for the party text recognizer

Python 3 1 Updated Jan 11, 2026

OCR-D wrapper for yolo based on the ocrd_detectron2 wrapper

Python 2 1 Updated Jan 11, 2026

Event-driven networking engine written in Python.

Python 5,945 1,207 Updated Feb 9, 2026

Coordinates of manually annotated job ads with a link to ANNO Corpus.

3 Updated May 28, 2025

Create web-based user interfaces with Python. The nice way.

Python 15,352 904 Updated Feb 14, 2026

This repository contains code to read, process, and integrate data from inventory cards.

Python 1 1 Updated Nov 18, 2025
Python 7 1 Updated Oct 16, 2023

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

JavaScript 758 42 Updated Feb 12, 2026

Schemas for repositories of HTR/OCR models

Python 11 Updated Jul 21, 2025

Page-wise text recognition with lower-supervision line data models

Python 51 7 Updated Feb 13, 2026

OCR Confidence Analysis script written in Python

HTML 6 1 Updated Apr 20, 2024
Jupyter Notebook 3 Updated Feb 2, 2025

XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml

Python 88 28 Updated Jan 20, 2026

Templating Kubernetes resources with *real* code

C# 114 26 Updated Dec 10, 2022

A tool to automatically convert old string literal formatting to f-strings

Python 727 37 Updated Dec 15, 2025

Contextual HookFormer for Glacier Calving Front Segmentation (DOI: 10.1109/TGRS.2024.3368215)

Python 3 Updated Nov 28, 2025
Shell 16 28 Updated Jan 7, 2026

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 28,956 3,535 Updated Dec 5, 2025

This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.

Python 9 1 Updated Jan 28, 2024