Highlights

  • Pro

Organizations

@DM2E @OCR-D

Starred repositories

Event-driven networking engine written in Python.

Python 5,879 1,203 Updated Oct 7, 2025

OCR-D processor for Hugging Face transformer OCR models

Python 4 Updated Oct 6, 2025

Coordinates of manually annotated job ads with a link to ANNO Corpus.

3 Updated May 28, 2025

Create web-based user interfaces with Python. The nice way.

Python 14,132 840 Updated Oct 10, 2025

This repository contains code to read, process, and integrate data from inventory cards.

Python 1 1 Updated Jan 17, 2025
Python 7 1 Updated Oct 16, 2023

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

JavaScript 475 26 Updated Sep 28, 2025

Schemas for repositories of HTR/OCR models

Python 10 Updated Jul 21, 2025

Page-wise text recognition with lower-supervision line data models

Python 47 5 Updated Sep 18, 2025

OCR Confidence Analysis script written in Python

HTML 6 1 Updated Apr 20, 2024
Jupyter Notebook 3 Updated Feb 2, 2025

XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml

Python 86 27 Updated Aug 16, 2025

Templating Kubernetes resources with *real* code

C# 114 28 Updated Dec 10, 2022

A tool to automatically convert old string literal formatting to f-strings

Python 724 38 Updated Sep 8, 2025

Contextual HookFormer for Glacier Calving Front Segmentation (DOI: 10.1109/TGRS.2024.3368215)

Python 3 Updated Jul 25, 2025
Shell 14 23 Updated Oct 6, 2025

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 28,102 3,479 Updated Sep 24, 2024

This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.

Python 9 1 Updated Jan 28, 2024

Loghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an accessible approach to transcribing historical documents and tr…

Shell 129 20 Updated Oct 5, 2025

Layout analysis to find layout elements in documents (similar to P2PaLA)

Python 19 8 Updated Oct 3, 2025

Convert ALTO XML to plain text + minimal metadata

Python 17 2 Updated Oct 17, 2024

Obsolete repo, merged into eynollah

12 10 Updated Sep 29, 2025

Complete lxml external type annotation

Python 69 8 Updated Oct 1, 2025

Formatting and integrating the Deutsches Textarchiv dictionary into various applications

Makefile 2 Updated Mar 1, 2024

OCR Groundtruth ULB VD18 - OCR-D Phase III

4 2 Updated Oct 25, 2024

OCR Groundtruth ULB VD18 Latin - OCR-D Phase III

4 3 Updated Oct 25, 2024

OCR Groundtruth ULB VD18 German Fraktur - OCR-D Phase III

4 3 Updated Oct 25, 2024

A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations

Python 4,186 254 Updated Jun 3, 2025