Starred repositories
Event-driven networking engine written in Python.
OCR-D processor for Hugging Face transformer OCR models
Coordinates of manually annotated job ads with a link to ANNO Corpus.
Create web-based user interfaces with Python. The nice way.
This repository contains code to read, process, and integrate data from inventory cards.
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Page-wise text recognition with lower-supervision line data models
OCR Confidence Analysis script written in Python
XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml
Templating Kubernetes resources with *real* code
A tool to automatically convert old string literal formatting to f-strings
Contextual HookFormer for Glacier Calving Front Segmentation (DOI: 10.1109/TGRS.2024.3368215)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.
knaw-huc / loghi
Forked from rvankoert/loghi
Loghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an accessible approach to transcribing historical documents and tr…
Layout analysis to find layout elements in documents (similar to P2PaLA)
Convert ALTO XML to plain text + minimal metadata
Obsolete repo, merged into eynollah
Formatting and integrating the Deutsches Textarchiv dictionary into various applications
OCR Groundtruth ULB VD18 - OCR-D Phase III
OCR Groundtruth ULB VD18 Latin - OCR-D Phase III
OCR Groundtruth ULB VD18 German Fraktur - OCR-D Phase III
A cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations