Starred repositories
Research artefact for the paper ‘“Works on My Machine”: A Case Study of Replicability Challenges in Computational Humanities Research’ at CHR 2025
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
Tutorial on using the Datasette instance provided by IDM 4
Web app to upload and display multiple PageXML files
Rust bindings for the C++ API of PyTorch.
OCR-D processor for the party text recognizer
OCR-D wrapper for yolo based on the ocrd_detectron2 wrapper
Event-driven networking engine written in Python.
Coordinates of manually annotated job ads with a link to ANNO Corpus.
Create web-based user interfaces with Python. The nice way.
This repository contains code to read, process, and integrate data from inventory cards.
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Page-wise text recognition with lower-supervision line data models
OCR confidence analysis script written in Python
XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml
Templating Kubernetes resources with *real* code
A tool to automatically convert old string literal formatting to f-strings
Contextual HookFormer for Glacier Calving Front Segmentation (DOI: 10.1109/TGRS.2024.3368215)
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
This script processes PAGE XML files, a format widely used in document layout analysis, to perform various operations like validating, repairing, extending, and modifying text regions and lines.