Starred repositories
GOCR - State-of-the-Art OCR foundation for German documents. Fast, CPU, no GPU.
The AI coding agent that runs on stolen Chipotle compute 🌯 Fork of OpenCode with Pepper AI as default model. Community project to add providers from Home Depot, Lowes, Target, Starbucks & more.
TextBite: A Historical Czech Document Dataset for Logical Page Segmentation
mittagessen / dfine_kraken
Forked from ArgoHA/D-FINE-seg
D-FINE for document region segmentation
Image Annotation Tool and Image Search
This repository contains a segmentation model for historical and modern prints.
Research artefact for the paper ‘“Works on My Machine”: A Case Study of Replicability Challenges in Computational Humanities Research’ at CHR 2025
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
Tutorial for using the Datasette instance provided by IDM 4
Web app to upload and display multiple PageXML files
Rust bindings for the C++ api of PyTorch.
OCR-D processor for the party text recognizer
OCR-D wrapper for YOLO based on the ocrd_detectron2 wrapper
Event-driven networking engine written in Python.
Coordinates of manually annotated job ads with a link to ANNO Corpus.
Create web-based user interfaces with Python. The nice way.
This repository contains code to read, process, and integrate data from inventory cards.
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Page-wise text recognition with lower-supervision line data models
OCR confidence analysis script written in Python
XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml