Starred repositories
Using JUCE to create a 3D spectrogram drawn with OpenGL
A demo of using Hilbert-Huang Transform (HHT) for non-stationary and non-linear signal analysis.
OCR, layout analysis, reading order, table recognition in 90+ languages
NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)
Guayadeque is a music management program designed for all music enthusiasts. It is a full-featured Linux media player that can easily manage large collections and uses the GStreamer media framework.
The RaftLib C++ library, streaming/dataflow concurrency via C++ iostream-like operators
Implementation of the paper Keys to Accurate Feature Extraction Using Residual Spiking Neural Networks
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON do…
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Official implementation of Character Region Awareness for Text Detection (CRAFT)
Layout analysis to find layout elements in documents (similar to P2PaLA)
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
Recurrent neural network for audio noise reduction
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
Implementation of Nougat: Neural Optical Understanding for Academic Documents
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
Generic framework for historical document processing
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
A Unified Toolkit for Deep Learning Based Document Image Analysis