-
Huygens Institute
- Amsterdam, the Netherlands
- http://marijnkoolen.com/
- @marijnkoolen
Stars
Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
knaw-huc / loghi
Forked from rvankoert/loghiLoghi is a comprehensive toolkit designed for Handwritten Text Recognition (HTR) and Optical Character Recognition (OCR), offering an accessible approach to transcribing historical documents and tr…
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Statistical Rethinking course and book package
Python module that makes working with XML feel like you are working with JSON
Lightning fast, spec-compatible, streaming RDF for JavaScript
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to t…
Fully featured framework for fast, easy and documented API development with Flask
React + Vue Search UI for Elasticsearch & Opensearch. Compatible with Algolia's Instantsearch and Autocomplete components.