Stars
collections of useful things to discuss/teach in our team
A Python programme to convert plays from minimally annotated tabular TXT files to XM-TEI.
Syllabification and stress detection for Spanish
Choralhandschriften und Drucke der Zentralbibliothek der Wiener Franziskanerprovinz Graz
Cookiecutter template to export (and process) TEI/XML from Transkribus
A python package providing some utility functions for interacting with the Transkribus-API
Spanish data from the AnCora corpus.
iuliandita / keychron
Forked from kurgol/keychronCommunity-maintained guides for using Keychron keyboards on Linux.
Corpus of Spanish Golden-Age Sonnets (with metrical annotation) / Corpus de Sonetos del Siglo de Oro (con anotación métrica)
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
Keywords: lexical diversity MTLD HDD vocabulary type token python