High-performance toolkit for querying linguistic dependency parses
Updated Jan 21, 2026 - Rust
A minimal, pure Python library to interface with CoNLL-U format files.
A Python toolkit for working with CoNLL-U files, Universal Dependencies treebanks, and annotated corpora.
spaCy-based CLI for web linguistic analysis with embeddings, sentiment, POS/NER, and Unix pipeline composability. Outputs JSON, Parquet, CoNLL-U for ML workflows.
"Galahad". Goal: enable linguists to experiment with different taggers and use the result in other INT products
A minimal, pure Python interface that turns CoNLL-U format files into a Hugging Face Dataset
BERT Fine-Tuning for Part-of-Speech (POS) Tagging (PyTorch & Hugging Face).
A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
Data velds encapsulating statistics on conllu data.
Demo training data for the CLSInfra training school 2024.
NER tagging with HMM and Viterbi algorithm - University Project
A Python3 package for extracting syntactic complexity measures from CoNLL-U annotations.
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XML and CoNLL-U files, linguistic processing with the Stanza pipeline, machine translation, and word alignment with the Eflomal tool.
Count bigram frequencies in a CoNLL-U format corpus
Exploring and visualizing CoNLL-U files in Python
A tool for validating English CoNLL-U data files.
A package for manipulating Universal Dependencies trees
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"