A minimal, pure Python library to interface with CoNLL-U format files.
-
Updated
Dec 5, 2025 - Python
A minimal, pure Python library to interface with CoNLL-U format files.
End-to-end integration of HuggingFace's models for sequence labeling.
A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
Simple script to parse text with spaCy and print the output in CoNLL-U format.
A Python3 package for extracting syntactic complexity measures from CoNLL-U annotations.
Toolkit that simplifies corpus processing
A Python toolkit for working with CoNLL-U files, Universal Dependencies treebanks, and annotated corpora.
Tool for translating a corpus file from one language to another.
Count Bigram frequency in a conllu format corpus
BERT Fine-Tuning for Part-of-Speech (POS) Tagging (PyTorch & Hugging Face).
A minimal, pure Python interface that turns CoNLL-U format files into A huggingFace Dataset
A tool for validating English CoNLL-U data files.
GitHub repository for Arc-Eager Transition-Based Parser
spaCy-based CLI for web linguistic analysis with embeddings, sentiment, POS/NER, and Unix pipeline composability. Outputs JSON, Parquet, CoNLL-U for ML workflows.
Add a description, image, and links to the conllu topic page so that developers can more easily learn about it.
To associate your repository with the conllu topic, visit your repo's landing page and select "manage topics."