Branches of https://victorio.uit.no/langtech/trunk/tools/CorpusTools used by Giellatekno.UiT.no for corpus gathering.
Estonian TimeML Annotated Corpus \ Eesti keele TimeML märgendatud korpus
Python API for extracting data from the MPQA corpus
Forpus is a Python library for processing plain text corpora to various corpus formats.
An unofficial Python API that lets users build a corpus of lyrics from their favorite artists and the Billboard charts
An open source reimplementation of Benny Brodda's BETA in Python
A concordancing program for English with a GUI that can read .docx, .srt, and plaintext files and export concordance lines to .txt, .docx, .tsv, .xlsx, and .html.
Utilities for Processing the Saarbrücken Corpus of Spoken English
Utilities for Processing the bAbI Tasks Corpus
Utilities for Processing the Dialogue State Tracking Challenge 3 Corpus
Utilities for Processing the FRAMES Corpus
Python library for using the Korp API
Utilities for Processing the HCRC Map Task Corpus
Utilities for Processing the Meeting Recorder Dialogue Act Corpus
Utilities for Processing the BT Oasis Corpus
Utilities for Processing the Switchboard Dialogue Act Corpus
📚 Icelandic Corpora Toolkit - A collection of scripts to use with various Icelandic text corpora
Vietnamese corpus search tools and statistical analysis
The AP Exam Corpus Project is a Python application that generates corpora for AP exams.