Skip to content
@TM4VETR

TM4VETR

Text Mining 4 VET Research

This is a preliminary collection of Text Mining steps useful for VET research.

Every part uses stout and stdin to forward data to the next step, use command line arguments to pass additional parameters. Only Readers, as first step of a pipeline, and Writer, as last step of the pipeline, differ.

Attention: Some arguments are currently missing, for example defining typesystem.

Example pipeline:

#!/bin/bash

python3 GenericTextReader/reader.py --file testinput |  python3 SpacyNER/spacyner.py | python3 NECSVWriter/necsvwriter.py -o out.csv

See https://github.com/TM4VETR/ExamplePipelines.

Pinned Loading

  1. SpacyNER SpacyNER Public

    Python

  2. GenericTextReader GenericTextReader Public

    Python

  3. NECSVWriter NECSVWriter Public

    Python

  4. ExamplePipelines ExamplePipelines Public

    Some example Pipelines

    Shell

Repositories

Showing 10 of 16 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…