A template for starting a new allennlp project using config files and allennlp train. For simple
projects, all you need to do is get your model code into the class in my_project/model.py, your
data loading code into the DatasetReader code in my_project/dataset_reader.py, and that's it,
you can train a model (we recommend also making appropriate changes to the test code, and using that
for development, but that's optional).
See the AllenNLP Guide for a quick start on how to use what's in this example project. We're grabbing the model and dataset reader classes from that guide. You can replace those classes with a model and dataset reader for whatever you want (including copying code from our model library as a starting point). The very brief version of what's in here:
-
A
Modelclass inmy_project/model.py. -
A
DatasetReaderclass inmy_project/dataset_reader.py. -
Tests for both of these classes in
tests, including a small toy dataset that can be read. We strongly recommend that you use a toy dataset with tests like this during model development, for quick debug cycles. To run the tests, just runpytestfrom the base directory of this project. -
An
.allennlp_pluginsfile, which makes it easier to useallennlp trainwith this project. If you change the name ofmy_project, you should also change the line in this file to match. -
An example configuration file for training the model, which you would use with the following shell command from the base directory of this repository, after
pip install allennlp:allennlp train -s /dir/to/save/results training_config/my_model_trained_on_my_dataset.json