Releases: stickeritis/sticker
0.11.1
0.11.0
0.10.0: Revert "Generate Rust files from protobuf files on each build"
This reverts commit f64a16af7ef2b77fa69019486720861fd42e38a4.
Add support for pipelines
- Add
Pipelinedata structure tosticker-utilsthat allows loading of several models, that are then applied in sequence. - Update
sticker-tagandsticker-serverto support annotation pipelines. These commands now accept multiple model configuration files as arguments and will apply these models in sequence when tagging. - Add the
--inputand--outputoptions tosticker-tag. Since we cannot disambiguate between model configuration arguments and input/output corpora, the input and output files are now specified through these two flags. If these flags are not used,stdin/stdoutwill be used as before. - The address/port to bind to is now specified using the
addroption ofsticker-server. By default,sticker-serverwill bind tolocalhost:4000.
Updated CoNLL-X dependency
Update the CoNLL-X dependency to version 0.12. Unfortunately, this requires bumping the version, since this is an API-breaking change.
Reduce model load time with quantized embeddings
This release contains one large change: the loading of quantized models is speeded up by computing the unknown word embedding as an avarage of the subquantizers, rather than an average of all in-vocab word embeddings.
0.6.1
Subword representations using byte RNNs
This release adds support for word representations using byte RNNs. These word representations can be enabled by simply adding
[input]
subwords = true
For consistency, the embeddings.word and embeddings.tag options have also been moved to the input section. Hopefully, this is one of the last changes to the configuration file format.
Various options have been added to the graph writing scripts to control the hyper parameters of byte RNNs.
0.5.1
TensorBoard summaries, sticker-pretrain, support for Tensorflow 1.14
Features
- Add support for writing TensorBoard summaries.
- Split out pretraining support into a separate
sticker-pretrainutility. - Add the
--residualoption for RNN residual connections. - Add the
--maxlenoption tosticker-{pretrain,train}for discarding long sentences. - Write the model hyperparameters to the graph and stdout. Provide the
sticker-graph-metadatautility to print the hyperparameters. - Add the
gpu_allow_growthoption to allocate video RAM on a by-need basis.
Maintenance/bugfixes
- Restore the
hidden_sizeoption for dilated convolutions. - Remove the
DefaultConfigclass fromsticker-graph. - Support Tensorflow 1.14.
- Fix CRF support.
- Factor out tagger construction boilerplate to
TaggerWrapper.
Thanks to @twuebi for the many contributions to this release!