Releases: segment-any-text/wtpsplit
Releases · segment-any-text/wtpsplit
Release 2.1.7
- Suppress annoying warnings of upstream dependencies in some Python version
- Add possibility to not merge LoRA weights (still defaults to merging for efficiency reasons)
Full Changelog: 2.1.6...2.1.7
Release 2.1.6
Release 2.1.5
Changelog
Release 2.1.4
- Introduce optional hat weighting by @lsorber
- Clarify LoRA adaptation
- Clarify
treat_newline_as_space: renamed tosplit_on_input_newlines.treat_newline_as_spacewill be deprecated in a future release.
Release 2.1.2
- Fixes #142: AssertionError when string is only comprised of newlines, whitespace, or if its an empty strong.
Release 2.1.1
- Change default behaviour for newlines in SaT.split.
- Now, while the model ignores them, they will used to split as simple post-processing.
- Small bugfixes for LoRA training
- Update Readme for advanced usage
Release 2.1.0
- Adds ONNX support for SaT models.
- Including export scripts and an updated README.
- This results in 50% improved inference time on GPU.
Release 2.0.8
- Fix splitting of short sequences into individual characters (#127)
Release 2.0.7
- Allow numpy>=2.0
- Fix adaptation code
- Add some comments
Release 2.0.5
- Fixes potential CUDA device error when the input has exactly 511 tokens (#121).