5 releases (1 stable)
| Version | Released |
|---|---|
| 1.0.0 | Jun 23, 2020 |
| 0.3.0 | Jul 28, 2019 |
| 0.2.1 | Jul 10, 2019 |
| 0.2.0 | Jul 7, 2019 |
| 0.1.0 | Jul 6, 2019 |
#103 in #tokenizer
1,976 downloads per month
5MB, 66K SLoC
BlingFire in Rust
blingfire is a thin Rust wrapper for the BlingFire tokenization library.
To get started, add the library to your Cargo.toml:
cargo add blingfire
The library exposes two functions, text_to_words and text_to_sentences:
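use blingfire;

fn main() {
    // Output buffer that both calls write into.
    let mut parsed = String::new();

    // Tokens come back as one string, separated by single spaces.
    blingfire::text_to_words("Cat,sat on the mat.", &mut parsed).unwrap();
    assert_eq!(parsed.as_str(), "Cat , sat on the mat .");

    // Sentences come back separated by newlines; the previous contents are replaced.
    blingfire::text_to_sentences("Cat sat. Dog barked.", &mut parsed).unwrap();
    assert_eq!(parsed.as_str(), "Cat sat.\nDog barked.");
}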
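Both functions return a single delimited string, so callers often want to split it into separate items. The following is a minimal sketch of one way to do that using only the standard library. It assumes the output format shown in the example above (tokens separated by spaces, sentences separated by newlines). The helper names split_words and split_sentences are hypothetical and are not part of the blingfire API.

```rust
/// Hypothetical helper: splits space-delimited tokenizer output into tokens.
fn split_words(parsed: &str) -> Vec<&str> {
    parsed.split_whitespace().collect()
}

/// Hypothetical helper: splits newline-delimited output into sentences,
/// skipping any empty lines.
fn split_sentences(parsed: &str) -> Vec<&str> {
    parsed.lines().filter(|s| !s.is_empty()).collect()
}

fn main() {
    // Inputs are the expected outputs from the example above, hard-coded
    // so that this sketch runs without the blingfire crate.
    let words = split_words("Cat , sat on the mat .");
    assert_eq!(words, ["Cat", ",", "sat", "on", "the", "mat", "."]);

    let sentences = split_sentences("Cat sat.\nDog barked.");
    assert_eq!(sentences, ["Cat sat.", "Dog barked."]);
}
```

In real use, the string passed to these helpers would be the parsed buffer filled by text_to_words or text_to_sentences. The returned slices borrow from that buffer, so they must be consumed before it is reused for another call.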
The code is licensed under the MIT License.