💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Updated Oct 16, 2025 (Rust)
Natural language processing (NLP) is a field of computer science that studies how computers process and interpret human language. In 1950, Alan Turing published an article that proposed a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.
Rust-native, ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2, ...)
Natural language detection library for Rust. Try demo online: https://whatlang.org/
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
The Jieba Chinese Word Segmentation Implemented in Rust
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
A fast, low-resource Natural Language Processing and Text Correction library written in Rust.
Checks all your documentation for spelling and grammar mistakes using hunspell and an nlprule-based grammar checker
Rust crate for entity parsing
🎤 vibrato: Viterbi-based accelerated tokenizer
🕷️ The pipeline for the OSCAR corpus
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
Inference engine for GLiNER models, in Rust
Official Rust Implementation of Model2Vec
Created by Alan Turing