Generate BERT-compatible text embeddings locally in Rust without Python or external ML runtimes using a simple, efficient library.
-
Updated
Mar 26, 2026 - Rust
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
Generate BERT-compatible text embeddings locally in Rust without Python or external ML runtimes using a simple, efficient library.
🛠️ Manage and sync your coding skills across multiple AI tools with this cross-platform desktop app for streamlined organization and efficiency.
🖥️ Explore CPU-SLM, a Rust-based SLM/LLM project that runs on CPU, offering efficient inference and chat with minimal dependencies.
🚀 Navigate and manage your AWS resources with ease using taws, a terminal UI designed for efficient interaction with your cloud infrastructure.
🌐 Scrape web pages efficiently and analyze content with LLM using this high-performance Rust-based API server, supporting advanced features and robust performance.
A Japanese Grapheme-to-Phoneme (G2P) library.
📄 Ingest documents into structured datasets for LLMs, ensuring numeric integrity and easy export across multiple frameworks with doc2dataset.
🔔 Track live Solana meme coin alerts with quality filters, avoiding scams, and receive updates directly to your Telegram channel for safe trading insights.
⚙️ Build and manage decentralized applications with ease using Foundry, your comprehensive toolkit for smart contract development and deployment.
वर्णविन्यास — Open-source Nepali orthography toolkit based on Nepal Academy standards. Spell checking, sandhi analysis, punctuation diagnostics, and more.
🔍 Explore robust SQL parsing with error tracking and zero-copy techniques in this practical demo that enhances your understanding of parsing concepts.
Orthographic nativization for Filipino loanwords.
Rust implementation of the langid library for language identification. Easily classify text with a simple API. 🌍🔍
盲分词的高性能中文语料词频统计工具:1分钟内统计10亿字语料的2字词!
🚛 ELD Toolkit for WASM frameworks.
Geometric region embeddings (boxes, cones, octagons, Gaussians, hyperbolic intervals, sheaf networks) for subsumption, entailment, and logical query answering
Two-tier hybrid search for Rust: sub-millisecond initial results via potion-128M, quality-refined rankings in 150ms via MiniLM-L6-v2. Combines lexical (Tantivy BM25) and semantic (vector cosine) search with Reciprocal Rank Fusion. Progressive iterator API, f16 SIMD vector index, feature-gated compilation.
A modern, embeddable query engine for corpus linguistics.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Programming library for the Standoff Text Annotation Model (STAM), written in Rust. This is the primary software library for STAM with a focus on performance.
Created by Alan Turing