- Berlin
- in/romangrebennikov
Stars
Sub-millisecond cache for ML/AI workloads. Parquets in, Arrow-Flight out.
allRank is a framework for training learning-to-rank neural models based on PyTorch.
🦀 Rust crate that allows creating weighted prefix trees that can be used in autocomplete
JTokkit is a Java tokenizer library designed for use with OpenAI models.
NVIDIA Linux open GPU with P2P support
pytest fixture for benchmarking code
Buttplug.io Model Context Protocol (MCP) Server
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Pure-Python Server-Sent Events (SSE) client
Full text search that feels like a numpy array
Mirror of the official repository repo.or.cz/wmaker-crm.git. Do not send pull requests here; send your patches to wmaker-dev@googlegroups.com instead.
At-a-glance overview diagrams of Apache Lucene's default PostingsFormat (inverted index binary format).
PDF exporter for HTML presentations
Experimental code for our paper on informative and diverse sampling of negative examples for dense retrieval
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools