numb3r3

Organizations

@jina-ai

Starred repositories

Document parsing and knowledge extraction tools

C++ 2 1 Updated Mar 27, 2026

The fastest BM25 scoring engine: 2,300x faster than BM25S. 28K QPS on 8.8M docs. 5 BM25 variants (Robertson, Lucene, ATIRE, BM25L, BM25+). Memory-mapped persistence, BMW pruning, streaming indexing…

Rust 41 2 Updated Mar 23, 2026

Sparton: Fast and Memory-Efficient Triton Kernel for Learned Sparse Retrieval

Python 9 1 Updated Mar 26, 2026

Async-friendly WebTransport implementation in Rust

Rust 650 49 Updated Mar 17, 2026

Opencli-rs is a blazing-fast, memory-safe command-line tool that fetches information from any website with a single command. Covers Twitter/X, Reddit, YouTube, HackerNews, Bilibili, Zhihu, Xiaohongshu, …

Rust 953 70 Updated Mar 29, 2026

KV cache store for distributed LLM inference

C++ 401 34 Updated Nov 13, 2025

InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models

Python 101 4 Updated Feb 2, 2026

🍡 50x faster tokenization for every HuggingFace model

Rust 22 Updated Mar 29, 2026

Open benchmarks for evaluating search APIs

Python 83 15 Updated Mar 23, 2026

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 308 14 Updated Jun 2, 2025

FastAPI-compatible Python framework with Zig HTTP core; 7x faster, free-threading native

Python 837 26 Updated Mar 28, 2026

Per-collection OCR leaderboards using VLM-as-judge

Python 56 2 Updated Mar 23, 2026
Python 42 Updated Oct 20, 2025
Python 4 1 Updated Oct 6, 2025

LoRA Anchor KL scripts.

Python 3 Updated Mar 14, 2026

Get clean data from tricky documents, powered by vision-language models ⚡

Python 1,524 98 Updated Mar 25, 2026

A high-quality PDF to Markdown tool based on large language model visual recognition.

Python 1,677 129 Updated Jan 25, 2026

Sparse Embedding Compression for Scalable Retrieval in Recommender Systems

Python 35 2 Updated Nov 21, 2025
Python 3 Updated Jan 29, 2026

An efficient implementation of the NSA (Native Sparse Attention) kernel

Python 132 5 Updated Jun 24, 2025

Bayesian probability transforms for BM25 retrieval scores

Python 65 1 Updated Mar 28, 2026

[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation

Python 1,612 161 Updated Feb 27, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 614 67 Updated Mar 27, 2026

Pure Rust + CUDA LLM inference engine

Rust 257 24 Updated Mar 28, 2026

A high-performance and light-weight router for vLLM large scale deployment

Rust 166 58 Updated Mar 25, 2026

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,774 167 Updated Mar 29, 2026

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

Python 167 9 Updated Oct 13, 2025

[NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.

Python 51 Updated Jan 28, 2026