Starred repositories
The fastest BM25 scoring engine: 2,300x faster than BM25S. 28K QPS on 8.8M docs. 5 BM25 variants (Robertson, Lucene, ATIRE, BM25L, BM25+). Memory-mapped persistence, BMW pruning, streaming indexing…
Sparton: Fast and Memory-Efficient Triton Kernel for Learned Sparse Retrieval
Async-friendly WebTransport implementation in Rust
Opencli-rs is a blazing fast, memory-safe command-line tool that fetches information from any website with a single command. Covers Twitter/X, Reddit, YouTube, HackerNews, Bilibili, Zhihu, Xiaohongshu, …
KV cache store for distributed LLM inference
InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models
🍡 50x faster tokenization for every HuggingFace model
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
FastAPI-compatible Python framework with Zig HTTP core; 7x faster, free-threading native
Per-collection OCR leaderboards using VLM-as-judge
Get clean data from tricky documents, powered by vision-language models ⚡
A high-quality PDF to Markdown tool based on large language model visual recognition
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
An efficient implementation of the NSA (Native Sparse Attention) kernel
Bayesian probability transforms for BM25 retrieval scores
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
Accelerating MoE with IO and Tile-aware Optimizations
A high-performance and light-weight router for vLLM large scale deployment
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Code for paper: [ICLR 2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
[NeurIPS 2025] Official Implementation of ViSpec: Accelerating Vision-Language Models with Vision-Aware Speculative Decoding.