Lists (1)
Sort Name ascending (A-Z)
Stars
3
stars
written in C
Clear filter
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
LLM inference with 7x longer context. Pure C, zero dependencies. Lossless KV cache compression + single-header library.
Turbo1Bit: Combining 1-bit LLM weights (Bonsai) with TurboQuant KV cache compression for maximum inference efficiency. 4.2x KV cache compression + 16x weight compression = ~10x total memory reduction.