Skip to content
View tacibey's full-sized avatar

Block or report tacibey

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
3 stars written in C
Clear filter

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 3,904 313 Updated Apr 8, 2026

LLM inference with 7x longer context. Pure C, zero dependencies. Lossless KV cache compression + single-header library.

C 345 39 Updated Apr 8, 2026

Turbo1Bit: Combining 1-bit LLM weights (Bonsai) with TurboQuant KV cache compression for maximum inference efficiency. 4.2x KV cache compression + 16x weight compression = ~10x total memory reduction.

C 21 2 Updated Apr 2, 2026