Skip to content
View lieding's full-sized avatar

Block or report lieding

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in C++
Clear filter

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 9,095 592 Updated Dec 17, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,485 460 Updated Aug 2, 2025

The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.

C++ 4,266 405 Updated Dec 16, 2025

Fast inference engine for Transformer models

C++ 4,193 434 Updated Dec 5, 2025

Real-time image filter engine based on GPU

C++ 2,067 298 Updated Dec 17, 2025

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model

C++ 1,557 123 Updated Mar 23, 2025

Fast Multimodal LLM on Mobile Devices

C++ 1,278 156 Updated Dec 13, 2025

Low-bit LLM inference on CPU/NPU with lookup table

C++ 902 74 Updated Jun 5, 2025

High accurate text detection (OCR) Javascript/Typescript library that runs on Node.js, Browser, React Native and C++. Based on PaddleOCR and ONNX runtime

C++ 141 29 Updated Oct 17, 2025

LLM101n: Let's build a Storyteller 中文版

C++ 136 14 Updated Aug 15, 2024