Stars: 5 repositories written in C
Distribute and run LLMs with a single file.
antimatter15 / alpaca.cpp
Forked from ggml-org/llama.cpp
Locally run an Instruction-Tuned Chat-Style LLM
Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️
Dynamic Memory Management for Serving LLMs without PagedAttention
A PostgreSQL extension for collecting statistics about sorts, helping tuning work_mem