- llama.cpp Public
  Forked from ggml-org/llama.cpp
  LLM inference in C/C++
  C++, MIT License, Updated Jul 16, 2025
- vllm Public
  Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python, Apache License 2.0, Updated May 2, 2025
- text-generation-inference Public
  Forked from huggingface/text-generation-inference
  Large Language Model Text Generation Inference
  Python, Apache License 2.0, Updated Feb 14, 2025
- servers Public
  Forked from modelcontextprotocol/servers
  Model Context Protocol Servers
  JavaScript, MIT License, Updated Nov 30, 2024