Popular repositories
- omlx-llm-runner (Public, forked from jundot/omlx)
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Python
- vllm-llm-runner (Public, forked from vllm-project/vllm)
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
- vllm-metal-llm-runner (Public, forked from vllm-project/vllm-metal)
Community-maintained hardware plugin for vLLM on Apple Silicon
Python
- LlamaBarn-llm-runner (Public, forked from ggml-org/Llama)
A cosy home for your LLMs.
Swift
- Llamacpp-Model-Launcher (Public, forked from Kaspur2012/Llamacpp-Model-Launcher)
Its purpose is to replace the tedious and error-prone process of typing long commands into a terminal. With this launcher, you can manage, edit, delete, duplicate and run all your language models w…
Python
- lemonade-llm-runner (Public, forked from lemonade-sdk/lemonade)
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
C++