Lists (1)
Sort Name ascending (A-Z)
Stars
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
OpenAI-compatible API middleware for n8n workflows. Use your n8n agents and workflows as OpenAI models in any OpenAI-compatible client.
An example Python FastAPI proxy to call OpenAI
Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.
Community maintained hardware plugin for vLLM on Apple Silicon
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …
Breakthrough Method for Agile Ai Driven Development
KT GiGA 인터넷의 일일 150GB QoS제한 우회 팁
This repo powers my experiment where ChatGPT manages a real-money micro-cap stock portfolio.
Claude Code with any LLM provider (OpenRouter, Gemini, Kimi K2)
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…
misanthropic-ai / mlx_parallm
Forked from willccbb/mlx_parallmFast parallel LLM inference for MLX
Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models wi…
Claraverse is a opesource privacy focused ecosystem to replace ChatGPT, Claude, N8N, ImageGen with your own hosted llm, keys and compute. With desktop, IOS, Android Apps.
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
A secure reverse proxy for Ollama with OpenAI-compatible API key authentication.
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…
Simple go utility to download HuggingFace Models and Datasets
An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API
An OpenAI API compatible LLM inference server based on ExLlamaV2.
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.