Skip to content
View jundot's full-sized avatar

Block or report jundot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 44 7 Updated Feb 16, 2026

OpenAI-compatible API middleware for n8n workflows. Use your n8n agents and workflows as OpenAI models in any OpenAI-compatible client.

JavaScript 95 17 Updated Jan 29, 2026

An example Python FastAPI proxy to call OpenAI

Python 1 Updated Jul 22, 2024

Complete Claude Code configuration collection - agents, skills, hooks, commands, rules, MCPs. Battle-tested configs from an Anthropic hackathon winner.

JavaScript 46,784 5,792 Updated Feb 14, 2026

Community maintained hardware plugin for vLLM on Apple Silicon

Python 457 45 Updated Feb 15, 2026
TypeScript 32 7 Updated Jan 7, 2026

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX …

Python 372 46 Updated Feb 15, 2026

Breakthrough Method for Agile Ai Driven Development

JavaScript 35,759 4,476 Updated Feb 15, 2026

iClipboard 一个小巧精致的 macOS 剪贴板管理工具

Swift 15 Updated Dec 20, 2025
Python 13 2 Updated Jan 10, 2026

KT GiGA 인터넷의 일일 150GB QoS제한 우회 팁

Shell 241 36 Updated Feb 5, 2026

This repo powers my experiment where ChatGPT manages a real-money micro-cap stock portfolio.

Python 7,393 1,575 Updated Jan 25, 2026

Claude Code with any LLM provider (OpenRouter, Gemini, Kimi K2)

Go 65 12 Updated Aug 16, 2025

An extremely fast implementation of whisper optimized for Apple Silicon using MLX.

Python 5 Updated Feb 20, 2025

A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.

Python 100 8 Updated Jun 29, 2025

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…

Python 219 39 Updated Feb 13, 2026

Fast parallel LLM inference for MLX

Python 2 Updated Feb 6, 2026

Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and <think> tag filtering. Perfect for using advanced models wi…

Python 50 5 Updated May 19, 2025

Claraverse is a opesource privacy focused ecosystem to replace ChatGPT, Claude, N8N, ImageGen with your own hosted llm, keys and compute. With desktop, IOS, Android Apps.

Go 3,713 411 Updated Jan 27, 2026

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

Python 5,953 439 Updated Feb 15, 2026

Fast, flexible LLM inference

Rust 6,584 527 Updated Feb 15, 2026

A secure reverse proxy for Ollama with OpenAI-compatible API key authentication.

JavaScript 1 Updated Apr 15, 2025

MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…

Python 662 79 Updated Dec 21, 2025

Simple go utility to download HuggingFace Models and Datasets

Go 892 103 Updated Jan 30, 2026

An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API

Python 18 1 Updated Aug 21, 2025

A fast batching API to serve LLM models

Python 189 13 Updated Apr 26, 2024

An OpenAI API compatible LLM inference server based on ExLlamaV2.

Python 25 Updated Feb 9, 2024

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.

TypeScript 3,623 219 Updated Aug 7, 2025