Use your locally running AI models to assist you in your web browsing
Reliable model swapping for any local OpenAI/Anthropic-compatible server (llama.cpp, vLLM, etc.)
A generalized information-seeking agent system with Large Language Models (LLMs).
Steno is the AI-powered intelligence layer for all your confidential conversations. Capture beautiful notes whilst keeping your data confidential. Perfect for government, defence, legal, and CXOs.
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
A flexible, AI-powered C2 framework built with operators in mind
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
Fenix AI trading bot with LangGraph, Ollama, and multiple providers
Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.
A CLI, a web UI, and an MCP server for the Z-Image-Turbo text-to-image generation model (the Tongyi-MAI/Z-Image-Turbo base model as well as quantized models)
The PyVisionAI Official Repo
MVP of an idea using multiple local LLMs to simulate and play D&D
Open-source AI IDE powered by local & cloud LLMs. A privacy-first alternative to Cursor.
Run multiple resource-heavy Large Models (LMs) on the same machine with a limited amount of VRAM and other resources by exposing them on different ports and loading/unloading them on demand
Vesta macOS Distribution - Official releases and downloads. Vesta AI Chat Assistant for macOS - Built with SwiftUI, Swift MLX, and Apple Intelligence using Apple's on-device model on macOS Tahoe (macOS 26). Now with side-by-side Qwen3-VL for vision