Skip to content
View karmakaze's full-sized avatar
💭
Improvising
💭
Improvising

Block or report karmakaze

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AI gateway written in Go. Lightweight unified OpenAI-compatible API for OpenAI, Anthropic, Gemini, Groq, xAI & Ollama. LiteLLM alternative with observability, guardrails, streaming, costs and usage…

Go 953 66 Updated Jun 18, 2026

⚠️ Archived — BrainDrive is building a new system on personalaiarchitecture.org

Python 35 10 Updated Mar 8, 2026

llama.cpp fork with additional SOTA quants and improved performance

C++ 2,751 356 Updated Jun 18, 2026

IronClaw is an Agent OS focused on privacy, security and extensibility

Rust 12,456 1,457 Updated Jun 18, 2026

LLM inference in C/C++

C++ 6 Updated Jun 2, 2026

A minimalistic C++ Jinja templating engine for LLM chat templates

C++ 217 32 Updated Sep 22, 2025

An application to read and display the presets stored in the Arturia MicroFreak memory.

JavaScript 45 14 Updated Jan 4, 2023

An application to read and display the presets stored in the Arturia MicroFreak memory.

JavaScript 4 Updated Jun 7, 2025
TypeScript 15,391 1,199 Updated Jun 18, 2026

Simple internal event bus for Go applications

Go 1 Updated Jul 16, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,249 18,196 Updated Jun 18, 2026

Unified management and routing for llama.cpp, MLX and vLLM models with web dashboard.

Go 133 21 Updated Jun 16, 2026

The reproducible runtime for local agents (MCP).

Go 29 5 Updated Dec 25, 2025

NOW MANAGED ON CODEBERG

PHP 18,838 2,394 Updated Jun 18, 2026

trying to figure out this jujutsu thing

449 65 Updated Feb 23, 2026

Whipping all the Llamas and other Lamini asses into shape.

Go 8 1 Updated Dec 20, 2025

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

Go 46,960 4,148 Updated Jun 18, 2026

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

Go 4,658 350 Updated Jun 18, 2026

An OpenAI Compatible Proxy Server for Streaming Chat Completion

Go 7 1 Updated Nov 6, 2023

Nornicdb is a distributed low-latency, Graph+Vector, Temporal MVCC with all sub-ms HNSW search, graph traversal, and writes. Using Neo4j Bolt/Cypher and qdrant's gRPC means you can switch with no c…

Go 778 44 Updated Jun 16, 2026

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

JavaScript 61,773 6,740 Updated Jun 18, 2026

LLM inference in C/C++

C++ 117,141 19,696 Updated Jun 18, 2026

Like Vercel, but open source and for all languages.

Python 4,680 179 Updated Mar 3, 2026

An open-source coding agent for the Grok API

TypeScript 3,155 393 Updated Jun 17, 2026

JavaScript client SDK for communicating with OAuth 2.0 and OpenID Connect providers.

TypeScript 1,015 166 Updated Apr 22, 2024

Offline Hacker News Reader written in Dart/Flutter

Dart 1 Updated Jun 16, 2018

Composable / async / functional / type-safe / parallel-pipelined queries and relations without SQL injection or N+1s.

Java 1 Updated Jul 1, 2020

Composable / async / functional / type-safe / parallel-pipelined queries and relations without SQL injection or N+1s.

Java 1 Updated May 14, 2019

Ruby gem to rescue from MySQL, PostgreSQL and Sqlite duplicate errors

Ruby 91 11 Updated Feb 9, 2026
Next