A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

Python 1,456 140 Updated Jun 15, 2026

russellromney / honker

SQLite extension + bindings for Postgres NOTIFY/LISTEN semantics with durable queues, streams, pub/sub, and scheduler

Python 2,849 69 Updated Jun 15, 2026

coleifer / huey

a little task queue for python

Python 5,977 395 Updated Jun 13, 2026

dr-Akari / agentic-search-context-1

Agentic search with ChromaDB and Context 1 model

3 1 Updated Mar 29, 2026

saagarjha / nvidia-sass-tools

Vibe-coded utilities for working with Nvidia's internal SASS architecture specification

Python 2 Updated Jan 30, 2026

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,561 260 Updated Jun 15, 2026

zby / commonplace

The theory of LLM wikis, running as one. A framework for agent-operated knowledge: typed, linked, review-gated markdown your agents execute.

Python 69 9 Updated Jun 15, 2026

jnuyens / gsd-plugin

Performance-optimized plugin packaging of GSD (Get Shit Done) for Claude Code. Based on open-gsd/get-shit-done-redux

TypeScript 71 7 Updated Jun 14, 2026

github / awesome-copilot

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 35,076 4,320 Updated Jun 15, 2026

Luce-Org / lucebox-hub

Fast LLM speculative inference server for consumer hardware.

C++ 2,513 230 Updated Jun 15, 2026

neurosnap / zmx

Session attach/detach for the terminal

Zig 1,608 92 Updated Jun 13, 2026

midea-ai / SemaClaw

SemaClaw is an open-source framework for general-purpose personal AI agents.

TypeScript 67 12 Updated Jun 15, 2026

DLYuanGod / MegaTrain

Python 608 60 Updated May 21, 2026

JuliusBrussee / caveman

🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman

JavaScript 73,009 4,120 Updated Jun 12, 2026

gsd-build / get-shit-done

A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.

JavaScript 64,248 5,465 Updated May 31, 2026

NangoHQ / nango

Build product integrations with AI.

TypeScript 10,562 1,123 Updated Jun 15, 2026

SharpAI / SwiftLM

⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, MACOS + iOS iPhone app.

Swift 689 39 Updated May 19, 2026

openyak / openyak

Open-source local-first AI agent for desktop work. No account, no telemetry: use local models with Ollama/Rapid-MLX or bring your own provider key.

Python 695 59 Updated Jun 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kedarkolluri

Block or report kedarkolluri

Lists (1)

✨ Inspiration

Stars

mohan-n-swamy / claude-clean-context

johnsk95 / latent_agents

utibeabasi6 / mercek

pewdiepie-archdaemon / odysseus

lightseekorg / tokenspeed

worldbench / awesome-ai-auto-research

poolsideai / pool

tilde-research / aurora-release

kitft / natural_language_autoencoders

addyosmani / agent-skills

alexellis / k3sup

gabewillen / atmux

intel / auto-round