- India
Stars
Unofficial description of the CUDA assembly (SASS) instruction sets.
Open-source LLM/VLM load balancer and serving platform for self-hosting LLMs (and VLMs) at scale 🏓🦙 Alternative to projects like llm-d, Docker Model Runner, etc but with less moving parts and simpl…
lifey / mjprof
Forked from AdoptOpenJDK/mjprofA monadic java profiler
Linux running inside a PDF file via a RISC-V emulator
A lightweight process isolation tool that utilizes Linux namespaces, cgroups, rlimits and seccomp-bpf syscall filters, leveraging the Kafel BPF language for enhanced security.
Simulate cache behavior with this CacheSimulator, exploring cache policies, performance, and related concepts.
Your personal AI assistant at all-in 888KiB (~35KB in app code). Running on an ESP32. GPIO, cron, custom tools, memory, and more.
Scriptable database and system performance benchmark
JStall is a small command-line tool for one-shot inspection of running JVMs using thread dumps and short, on-demand profiling.
Library for arbitrary precision arithmetic and computation
Minimal plugin that lets Claude Code call you on the phone.
A collection of AI Agents papers (Updated biweekly)
Advanced⚡ Emoji Picker😀 for Linux🐧, Windows🪟 and macOS🍎
Source code for the X Recommendation Algorithm
A platform independent tensor library with autograd for the JVM
A lightweight HTTP reverse proxy that routes requests to multiple Ollama servers. It includes features like rate limiting, API key validation, security filtering, metrics collection, and hot-reload…
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?