Skip to content
View aminya's full-sized avatar

Sponsors

@keygen-sh

Block or report aminya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The "Missing GitHub Status Page" -- a Flat Data attempt at historically documenting GitHub statuses

HTML 477 21 Updated Jun 13, 2026

Measuring frontier coding agents on original, long-horizon engineering tasks

Shell 788 40 Updated Jun 5, 2026
Python 293 7 Updated May 27, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,425 155 Updated Jun 13, 2026

How much experts do we need to serve a model?

Python 150 15 Updated Mar 18, 2026

Fast, lossless LLM inference via dual-view diffusion decoding.

Python 422 17 Updated May 18, 2026

DFlash & TurboQuant in llama.cpp with up to 3x faster generation and 7.5x more KV cache in same VRAM

C++ 637 32 Updated Jun 13, 2026

Monoscope lets you ingest and explore your logs, traces and metrics. We store these in S3 compatible buckets. Query in natural language via LLMs.

Haskell 1,094 45 Updated Jun 12, 2026

llama.cpp fork with TurboQuant WHT-rotated KV cache & weight compression + Gemma 4 MTP and Qwen 3.6 NextN speculative decoding (+30-50% throughput).

C++ 261 35 Updated Jun 10, 2026

Hierarchal Agent Loop Optimizer

TypeScript 845 63 Updated Jun 13, 2026

A filesystem designed for agents, with SOTA retrieval, automatic memory profiles, sync engine. Drop any file type (pdf, images, videos), and grep through them.

Rust 426 31 Updated Jun 12, 2026

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 10,443 1,104 Updated Jun 13, 2026

🐚bash/POSIX-compatible shell implemented in Rust 🦀

Rust 2,042 101 Updated Jun 13, 2026

Specification and Tools for Makefile-formatted Agent Skills.

Shell 102 3 Updated May 4, 2026

Reverse proxy for monitoring and debugging local LLM agents (Ollama). Real-time dashboard, request logging, and performance metrics in a single binary

C++ 7 Updated Mar 11, 2026

Warp is an agentic development environment, born out of the terminal.

Rust 61,671 4,995 Updated Jun 13, 2026

Dataset of hackable TerminalBench-style tasks and exploit trajectories

Python 27 1 Updated Apr 18, 2026

Structured Chain-of-Thought

Python 218 15 Updated May 16, 2026

VS Code extension that follows file changes in real-time, automatically opening editors and scrolling to edits. Perfect for watching CLI-based coding agents work.

TypeScript 17 4 Updated Feb 8, 2026

FlashKDA: high-performance Kimi Delta Attention kernels

Cuda 448 38 Updated May 26, 2026

👩‍🚒 Good content deserves good paper.

HTML 8,002 385 Updated Jun 13, 2026

A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.

Python 13,811 3,115 Updated May 23, 2026

Domain-fronted HTTP/SOCKS5 proxy tunneling traffic through Google Apps Script with MITM TLS interception, HTTP/1-2 multiplexing, and DPI evasion.

Python 3,864 447 Updated Jun 9, 2026

Instant, Concurrent, Secure & Lightweight Sandbox for AI Agents.

Rust 6,324 499 Updated Jun 13, 2026

Library for reducing tail latency in RAM reads

C++ 2,688 153 Updated Apr 11, 2026

Give Claude Code a memory that evolves with your codebase. Hooks automatically capture sessions, the Claude Agent SDK extracts key decisions and lessons, and an LLM compiler organizes everything in…

Python 1,153 299 Updated Apr 6, 2026
Python 607 59 Updated May 21, 2026

Codebase for a Marimba playing robot

Python 16 Updated Nov 6, 2024

A set of tool to use the Clarity-OMR Model.

Python 33 6 Updated Mar 20, 2026

Fast LLM speculative inference server for consumer hardware.

C++ 2,430 223 Updated Jun 13, 2026
Next