abhinand5

Starred repositories

Generate, format and mask agent traces with ease.

Python 52 4 Updated Jun 17, 2026

AI coding agent that edits symbols, not strings. AST surgery, full LSP, and a live code graph wired to memory that resurfaces by file, co-change, and semantics.

TypeScript 798 61 Updated Jun 12, 2026

Personal Pi coding agent setup

TypeScript 226 26 Updated Jun 15, 2026

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 16,322 2,557 Updated Aug 8, 2024

Control panel for vLLM, SGLang, llama.cpp, exllamav3

TypeScript 1,153 91 Updated Jun 12, 2026

A terminal workspace with batteries included

Rust 33,707 1,267 Updated Jun 16, 2026

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

C 14,230 1,243 Updated Jun 16, 2026

Claude / Codex / Gemini API Proxy - CCX

Go 3,623 273 Updated Jun 17, 2026

How many experts do we need to serve a model?

Python 150 15 Updated Mar 18, 2026

The Pi desktop app you want to use.

TypeScript 311 20 Updated Jun 16, 2026

The pretty much "official" DSPy framework for TypeScript

TypeScript 2,774 176 Updated Jun 16, 2026

Learn it. Build it. Ship it for others.

Python 33,769 5,499 Updated Jun 14, 2026

🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models

Python 10,476 1,112 Updated Jun 14, 2026

Linux and PowerShell scripts to easily set up and run the Qwen 3.5 series locally on Windows and Linux with llama.cpp.

PowerShell 89 14 Updated Apr 28, 2026

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 35,039 2,877 Updated Mar 4, 2026

Artifact integrity and drift detection for ML and data pipelines.

Python 2 Updated Apr 25, 2026

Autonomous experiment loop extension for pi

TypeScript 7,034 414 Updated Jun 8, 2026

TypeScript 12,693 2,539 Updated Jun 17, 2026

LLM inference in C/C++

C++ 1,844 321 Updated Jun 17, 2026

See where your AI tokens go. Interactive TUI dashboard for Claude Code, Codex, and Cursor cost observability. npx codeburn

TypeScript 8,045 629 Updated Jun 15, 2026

Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc

Go 4,626 351 Updated Jun 16, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,952 404 Updated Jun 16, 2026

llama.cpp fork with TQ3_1S/4S CUDA kernels — 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs w…

C++ 191 11 Updated Jun 16, 2026

Solve puzzles. Learn CUDA.

Jupyter Notebook 12,236 933 Updated Sep 1, 2024

omo/lazycodex: The coding agent for tokenmaxxers; the one and only agent harness for complex codebases. For your Codex, for your OpenCode

TypeScript 62,494 5,060 Updated Jun 17, 2026

KV cache compression via block-diagonal rotation. Beats TurboQuant: better PPL (6.91 vs 7.07), 28% faster decode, 5.3x faster prefill, 44x fewer params. Drop-in llama.cpp integration.

Python 1,016 87 Updated Apr 23, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 132,859 21,495 Updated Jun 16, 2026

OCR model that handles complex tables, forms, handwriting with full layout.

Python 11,222 1,158 Updated Apr 22, 2026

🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.

Jupyter Notebook 4,170 357 Updated May 25, 2026

Skills for Real Engineers. Straight from my .claude directory.

Shell 132,160 11,501 Updated Jun 12, 2026