A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,386 698 Updated May 17, 2026

Nano vLLM

Python 14,010 2,209 Updated Apr 26, 2026

Provider-neutral Agent Skill for Codex, Claude Code, and agentic harness design.

1,938 165 Updated Jun 6, 2026

Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM

C++ 792 51 Updated Apr 14, 2026

This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models with 100-200M parameters.

Python 268 28 Updated May 21, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 97,086 14,853 Updated Jun 2, 2026

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 58,359 4,861 Updated Jun 11, 2026

A GPT-2 inference engine written from scratch in CUDA and C++. Implements custom CUDA kernels for tiled matrix multiplication, LayerNorm, fused attention, transformer blocks, KV cache management, a…

Cuda 39 1 Updated May 17, 2026

Generate beautiful dark-themed system architecture diagrams as standalone HTML/SVG files. Works as a Claude AI skill.

HTML 5,905 465 Updated May 13, 2026

🚀 An open-source, hands-on curriculum bridging the gap from basic RL concepts to LLM alignment, RLVR, and advanced Agentic systems.

Python 2,870 178 Updated Jun 12, 2026

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

JavaScript 53,451 10,654 Updated Jun 12, 2026

Sutskever 30 implementations inspired by https://papercode.vercel.app/ | For Agents, use https://github.com/pageman/Sutskever-Agent | Polyglot / Multi-Backed version at https://github.com/pageman/s…

Jupyter Notebook 3,272 443 Updated Mar 15, 2026

PDF textbooks for all levels: primary school, middle school, high school, and university.

Roff 74,157 16,582 Updated Oct 18, 2025