Skip to content
View chnaaam's full-sized avatar
😄
😄

Block or report chnaaam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Running a big model on a small laptop

Objective-C 3,581 432 Updated Mar 19, 2026

Exact speculative decoding on Apple Silicon, powered by MLX.

Python 176 19 Updated Apr 13, 2026

LLaMa implementation in plain C with no dependencies

C 2 Updated Mar 8, 2026

Step by step explanation/tutorial of llama2.c

C 230 20 Updated Oct 9, 2023

Inference Llama 2 in one file of pure C

C 13 4 Updated Nov 17, 2023

From-scratch C++ runtime for Llama 2 inference. Implements full transformer forward pass with RoPE, GQA, KV cache, SwiGLU, and a custom BPE tokenizer. No framework dependencies.

C++ 1 Updated Apr 12, 2026
1 Updated Apr 3, 2026

AI Agent Backend Platform on FastAPI — MCP server + AI orchestration + async DDD architecture. Zero-boilerplate CRUD, auto domain discovery, 14 Claude Code AI development skills.

Python 17 Updated Apr 13, 2026

"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"

Python 39,357 6,884 Updated Apr 13, 2026

A server implementation for Wikidata API using the Model Context Protocol (MCP).

Python 42 12 Updated Mar 27, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 154,021 23,903 Updated Apr 13, 2026

code to run evaluation on MermaidSeqBench using LLMaJ and a RESTful OpenAI-compatible API

Python 2 Updated Nov 20, 2025

test "vibe coding"

TypeScript 1 Updated Jan 13, 2026

LangGraph V1 Tutorial in Korean

Jupyter Notebook 114 50 Updated Mar 29, 2026

Train your own speech AI model from scratch

Python 150 15 Updated Feb 17, 2026

Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM stan…

C++ 532 157 Updated Mar 30, 2026

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

TypeScript 22,748 1,308 Updated Apr 13, 2026

TransInferSim is a simulation framework for analyzing transformer inference on hardware.

Python 5 2 Updated Oct 29, 2025

A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with clamping and residual connection, Mixture-of-Experts (MoE), Sel…

Python 227 15 Updated Dec 2, 2025

BBPE 底层实现

Python 38 3 Updated Apr 29, 2024

From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.

Python 108 7 Updated Nov 5, 2025

Inference Llama 2 in one file of pure Python

Python 425 28 Updated Nov 21, 2025

Video anonymization by face detection

Python 1,401 167 Updated Oct 13, 2024
Python 272 116 Updated Apr 14, 2025

A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.

C++ 84 15 Updated Nov 7, 2021

Python code to show how a systolic array works. Written for https://medium.com/@antonpaquin/whats-inside-a-tpu-c013eb51973e

Python 29 3 Updated Jun 8, 2018

An Eyeriss Chip (researched by MIT, a CNN accelerator) simulator and New DNN framework "Hive"

Python 222 57 Updated Dec 22, 2020

SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer

Python 46 3 Updated Nov 10, 2025

A powerful CLI tool that brings AI-powered code generation and file manipulation directly to your terminal.

TypeScript 2 Updated Oct 29, 2025

Build a Claude Code–like CLI coding agent from scratch.

Python 133 25 Updated Jan 22, 2026
Next