Skip to content
View TKH666's full-sized avatar

Highlights

  • Pro

Block or report TKH666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Heterogeneous GPU Sharing on Kubernetes

Go 3,595 580 Updated Jun 18, 2026

GPT Image 2 prompt gallery, image prompt library, agentic skill, and CLI for OpenAI image generation/editing

Python 3,140 281 Updated May 23, 2026

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 33,133 4,195 Updated Jun 21, 2026

Everything we actually know about the Apple Neural Engine (ANE)

2,483 97 Updated Mar 12, 2026
Mathematica 370 42 Updated Sep 25, 2025

Training neural networks on Apple Neural Engine via reverse-engineered private APIs

Objective-C 6,870 947 Updated Mar 10, 2026

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 38,910 17,546 Updated Jun 21, 2026

Awesome LLM compression research papers and tools.

1,847 128 Updated Feb 23, 2026
Python 303 23 Updated Feb 15, 2026

Memory library for building stateful agents

Python 5,320 645 Updated Jun 19, 2026

real time face swap and one-click video deepfake with only a single image

Python 94,013 13,704 Updated Jun 14, 2026
Python 18 5 Updated Apr 3, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 218,897 33,561 Updated Jun 21, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

Python 67,583 10,991 Updated Jun 15, 2026

LLM KV cache compression made easy

Python 1,115 155 Updated Jun 17, 2026

Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference

Python 48 3 Updated Mar 28, 2026

hexagon tutorial

C 50 9 Updated Mar 29, 2026

LLM inference in C/C++

C++ 52 7 Updated Jun 18, 2026

FastRPC is Qualcomm's userspace library that facilitates efficient remote procedure calls between the CPU and DSP for high-performance computing.

C 99 71 Updated Jun 17, 2026

AIOS: AI Agent Operating System

Python 5,944 832 Updated Jun 18, 2026

Spec-driven development (SDD) for AI coding assistants.

TypeScript 55,786 3,910 Updated Jun 13, 2026

Awesome resources for GPUs

628 60 Updated Mar 10, 2026
Python 782 71 Updated Jun 1, 2026

MobileFineTuner: Native C++ framework for fine-tuning LLMs directly on mobile devices. Features: LoRA/Full-FT, ZeRO-inspired parameter sharding, energy-aware throttling, custom autograd engine. Kee…

C++ 14 7 Updated Jun 10, 2026

[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance

Python 309 13 Updated Jul 30, 2025

Using AI for high quality writing

Python 473 54 Updated Jun 2, 2026
TypeScript 22,070 2,531 Updated Jun 20, 2026

一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework

C++ 1,830 222 Updated Apr 25, 2026
Next