yhyang201

Yuhao Yang yhyang201

Focus on MLSys. Feel free to contact me: yhyang201@gmail.com

128 followers · 146 following

Achievements

x3 x4

Achievements

x3 x4

Highlights

Stars

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,077 120 Updated Jun 12, 2026

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 890 155 Updated Jun 23, 2026

Menooker / KunQuant

A compiler, optimizer and executor for financial expressions and factors

C++ 292 56 Updated May 29, 2026

mit-han-lab / kernel-design-agents

622 51 Updated Jun 2, 2026

mit-han-lab / KernelWiki

Python 267 32 Updated Jun 9, 2026

BBuf / KDA-Pilot

Python 196 31 Updated Jun 23, 2026

Egonex-AI / Understand-Anything

Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…

TypeScript 66,525 5,520 Updated Jun 20, 2026

NVIDIA / kvpress

LLM KV cache compression made easy

Python 1,117 155 Updated Jun 22, 2026

NVIDIA / trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Python 1,662 349 Updated May 5, 2026

lightseekorg / tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,484 168 Updated Jun 23, 2026

NVIDIA-NeMo / Skills

A project to improve skills of large language models

Python 984 190 Updated Jun 22, 2026

QwenLM / FlashQLA

high-performance linear attention kernel library built on TileLang

Python 556 48 Updated May 7, 2026

PolyArch / humanize

From Automated Idea Factory to Realization

Shell 1,177 97 Updated Jun 13, 2026

NVIDIA / Model-Optimizer

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…

Python 2,970 453 Updated Jun 23, 2026

nektos / act

Run your GitHub Actions locally 🚀

Go 70,820 1,960 Updated Jun 1, 2026

inclusionAI / cuLA

CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.

Python 525 65 Updated Jun 23, 2026

BBuf / AI-Infra-Auto-Driven-SKILLS

Python 595 52 Updated Jun 20, 2026

ultraworkers / claw-code

An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.

Rust 194,195 109,906 Updated Jun 8, 2026

obra / superpowers

An agentic skills framework & software development methodology that works.

Shell 236,491 20,987 Updated Jun 23, 2026

shareAI-lab / learn-claude-code

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

Python 68,013 11,063 Updated Jun 22, 2026

SandAI-org / MagiCompiler

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 314 23 Updated Jun 23, 2026

sgl-project / SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 903 265 Updated Jun 22, 2026

junyangwang0410 / AMBER

An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation

Python 169 7 Updated Jan 15, 2024

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 88,230 12,770 Updated Mar 26, 2026

caoshiyi / K-Search

Automated High-Performance GPU Kernel Generation

Python 116 22 Updated Jun 1, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,710 1,063 Updated Apr 30, 2026

shareAI-lab / claw0

0 - 1 learn OpenClaw: sections to build an claw-AI agent from scratch

Python 2,979 344 Updated Mar 18, 2026

anthropics / skills

Public repository for Agent Skills

Python 154,164 18,171 Updated Jun 9, 2026

AyakaGEMM / Hands-on-GEMM

Cuda 154 20 Updated Mar 18, 2024

moeru-ai / airi

💖🧸 Self hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

TypeScript 41,197 4,146 Updated Jun 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yuhao Yang yhyang201

Achievements

Achievements

Highlights

Block or report yhyang201

Stars

ovg-project / kvcached

pytorch / helion

Menooker / KunQuant

mit-han-lab / kernel-design-agents

mit-han-lab / KernelWiki

BBuf / KDA-Pilot

Egonex-AI / Understand-Anything

NVIDIA / kvpress

NVIDIA / trt-samples-for-hackathon-cn

lightseekorg / tokenspeed

NVIDIA-NeMo / Skills

QwenLM / FlashQLA

PolyArch / humanize

NVIDIA / Model-Optimizer

nektos / act

inclusionAI / cuLA

BBuf / AI-Infra-Auto-Driven-SKILLS

ultraworkers / claw-code

obra / superpowers

shareAI-lab / learn-claude-code

SandAI-org / MagiCompiler

sgl-project / SpecForge

junyangwang0410 / AMBER

karpathy / autoresearch

caoshiyi / K-Search

deepseek-ai / FlashMLA

shareAI-lab / claw0

anthropics / skills

AyakaGEMM / Hands-on-GEMM

moeru-ai / airi