FP16xINT4 LLM inference kernel that achieves near-ideal ~4x speedups at medium batch sizes of 16-32 tokens.
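The ~4x speedup above comes from storing weights in 4-bit integers and dequantizing to FP16 on the fly, which cuts memory traffic roughly fourfold. A minimal pure-Python sketch of symmetric per-group INT4 quantization (hypothetical illustration only, not the actual kernel):

```python
# Hypothetical sketch of symmetric per-group INT4 weight quantization.
# Real FP16xINT4 kernels pack two 4-bit values per byte and dequantize
# inside the GEMM; here we just show the numerics.

def quantize_int4(weights, group_size=4):
    """Map each group of floats to integers in [-8, 7] with one FP scale per group."""
    qweights, scales = [], []
    for i in range(0, len(weights), group_size):
        group = weights[i:i + group_size]
        # Per-group scale so the largest magnitude maps near the INT4 limit 7.
        scale = max(abs(w) for w in group) / 7 or 1.0
        scales.append(scale)
        qweights.append([max(-8, min(7, round(w / scale))) for w in group])
    return qweights, scales

def dequantize_int4(qweights, scales):
    """Recover approximate FP values: q * scale, flattened back to one list."""
    return [q * s for qs, s in zip(qweights, scales) for q in qs]
```

The reconstruction error per weight is bounded by half the group scale, which is why quality degrades gracefully as long as groups are small.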
Accelerating MoE with IO and Tile-aware Optimizations
Helpful kernel tutorials, examples, and SKILLs for tile-based GPU programming
Distributed Compiler based on Triton for Parallel Systems
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
slime is an LLM post-training framework for RL scaling.
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A Survey of Reinforcement Learning for Large Reasoning Models
Code repo for efficient quantized MoE inference with mixture of low-rank compensators
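A low-rank compensator corrects quantization error by adding a small low-rank term to the quantized weight matrix, W ≈ Q + u·vᵀ, applied without ever materializing the full matrix. A pure-Python rank-1 sketch under that assumption (the names `apply_compensated` etc. are hypothetical, not the repo's API):

```python
# Hypothetical rank-1 compensator sketch: y = (Q + u v^T) x computed as
# Q x + u (v . x), so the correction costs only two dot products extra.

def matvec(M, x):
    """Dense matrix-vector product over plain lists."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def apply_compensated(Q, u, v, x):
    """Apply (Q + u v^T) to x without forming the rank-1 update explicitly."""
    qx = matvec(Q, x)
    vx = sum(vi * xi for vi, xi in zip(v, x))  # scalar v . x
    return [q + ui * vx for q, ui in zip(qx, u)]
```

In practice u and v would be fit (e.g. via a truncated SVD) to the residual between the original and quantized expert weights; the point of the factored form is that the compensator adds O(r·(m+n)) work instead of O(m·n).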
C++ implementations of algorithms, plus solutions (both code and mathematical proofs) in LaTeX to exercises from “Introduction to Algorithms” (3rd ed., CLRS).