Skip to content
View kayzee3327's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report kayzee3327

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

📖 作为对《C++ Concurrency in Action - SECOND EDITION》的中文翻译。

2,327 456 Updated Jan 26, 2021

A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.

Go 106,187 15,030 Updated Apr 27, 2026

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Cuda 438 28 Updated Mar 30, 2026

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 4,120 568 Updated Mar 26, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 91,744 14,135 Updated Apr 16, 2026

Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"

Python 952 119 Updated Apr 14, 2026

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,758 263 Updated Nov 12, 2025

paper list, tutorial, and nano code snippet for Diffusion Large Language Models.

Jupyter Notebook 167 9 Updated Jan 19, 2026

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

766 37 Updated Feb 28, 2026

A curated list for Efficient Large Language Models

Python 1,997 163 Updated Jun 17, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,603 204 Updated Apr 20, 2026

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Python 544 26 Updated Mar 19, 2026

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 5,188 372 Updated Apr 20, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,654 1,830 Updated Apr 25, 2026

Mirror of the Xen Repository (PRs not accepted see: http://wiki.xenproject.org/wiki/Submitting_Xen_Project_Patches)

C 805 389 Updated Apr 29, 2026

清华大学操作系统课程实验 (OS Kernel Labs)

C 2,237 454 Updated Aug 26, 2022

OS Labs for MOOC

C 419 199 Updated Sep 28, 2014

100 Days of ML Coding

50,802 11,495 Updated Dec 29, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,660 16,278 Updated Apr 30, 2026

Rust 程序设计语言(2024 edition 施工完毕)

Markdown 5,440 727 Updated Apr 23, 2026

Let's write an OS which can run on RISC-V in Rust from scratch!

Rust 2,025 549 Updated Mar 30, 2026

2023秋冬季开源操作系统训练营

1 Updated Apr 13, 2025

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 122,261 13,476 Updated Apr 30, 2026

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 9,127 2,321 Updated Mar 30, 2026

Material for gpu-mode lectures

Jupyter Notebook 6,031 607 Updated Apr 22, 2026

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10,827 1,093 Updated Apr 20, 2026

Several simple examples for popular neural network toolkits calling custom CUDA operators.

Python 1,533 204 Updated Apr 29, 2021

Machine Learning Engineering Open Book

Python 17,831 1,132 Updated Mar 16, 2026
Next