Skip to content
View aMarry's full-sized avatar

Organizations

@hanwuji-2017-13

Block or report aMarry

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
C++ 8 Updated Aug 22, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,303 1,163 Updated Jun 23, 2026

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,586 99 Updated Jan 28, 2026

Master Modern C++(11/14/17/20) Templates: TMP, SFINAE, Concepts, CRTP, Variadic Magic, and Compile-Time Sorcery

C++ 1,642 282 Updated Jan 24, 2025
C++ 183 45 Updated May 11, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,400 754 Updated Jun 23, 2026

Fast and memory-efficient exact attention

Python 24,218 2,851 Updated Jun 22, 2026

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,943 1,851 Updated Apr 19, 2026

compiler learning resources collect.

Python 2,749 370 Updated May 20, 2026

《Learn LLVM 12》的非专业个人翻译

TeX 1 Updated Dec 29, 2021

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 12,228 2,324 Updated Oct 30, 2023

Deep Learning Book Chinese Translation

TeX 37,286 9,131 Updated Dec 3, 2019

this records what I have read and learned from papers or book about machine learning

2 Updated Nov 12, 2017