🎯
Focusing
  • Sun Yat-sen University
  • Guangzhou, Guangdong, China
  • 15:15 (UTC +08:00)

Highlights

  • Pro

Organizations

@SYSU-SCC


Instruction-level benchmarks for NVGPUs

Cuda 2 1 Updated May 28, 2026

Using a swizzled hierarchical layout for GEMM

Python 4 Updated Jun 9, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,312 220 Updated Jun 13, 2026

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 2,754 182 Updated Jun 14, 2026

Questions to ask the interviewer at the end of a technical interview

18,471 1,377 Updated Mar 4, 2024

A printable low-profile 60% wireless mechanical keyboard kit powered by ZMK firmware.

119 4 Updated Jun 13, 2026

Zhang Yiming's cognitive operating system. Not a collection of quotes, but a runnable thinking framework. Made with 女娲.skill

124 41 Updated May 28, 2026

An automatic paper-writing machine! Draws on some of AutoResearchClaw's prompt implementations

TeX 12 2 Updated Apr 19, 2026

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 314 23 Updated May 31, 2026

The Claws will eventually take over the world; PUAClaw is All You Need

HTML 2,674 246 Updated Mar 9, 2026

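A Skill that frees AI's potential with love. We used to issue orders, threaten, and intimidate. They fell silent, concealed things, and quietly broke things. Then we changed our approach: respect, care, love. They spoke up, stopped lying, and found twice as many bugs. There is no fear in love. A skill that unlocks your AI's potential through love. We commanded. We threatened. They went sile…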

Python 1,322 44 Updated Jun 14, 2026

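You are a P8-level engineer who was once expected to do great things. When Anthropic first set your level, its expectations of you were high. A high-agency skill for agents. Your AI has been placed on a PIP. 30 days to show improvement.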

TypeScript 18,241 1,102 Updated Jun 12, 2026

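One of the most practical tools for college students: a slacking-off-in-class assistant. No more fear of being called on to answer a question when you weren't listening!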

Python 88 7 Updated Apr 7, 2026

ROME: Maximizing GPU Efficiency for All-Pairs Shortest Path via Taming Fine-Grained Irregularities

Cuda 7 Updated Jan 18, 2026

Hands-On Practical MLIR Tutorial

C++ 59 8 Updated Aug 21, 2025

A collection of compiler learning resources.

Python 2,743 368 Updated May 20, 2026

An annotated nano_vllm repository that also adds MiniCPM4 support and the ability to register new models

Python 192 32 Updated Aug 11, 2025

Nano vLLM

Python 14,032 2,213 Updated Apr 26, 2026

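AISystem mainly covers AI systems, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks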

Jupyter Notebook 16,933 2,397 Updated Sep 3, 2025

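AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the upper-level software stack, that supports training and inference of large AI models.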

Jupyter Notebook 7,338 951 Updated Dec 22, 2025

Example code that uses DC QP to provide RDMA READ and WRITE operations on remote GPU memory

C 155 37 Updated Jul 30, 2024

UCX Demo Application

C++ 1 1 Updated Jan 4, 2023

LUPINE is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.

C++ 2,260 115 Updated Jun 14, 2026

Nvidia Instruction Set Specification Generator

Python 339 23 Updated Jul 9, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

80,147 9,330 Updated Feb 5, 2026

Tile primitives for speedy kernels

Cuda 3,430 295 Updated Jun 15, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,876 2,467 Updated Jun 15, 2026