Skip to content
View ch1y0q's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Shanghai Jiao Tong University
  • Shanghai, China
  • 03:05 (UTC +08:00)

Highlights

  • Pro

Organizations

@seumsc @seulinux

Block or report ch1y0q

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

夸克网盘文件管理 CLI 工具 - Quark Cloud Drive File Management CLI Tool

Go 32 6 Updated Feb 14, 2026

Review automated kernel generation in the era of LLMs

98 1 Updated Jan 23, 2026

eBPF for GPU UVM offloading and scheduling in Linux kernel

C 26 1 Updated Feb 17, 2026

NVIDIA Linux open GPU kernel module source

C 1 Updated Dec 6, 2025

 Three-finger trackpad gestures for middle-click and middle-drag on macOS

Swift 109 2 Updated Feb 17, 2026

Predict the performance of LLM inference services

Jupyter Notebook 21 1 Updated Sep 18, 2025
Go 40 5 Updated Jun 30, 2025

Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"

HTML 64 11 Updated Oct 15, 2025

Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes" https://people.inf.ethz.ch/omutlu/…

C++ 50 17 Updated Aug 21, 2018
Cuda 3 Updated Nov 4, 2024

A Primer on Memory Consistency and Cache Coherence (Second Edition) 翻译计划

335 48 Updated May 5, 2024

NVIDIA Linux open GPU with P2P support

C 1,323 133 Updated Jun 6, 2025

GEMM multi-GPU example program

Cuda 4 2 Updated Jun 17, 2021

Multi-GPU Computing Benchmark Suite (CUDA)

C++ 43 10 Updated Jun 12, 2017

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model

Python 64 2 Updated Oct 18, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,898 300 Updated Feb 17, 2026

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 867 147 Updated Sep 26, 2025

High-level tracing language for Linux

C++ 9,944 1,438 Updated Feb 17, 2026

Allow torch tensor memory to be released and resumed later

Python 217 39 Updated Feb 9, 2026

Development repository for the Triton language and compiler

MLIR 18,439 2,586 Updated Feb 17, 2026

这是一个简单的技术科普教程项目,主要聚焦于解释一些有趣的,前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。

6,694 602 Updated Nov 10, 2025

best way to save what you love

Svelte 38,684 3,197 Updated Jan 24, 2026

Set the color of files/folders for OSX Finder from the command line.

Python 35 7 Updated Feb 24, 2022

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 6,094 829 Updated Dec 22, 2025

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 902 244 Updated Feb 16, 2026

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,193 214 Updated Feb 9, 2026

a taichi implementation of fast and differentiable stroke renderer

Python 2 Updated Jun 4, 2025
Verilog 1,898 438 Updated Feb 17, 2026

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,919 312 Updated Jan 14, 2026
Next