🎯
Focusing
  • Shanghai Jiao Tong University
  • Shanghai, China

Highlights

  • Pro

Organizations

@seumsc @seulinux

Examples of CUDA implementations by Cutlass CuTe

Makefile 278 34 Updated Jul 1, 2025

A video wallpaper engine for macOS Tahoe

Swift 761 21 Updated May 24, 2026

MCP server for Treehole/YKST gRPC-Web APIs

JavaScript 1 Updated May 26, 2026

[HPCA 2026] AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 365 130 Updated Apr 22, 2026

Get a shell on almost any OpenClaw host machine.

Python 287 41 Updated Mar 9, 2026

Virtualize macOS 12 and later on Apple Silicon. VirtualBuddy is a virtual machine GUI for macOS on M1, M2, M3, and M4.

Swift 8,223 236 Updated Jun 19, 2026

Database of China Rail Transit (non-technical): another view of rail transit

SCSS 260 25 Updated Nov 22, 2025

Command-line client for Aliyun Drive (阿里云盘), with JavaScript plugin support and sync/backup functionality.

Go 5,053 390 Updated Apr 20, 2026

Quark Cloud Drive (夸克网盘) file management CLI tool

Go 174 17 Updated May 19, 2026

Review automated kernel generation in the era of LLMs

238 18 Updated May 26, 2026

eBPF for GPU UVM offloading and scheduling in Linux kernel

C 59 5 Updated Apr 15, 2026

NVIDIA Linux open GPU kernel module source

C 4 Updated Mar 6, 2026

Three-finger trackpad gestures for middle-click and middle-drag on macOS

Swift 210 5 Updated Apr 22, 2026

Predict the performance of LLM inference services

Jupyter Notebook 23 1 Updated Sep 18, 2025
Go 42 6 Updated May 19, 2026

Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"

HTML 95 19 Updated Oct 15, 2025

Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support for Multiple Page Sizes" https://people.inf.ethz.ch/omutlu/…

C++ 50 17 Updated Aug 21, 2018
Cuda 3 Updated Nov 4, 2024

Translation project for A Primer on Memory Consistency and Cache Coherence (Second Edition)

356 56 Updated May 5, 2024

NVIDIA Linux open GPU with P2P support

C 1,381 142 Updated Jun 6, 2025

GEMM multi-GPU example program

Cuda 4 2 Updated Jun 17, 2021

Multi-GPU Computing Benchmark Suite (CUDA)

C++ 43 10 Updated Jun 12, 2017

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model

Python 118 6 Updated Apr 28, 2026

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 4,131 334 Updated Jun 20, 2026

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 897 152 Updated Sep 26, 2025

High-level tracing language for Linux

C++ 10,179 1,465 Updated Jun 19, 2026

Allow torch tensor memory to be released and resumed later

Python 251 60 Updated May 16, 2026

Development repository for the Triton language and compiler

MLIR 19,485 2,949 Updated Jun 20, 2026

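A simple technical explainer tutorial project, focused on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable in under 5 minutes.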

Python 6,950 607 Updated Mar 8, 2026