Peter9606
  • Iluvatar CoreX, Inc
  • Shanghai, China

Organizations

@cpprefjp

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

MLIR 192 15 Updated Dec 20, 2025

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,634 83 Updated Dec 20, 2025
Python 39 3 Updated Dec 14, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,985 777 Updated Dec 8, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,818 1,033 Updated Dec 5, 2025

LLVM/MLIR based compiler instrumentation of AMD GPU kernels

C++ 21 7 Updated Jul 13, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,520 810 Updated Dec 20, 2025

MLIR For Beginners tutorial

C++ 1,168 111 Updated Jul 18, 2025

shorten that long URL into a tiny URL

JavaScript 18 16 Updated Mar 23, 2017

Play with MLIR right in your browser

TypeScript 139 8 Updated May 25, 2023

A collection of out-of-tree LLVM passes for teaching and learning

C++ 3,325 432 Updated Dec 16, 2025

A hands-on LLVM backend tutorial (WIP)

Ruby 43 5 Updated May 12, 2022

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 183,441 26,303 Updated Dec 19, 2025

an architecture-independent decompiler to LLVM IR

C++ 395 47 Updated Aug 5, 2015

A modular configuration of Vim and Neovim

Vim Script 20,329 1,428 Updated Feb 17, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,995 1,588 Updated Dec 19, 2025

A repository for studying the LLVM Tutorial

C++ 45 3 Updated Nov 7, 2021

An interpreter for PL/0'

C++ 4 1 Updated Sep 5, 2018

Getting Started with LLVM Core Libraries (Chinese edition), translated by 潘立丰

138 42 Updated Oct 6, 2023

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 36,016 15,546 Updated Dec 20, 2025

Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library

C++ 1,709 120 Updated Apr 16, 2025
C++ 398 76 Updated Dec 6, 2025

Efficient GPU kernels for block-sparse matrix multiplication and convolution

Cuda 1,063 198 Updated Jun 8, 2023

cuASR: CUDA Algebra for Semirings

Cuda 42 3 Updated Aug 22, 2022

DO NOT USE : Deprecated : Mirror of AMD llvm-project : The source repo is https://github.com/RadeonOpenCompute/llvm-project. Several times a day the default branch "amd-stg-open" is updated from th…

14 6 Updated Jul 12, 2023

modified cutlass

C++ 15 3 Updated Oct 26, 2020

Full-speed Array of Structures access

C++ 176 28 Updated Apr 25, 2023

The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++

CSS 44,616 5,545 Updated Dec 3, 2025

📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/

C++ 25,247 3,085 Updated Aug 17, 2024

LAPACK development repository

Fortran 1,765 483 Updated Dec 19, 2025