- Luthier Public
  Forked from matinraayai/Luthier
  Luthier, a GPU binary instrumentation tool for AMD GPUs
  C++ Other Updated Dec 17, 2025
- aiter Public
  Forked from ROCm/aiter
  AI Tensor Engine for ROCm
  Python MIT License Updated Dec 10, 2025
- gcnasm Public
  Forked from carlushuang/gcnasm
  amdgpu example code in hip/asm
  C++ Updated Nov 17, 2025
- rocm-libraries Public
  Forked from ROCm/rocm-libraries
  monorepo for rocm libraries
  Assembly Updated Sep 10, 2025
- composable_kernel Public
  Forked from ROCm/composable_kernel
  Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
  C++ Other Updated Sep 9, 2025
- instrument-amdgpu-kernels Public
  Forked from CRobeck/instrument-amdgpu-kernels
  LLVM/MLIR based compiler instrumentation of AMD GPU kernels
  C++ MIT License Updated Jul 13, 2025
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)
  Python MIT License Updated Jun 24, 2025
- hip4jupyter Public
  Forked from andreinechaev/nvcc4jupyter
  A plugin for Jupyter Notebook to run HIP C/C++ code
  Jupyter Notebook MIT License Updated Jun 4, 2025
- luthier-ispass2025 Public
  Forked from NUCAR-DEV/luthier-ispass2025
  Artifact for ISPASS 2025 Paper "Luthier: A Dynamic Binary Instrumentation Framework Targeting AMD GPUs"
  C++ Apache License 2.0 Updated Apr 7, 2025
- fp32_sgemm_amd Public
  Forked from seb-v/fp32_sgemm_amd
  Super fast FP32 matrix multiplication on RDNA3
  Assembly MIT License Updated Mar 30, 2025
- HIP_Course Public
  Forked from pelagos-consulting/HIP_Course
  Accelerated computing with HIP
  HTML Other Updated Mar 14, 2025
- ROCm-ComputeABI-Doc Public
  Forked from ROCm/ROCm-ComputeABI-Doc
  ROCm - AMDGPU Compute Application Binary Interface
  Updated Mar 19, 2022
- HIP-Performance-Optmization-on-VEGA64 Public
  Forked from fsword73/HIP-Performance-Optmization-on-VEGA64
  14 basic topics for VEGA64 performance optimization
  C++ Updated Mar 18, 2021
- rocm_start_sample Public
  Forked from feifei14119/rocm_start_sample
  hip rocm start sample for amd gpu
  C++ Updated Feb 20, 2021
- SGEMM_on_VEGA Public
  Forked from fsword73/SGEMM_on_VEGA
  An alternative SGEMM implementation on AMD Vega Series
  Assembly Updated Oct 16, 2019
- LLVM-AMDGPU-Assembler-Extra Public
  Forked from ROCm/LLVM-AMDGPU-Assembler-Extra
  LLVM AMDGPU Assembler Helper Tools
  CMake Other Updated Jun 15, 2017