
Organizations

@NVIDIA @eesast @llvm @illinois-impact

31 stars written in C

Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer

C 3,290 284 Updated Aug 28, 2024

The official SuiteSparse library: a suite of sparse matrix algorithms authored or co-authored by Tim Davis, Texas A&M University.

C 1,475 307 Updated Mar 17, 2026

DFA regular expression library & friends

C 980 56 Updated Feb 10, 2026

An MLIR-based toolchain for AMD AI Engine-enabled devices.

C 614 176 Updated Apr 6, 2026

STREAM benchmark

C 485 173 Updated Feb 17, 2025

IOR and mdtest

C 471 194 Updated Apr 3, 2026

Ext2/3/4 file system utilities

C 453 266 Updated Apr 3, 2026

Build userspace NVMe drivers and storage applications with CUDA support

C 423 55 Updated Dec 18, 2023

SuiteSparse:GraphBLAS: graph algorithms in the language of linear algebra. For production: (default) STABLE branch. Code development: ask me for the right branch before submitting a PR. video intro…

C 414 74 Updated Apr 2, 2026

A framework to understand RDMA

C 409 113 Updated Oct 12, 2023

Simple program to read & write to a pci device from userspace

C 334 117 Updated Apr 7, 2019

Programming language benchmarks

C 274 55 Updated Sep 9, 2020

Rodinia benchmark

C 200 111 Updated Apr 14, 2023

Automatically Tuned Linear Algebra Software (ATLAS)

C 190 38 Updated Dec 16, 2019

Hooked CUDA-related dynamic libraries by using automated code generation tools.

C 171 47 Updated Dec 12, 2023

Penglai Enclave is an open-sourced, secure and scalable TEE system for RISC-V.

C 148 33 Updated Mar 5, 2025

Simple machine mode program to probe RISC-V control and status registers

C 127 28 Updated Apr 28, 2023

A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data pointers

C 63 24 Updated Sep 11, 2020

A real-time memory trace visualizer using valgrind

C 55 15 Updated Nov 6, 2021

MPI benchmark to test and measure collective performance

C 53 19 Updated Jun 29, 2021
C 45 13 Updated Sep 18, 2020

Scrooge is a high-performance pairwise sequence aligner based on the GenASM algorithm. Scrooge includes three novel algorithmic improvements on top of GenASM, and high-performance CPU and GPU imple…

C 38 4 Updated Jun 23, 2023
C 34 29 Updated Nov 16, 2022

RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers

C 24 14 Updated Oct 5, 2020

The system call intercepting library

C 23 3 Updated Sep 18, 2022

Simulate system software behavior on machines with terabytes of main memory from your desktop.

C 7 Updated Sep 6, 2022
C 2 Updated Jul 30, 2020