Skip to content
View sg0's full-sized avatar

Block or report sg0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

HPC Challenge Benchmark

C 66 16 Updated Sep 28, 2025

gups mirror

C 11 11 Updated Oct 25, 2015

Welcome to OptML! This repository is designed for those new to MLIR and machine learning-based optimizations. As a compiler enthusiast, I wanted to create a platform for hobbyists like myself to ex…

MLIR 20 4 Updated Sep 16, 2024
C 1 1 Updated Apr 2, 2025

Suitor Matching on GPUs

Cuda 2 Updated Jan 29, 2025
Python 1 1 Updated Sep 24, 2024

LLM training in simple, raw C/CUDA

Cuda 28,460 3,337 Updated Jun 26, 2025

GTgraph: A suite of synthetic random graph generators

C 9 1 Updated Dec 14, 2020

LLM inference in C/C++

C++ 91,989 14,244 Updated Dec 25, 2025

SST Macro Element Library

C++ 2 Updated Apr 12, 2024

Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)

C++ 113 94 Updated May 18, 2023

Barcelona OpenMP Task Suite is a collection of applications that allow to test OpenMP tasking implementations and compare its behaviour under certain circumstances: task tiedness, throttle and cut-…

C 46 17 Updated Sep 20, 2019

Distributed View Extension for Kokkos

C++ 48 20 Updated Dec 2, 2024

Clang with JIT extensions

C++ 237 25 Updated Dec 11, 2022

C++ demo of deep neural networks (MLP, CNN)

C++ 31 11 Updated Dec 28, 2023

Proxy App of AAE for molecular dynamics

Python 2 Updated May 18, 2023

Library implementation of MPI-4 Partitioned Communication

C 8 Updated Dec 8, 2025

Proxy application for analyzing dynamical systems.

MLIR 1 2 Updated Sep 23, 2023

SpDNN Implementations using CuPY, cuSparse, and OpenMP

HTML 1 1 Updated Jun 30, 2023

CUDA Library Samples

C++ 2,255 432 Updated Dec 22, 2025

SST Macro Element Library

C++ 1 Updated Jun 11, 2020

Run TensorFlow models in C++ without installation and without Bazel

C++ 809 181 Updated Aug 16, 2024

A repository containing C++11/14/17 concepts and code snippets

C++ 96 26 Updated Jan 31, 2018

SparseP is the first open-source Sparse Matrix Vector Multiplication (SpMV) software package for real-world Processing-In-Memory (PIM) architectures. SparseP is developed to evaluate and characteri…

C 77 13 Updated Jun 29, 2022

Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators

C++ 98 29 Updated Jun 30, 2025

Compact subset of C++ MPL messaging library.

C++ 7 1 Updated Dec 4, 2023

A library of GPU kernels for sparse matrix operations.

C++ 280 53 Updated Nov 24, 2020

cuGraph - RAPIDS Graph Analytics Library

Cuda 2,095 341 Updated Dec 23, 2025

Header-only C++20 wrapper for MPI 4.0.

C++ 47 Updated Oct 21, 2023

Tickets for the MPI Forum

67 9 Updated Dec 10, 2021
Next