
Organizations

@ParCoreLab @agucomputersociety

21 stars written in Cuda

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,810 464 Updated Oct 9, 2023

NCCL Tests

Cuda 1,382 337 Updated Nov 21, 2025

Introduction to Parallel Programming class code

Cuda 1,340 1,142 Updated Jun 27, 2022

Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)

Cuda 925 339 Updated Aug 19, 2024

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 846 145 Updated Sep 26, 2025

Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial

Cuda 341 68 Updated Dec 3, 2025

A minimal cuDNN deep learning training code sample using LeNet.

Cuda 268 93 Updated Jul 30, 2023

Efficient Top-K implementation on the GPU

Cuda 192 24 Updated Apr 9, 2019

Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"

Cuda 94 32 Updated Aug 14, 2023

Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite

Cuda 68 15 Updated Sep 12, 2018

Implementation and analysis of five different GPU-based SpMV algorithms in CUDA

Cuda 40 12 Updated Feb 5, 2019

CUDA Dynamic Memory Allocator for SOA Data Layout

Cuda 38 5 Updated Dec 29, 2021
Cuda 32 12 Updated Aug 24, 2022

A Winograd Minimal Filter Implementation in CUDA

Cuda 28 2 Updated Aug 25, 2021

Graph Challenge 2021

Cuda 27 3 Updated Jul 9, 2021

Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involvement of the CPU beyond the initial kernel launch.

Cuda 22 3 Updated Apr 25, 2024

This repository contains the source code for our ACM SIGMOD '22 paper (Evaluating Multi-GPU Sorting with Modern Interconnects)

Cuda 5 1 Updated Apr 26, 2022

Try the newly released `cudaLaunchCooperativeKernelMultiDevice()` in CUDA C++

Cuda 2 1 Updated May 18, 2019