Skip to content
View ax3l's full-sized avatar

Block or report ax3l

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
10 stars written in Cuda
Clear filter

[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl

Cuda 1,825 464 Updated Oct 9, 2023

NVIDIA-accelerated zero latency video compression library for interactive remoting applications

Cuda 394 93 Updated Jun 3, 2020

Parrot is a C++ library for fused array operations using CUDA/Thrust. It provides efficient GPU-accelerated operations with lazy evaluation semantics, allowing for chaining of operations without un…

Cuda 259 16 Updated Mar 11, 2026

A fast and highly scalable GPU dynamic memory allocator

Cuda 112 9 Updated Mar 11, 2015

MPI accelerator-integrated communication extensions

Cuda 40 6 Updated Apr 4, 2023

CUDA Dynamic Memory Allocator for SOA Data Layout

Cuda 39 5 Updated Dec 29, 2021

TLB Benchmarks

Cuda 35 10 Updated Sep 11, 2017

CUDA Finite Difference Library

Cuda 16 5 Updated Aug 21, 2020
Cuda 10 4 Updated Apr 11, 2019

A much faster and OpenSource implementation of the NVIDIA Performance Primitives library.

Cuda 1 Updated Dec 28, 2025