Stars: 8, written in C++
Productive, portable, and performant GPU programming in Python.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Source code for the book <<BeginningAlgorithmContests>>, second edition
General Resources for Competitive Programming
A transaction processor for a hypothetical, general-purpose, central bank digital currency
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines