Python CUDA tech lead @NVIDIA. Open source contributor on my spare time.
- Greater NYC area
- https://leofang.github.io/about
Highlights
- Pro
Stars
6
results
for source starred repositories
written in Cuda
Clear filter
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Collection of CUDA benchmarks, with a focus on unified vs. explicit memory management.
WIP Benchmarking code for Thrust and CUB
cuda stream benchmark: based on work by Massimiliano Fatica@NVIDIA