🚗
Auto-driving
[ OK ] Done building Yuting Xie. Enjoy!
-
Huawei Canada
- Markham, Canada
- @lovelydett
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
10
stars
written in Cuda
Clear filter
DeepEP: an efficient expert-parallel communication library
cuVS - a library for vector search and clustering on the GPU
A simple high performance CUDA GEMM implementation.
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU scheduling.
A tool for examining GPU scheduling behavior.
Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]