Highlights
- Pro
Starred repositories
2
stars
written in Cuda
Clear filter
An extension library of WMMA API (Tensor Core API)
Boosting 4-bit inference kernels with 2:4 Sparsity