- Greater Bay Area
- https://shesung.github.io
Stars
3
stars
written in Cuda
Clear filter
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Fully Convolutional Instance-aware Semantic Segmentation