Skip to content
View simveit's full-sized avatar

Block or report simveit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. effective_transpose effective_transpose Public

    Effective transpose on Hopper GPU

    Cuda 25 3

  2. persistent_dense_gemm persistent_dense_gemm Public

    Persistent dense gemm for Hopper in `CuTeDSL`

    Python 15

  3. load_and_store load_and_store Public

    Learn about PTX instructions ldmatrix and stmatrix

    Cuda 10

  4. cute_persistent_kernels cute_persistent_kernels Public

    Python 9

  5. effective_reduction effective_reduction Public

    Improve reduction kernel step by step

    Cuda 6 1

  6. effective_scan effective_scan Public

    Improve scan step by step

    Cuda 6