Highlights
- Pro
Stars
4
stars
written in Cuda
Clear filter
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Parallel Computing starter project to build GPU & CPU kernels in CUDA & C++ and call them from Python without a single line of CMake using PyBind11