🐢
working
MLSys Engineer @ Nvidia | FlashInfer and Machine Learning Compiler LLM co-design
- Redmond, WA
- 18:33 (UTC -07:00)
Highlights
- Pro
Stars
2 stars written in Cuda
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
ROCm / flashinfer
Forked from flashinfer-ai/flashinfer
FlashInfer+ROCm: ROCm port of FlashInfer