Making models go 🚀 ⚡
Full stack AI engineer specializing in deploying computer vision models on edge devices for real-time inference.
- Kuala Lumpur, Malaysia
-
06:37
(UTC +08:00) - dicksonneoh.com
- @dicksonneoh7
- in/dickson-neoh
Lists (1)
Sort Name ascending (A-Z)
Stars
4
stars
written in Cuda
Clear filter
Instant neural graphics primitives: lightning fast NeRF and more
FlashInfer: Kernel Library for LLM Serving
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This repository contains the code for the experiments in the paper.