#
🎯
Focusing
🔥 AI Infra Researcher
-
Alibaba Group
- Hangzhou ⇌ Hong Kong
-
07:02
(UTC +08:00) - https://www.lingyunyang.com/
- https://orcid.org/0000-0002-3186-3189
- @stephenyang1999
- in/stephenyang1999
Highlights
- Pro
Starred repositories
4
stars
written in Cuda
Clear filter
DeepEP: an efficient expert-parallel communication library
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"
FlashInfer: Kernel Library for LLM Serving