Starred repositories: 6 stars written in Cuda
Instant neural graphics primitives: lightning fast NeRF and more
DeepEP: an efficient expert-parallel communication library
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
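As a rough illustration of what "fine-grained scaling" means here, the sketch below is a naive FP8 GEMM where each 128-long slice of the K dimension carries its own FP32 scale factor, so quantization error is contained per group rather than per tensor. This is not DeepGEMM's actual kernel (which targets tensor cores); GROUP_K, the scale layout, and all names are illustrative assumptions, and it assumes CUDA 11.8+ for <cuda_fp8.h>.

```cuda
// Naive GEMM with fine-grained (per-128-element) FP8 scaling: a sketch of
// the idea only, not DeepGEMM's implementation. Requires CUDA 11.8+.
#include <cuda_fp8.h>

constexpr int GROUP_K = 128;  // how many K elements share one scale factor

// C[M,N] = A[M,K] * B[K,N]; A and B are FP8 (e4m3), each GROUP_K-long slice
// of K has its own FP32 scale, and accumulation is done in FP32.
// Assumes K is a multiple of GROUP_K.
__global__ void fp8_gemm_scaled(const __nv_fp8_e4m3* A,
                                const float* a_scale,  // [M][K/GROUP_K]
                                const __nv_fp8_e4m3* B,
                                const float* b_scale,  // [K/GROUP_K][N]
                                float* C, int M, int N, int K) {
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= M || col >= N) return;

    float acc = 0.0f;
    int groups = K / GROUP_K;
    for (int g = 0; g < groups; ++g) {
        float partial = 0.0f;
        for (int k = g * GROUP_K; k < (g + 1) * GROUP_K; ++k)
            partial += static_cast<float>(A[row * K + k]) *
                       static_cast<float>(B[k * N + col]);
        // One rescale per group: this is the "fine-grained" part, as
        // opposed to a single per-tensor scale.
        acc += partial * a_scale[row * groups + g] * b_scale[g * N + col];
    }
    C[row * N + col] = acc;
}
// Launch sketch: dim3 block(16, 16), grid((N + 15) / 16, (M + 15) / 16);
// fp8_gemm_scaled<<<grid, block>>>(A, a_scale, B, b_scale, C, M, N, K);
```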
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.
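A minimal sketch of the quantized-attention idea the blurb refers to: quantize Q and K to INT8 with per-row symmetric scales, take integer dot products, and rescale the logits in FP32. SageAttention's real kernels add smoothing, block-wise quantization, and a fused softmax, none of which appear here; all names and layouts below are illustrative.

```cuda
// INT8 Q*K^T with per-row scales: a sketch of the quantization idea only,
// not SageAttention's kernels.
#include <cstdint>

__global__ void qk_int8_scores(const int8_t* Qq, const float* q_scale, // [M][D], [M]
                               const int8_t* Kq, const float* k_scale, // [N][D], [N]
                               float* S, int M, int N, int D,
                               float inv_sqrt_d) {
    int i = blockIdx.y * blockDim.y + threadIdx.y;  // query index
    int j = blockIdx.x * blockDim.x + threadIdx.x;  // key index
    if (i >= M || j >= N) return;

    int acc = 0;  // INT8 x INT8 products accumulate exactly in INT32
    for (int d = 0; d < D; ++d)
        acc += int(Qq[i * D + d]) * int(Kq[j * D + d]);

    // Undo both symmetric quantization scales, then apply 1/sqrt(d)
    // as in standard scaled dot-product attention.
    S[i * N + j] = float(acc) * q_scale[i] * k_scale[j] * inv_sqrt_d;
}
```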
How to optimize common algorithms in CUDA.
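A representative example from this genre of tutorial: the classic shared-memory tree reduction, one of the standard first exercises in CUDA optimization. The kernel and launch parameters below are illustrative, not taken from the repository.

```cuda
// Block-level sum reduction staged through shared memory: each thread loads
// two elements, then a tree reduction with halving strides combines them.
__global__ void block_sum(const float* in, float* out, int n) {
    extern __shared__ float sdata[];
    unsigned tid = threadIdx.x;
    unsigned i = blockIdx.x * (blockDim.x * 2) + tid;

    // Load two elements per thread so half as many blocks are needed.
    float v = (i < n) ? in[i] : 0.0f;
    if (i + blockDim.x < n) v += in[i + blockDim.x];
    sdata[tid] = v;
    __syncthreads();

    // Sequential addressing keeps shared-memory accesses conflict-free.
    for (unsigned s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s) sdata[tid] += sdata[tid + s];
        __syncthreads();
    }
    if (tid == 0) out[blockIdx.x] = sdata[0];  // one partial sum per block
}
// Launch sketch: block_sum<<<numBlocks, threads, threads * sizeof(float)>>>(in, out, n);
// Repeat on the per-block partials (or finish on the host) for the full sum.
```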