Dacheng Li DachengLi1

😄

PhD at UC Berkeley working on ML and distributed systems.

313 followers · 104 following

dacheng-li.info

Achievements

x2 x3 x2

Lists (1)

🚀 My stack

1 repository

Stars

4 starred source repositories written in Cuda

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda · 8,694 stars · 972 forks · Updated Nov 5, 2025

thu-ml / SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda · 2,622 stars · 258 forks · Updated Nov 6, 2025

baidu-research / baidu-allreduce

Cuda · 601 stars · 113 forks · Updated Apr 6, 2018

LeiWang1999 / AutoGPTQ.tvm

GPTQ inference TVM kernel

Cuda · 39 stars · 1 fork · Updated Apr 25, 2024
