Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
VideoNSA: Native Sparse Attention Scales Video Understanding
An intuitive and low-overhead instrumentation tool for Python
A domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
Efficient Triton Kernels for LLM Training
toothacher17 / Megatron-LM
Forked from NVIDIA/Megatron-LM. Ongoing research training transformer models at scale
🚀 Efficient implementations of state-of-the-art linear attention models
Development repository for the Triton language and compiler
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient Multi-head Latent Attention Kernels
OpenSeek aims to unite the global open-source community to drive collaborative innovation in algorithms, data, and systems to develop next-generation models.