Curated collection of papers in machine learning systems
The repo is finally unlocked. Enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd. Built in Rust using oh-my-codex.
A curated survey of database systems, design patterns, and architectural practices in modern AI systems including multi-agent frameworks, RAG pipelines, and LLM applications.
This repository contains the code for the ICLR 2026 paper “DASH: Deterministic Attention Scheduling for High-Throughput Reproducible LLM Training”, developed on top of the FlashAttention codebase.
A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.
Accelerating MoE with IO and Tile-aware Optimizations
Paper2Slides: From Paper to Presentation in One Click
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven).
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
Distributed Compiler based on Triton for Parallel Systems
[ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
DeepEP: an efficient expert-parallel communication library
FlashMLA: Efficient Multi-head Latent Attention Kernels
[NeurIPS 2025] A simple extension of vLLM that helps you speed up reasoning models without additional training.
Puzzles for learning Triton
A unified inference and post-training framework for accelerated video generation.
Large Language Model (LLM) Systems Paper List