FastTree

This repository provides the artifact for [MLSys'25] FastTree: Optimizing Attention Kernel and Runtime for Tree-Structured LLM Inference.

FastTree is implemented as an SGLang plugin to accelerate attention computation with the given radix tree for KV cache. This repository includes the kernel benchmark code from the paper, as well as end-to-end benchmark scripts.

System overview of FastTree.

Get Started

To reproduce the key kernel performance results from the paper, simply run the provided script:

./kernel_bench/run.sh

This script:

Builds a Docker image with all required dependencies.
Benchmarks FastTree and all baselines across different configurations.
Generates the performance figure at kernel_bench/norm_perf.pdf.

The expected output resembles:

Normalized performance of FastTree and baselines across various tree configurations. N: node number of each level. C: per-node context length of each level.

Our evaluations in the paper use an NVIDIA H100 GPU. Since FastTree is implemented with Triton and does not leverage Hopper-specific features, it should work on other GPUs. However, the provided hyperparameters are only tuned for H100.

Citation

If you find this work useful, please cite:

@inproceedings{pan2025fasttree,
  title = {FastTree: Optimizing Attention Kernel and Runtime for Tree-Structured LLM Inference},
  author = {Pan, Zaifeng and Ding, Yitong and Guan, Yue and Wang, Zheng and Yu, Zhongkai and Tang, Xulong and Wang, Yida and Ding, Yufei},
  booktitle = {Proceedings of Machine Learning and Systems},
  year = {2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
assets		assets
kernel_bench		kernel_bench
sglang_v0.2.13_bench		sglang_v0.2.13_bench
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FastTree

Get Started

Citation

About

Uh oh!

Releases

Packages

Languages

License

purdue-aalp/FastTree-Artifact

Folders and files

Latest commit

History

Repository files navigation

FastTree

Get Started

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages