Margen67/SageAttention

 
 

SageAttention fork for build system integration

This repo makes it easy to build SageAttention for multiple Python, PyTorch, and CUDA versions and to distribute the resulting wheels to other people. See the releases for prebuilt wheels, and the workflow for how they are built on Windows.
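
To illustrate what building for multiple Python, PyTorch, and CUDA versions involves, here is a minimal sketch that enumerates a build matrix. The version lists and the `build_matrix` helper are placeholders chosen for illustration, not the set of combinations this repo actually builds, and not every combination is guaranteed to exist as a published PyTorch build.

```python
from itertools import product

# Hypothetical version lists, for illustration only.
PYTHON_VERSIONS = ["3.10", "3.11", "3.12"]
TORCH_VERSIONS = ["2.7.1"]
CUDA_TAGS = ["cu126", "cu128"]


def build_matrix(pythons, torches, cuda_tags):
    """Return one dict per (Python, PyTorch, CUDA) combination to build."""
    return [
        {"python": py, "torch": torch, "cuda": cuda}
        for py, torch, cuda in product(pythons, torches, cuda_tags)
    ]


if __name__ == "__main__":
    for entry in build_matrix(PYTHON_VERSIONS, TORCH_VERSIONS, CUDA_TAGS):
        print(f"python {entry['python']}, torch {entry['torch']}+{entry['cuda']}")
```

Each entry corresponds to one wheel, which is why the number of builds grows quickly as versions are added.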

If you only need to build and run on your own machine, clone this repo, install the dependencies listed in pyproject.toml (making sure to install the correct torch build, such as torch 2.7.1+cu128), then run python setup.py install. Running setup.py directly avoids the environment checks of pip.
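
The local build steps can be sketched as a small script that turns a torch spec such as 2.7.1+cu128 into the commands to run. This is a sketch under stated assumptions: the helper names are hypothetical, it assumes the CUDA-specific torch build is pulled from the PyTorch wheel index at download.pytorch.org, and it only prints the commands rather than running them. The other dependencies in pyproject.toml are not listed here and still need to be installed.

```python
import shlex
import sys


def parse_torch_spec(spec: str) -> tuple[str, str]:
    """Split a spec like '2.7.1+cu128' into ('2.7.1', 'cu128')."""
    version, sep, cuda_tag = spec.partition("+")
    if not sep or not version or not cuda_tag:
        raise ValueError(f"expected '<version>+<cuda tag>', got {spec!r}")
    return version, cuda_tag


def local_build_commands(spec: str, python: str = sys.executable) -> list[list[str]]:
    """Return the commands for a local build; run them from the repo root."""
    version, cuda_tag = parse_torch_spec(spec)
    return [
        # Assumption: install the matching CUDA build of torch from the PyTorch index.
        [python, "-m", "pip", "install", f"torch=={version}",
         "--index-url", f"https://download.pytorch.org/whl/{cuda_tag}"],
        # Build and install SageAttention directly through setup.py.
        [python, "setup.py", "install"],
    ]


if __name__ == "__main__":
    # Print the commands instead of executing them, so they can be reviewed first.
    for cmd in local_build_commands("2.7.1+cu128"):
        print(shlex.join(cmd))
```

Printing the commands keeps the sketch side-effect free; to execute them, each list could be passed to `subprocess.run(cmd, check=True)` from the cloned repo's root directory.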

About

Fork of SageAttention for Windows wheels and easy installation

Languages

  • Cuda 62.3%
  • Python 32.8%
  • C++ 2.4%
  • C 2.2%
  • Shell 0.3%