Installation

“What I cannot create I do not understand” - This is why I started Penny, my own version of NCCL.

If you want to read about it, there is a worklog on my blogpost where I describe a step by step process of creating it:

Installation

To install Penny you need to export NVSHMEM_LIB and NVSHMEM_INC environment variables that point to the /lib and /include directories of your NVSHMEM installation

Afterwards just

git clone https://github.com/SzymonOzog/Penny.git
cd Penny
pip install -e . --no-build-isolation

Using Low Latency Intranode Allreduce

Penny provides a drop in replacement for the vLLM/SGLang custom all reduce class that allows it to run multinode. For SGLang there is a patch that you can apply to get it running:

cd YOUR_SGLANG_DIR
git apply YOUR_PENNY_DIR/extra/sglang.patch

You also need to export the number of nodes that you're running(Currently up to 4 nodes are templated and tested, for more edit extra/custom_all_reduce.cuh at your own risk)

export NNODES=2

Afterwards you can serve your favourite model with Low Latency allreduce

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
csrc		csrc
extra		extra
penny		penny
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Installation

Using Low Latency Intranode Allreduce

About

Uh oh!

Releases

Packages

Languages

License

SzymonOzog/Penny

Folders and files

Latest commit

History

Repository files navigation

Installation

Using Low Latency Intranode Allreduce

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages