nccl
Here are 45 public repositories matching this topic...
Experimental Explicit Communications API for Kokkos
Updated Dec 18, 2025 - C++
EUMaster4HPC student challenge group 7 - EuroHPC Summit 2024 Antwerp
Updated Apr 14, 2024 - Cuda
Blood Cell Simulation server
Updated Jan 29, 2024 - C++
A tutorial for installing CUDA (v11.8) and cuDNN (8.6.9) to enable GPU programming with PyTorch. It also covers setting up NCCL for distributed multi-GPU DNN model training.
Updated Apr 14, 2025 - Jupyter Notebook
A practical model (with math + Python) to tell whether you’re compute-, memory-, or network-bound, and what to buy next
Updated Sep 4, 2025 - Jupyter Notebook
Default Docker image used to run experiments on csquare.run.
Updated Mar 6, 2023 - Dockerfile
Advanced High Performance Computing in C with OpenMP, CUDA, MPI and NCCL. The project folder contains my final project for the special course: a Jacobi solver for the Poisson partial differential equation, implemented with OpenMP on the CPU, with CUDA on a single GPU, and with CUDA, MPI and NCCL on multiple GPUs.
Updated Jun 20, 2024 - C++
KAI Data Center Builder
Updated Aug 28, 2025 - Makefile
jupyter/scipy-notebook with CUDA Toolkit, cuDNN, NCCL, and TensorRT
Updated Jul 15, 2019 - Dockerfile
Distributed deep learning framework based on pytorch/numba/nccl and zeromq.
Updated Aug 10, 2023 - Python
Single-node data parallelism in Julia with CUDA
Updated Nov 18, 2024 - Julia
NCCL pairwise communication benchmarking and topology visualization on multi‑node GPU clusters.
Updated Nov 16, 2025 - Python
Library of multi-GPU matrix math operations using Nvidia NCCL.
Updated Sep 9, 2020 - Cuda