hpc
Repository for building the NCCL OFI plugin from AWS and NVIDIA
A tool for bandwidth measurements on NVIDIA GPUs.
This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.
SC24 Deep Learning at Scale Tutorial Material
The reference implementation of the Linux FUSE (Filesystem in Userspace) interface
MLCommons Science benchmarking working group
Containers for running ML applications on TACC GPU systems
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
collection of benchmarks to measure basic GPU capabilities
A half-day lesson on tuning usage of LAMMPS for large-scale HPC systems
OVIS/LDMS High Performance Computing monitoring, analysis, and visualization project.
A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.
This repository contains the results and code for the MLPerf™ HPC Training v3.0 benchmark.