- Lugano, Switzerland
Lists (1)
Sort Name ascending (A-Z)
Stars
NUMA-aware multi-CPU multi-GPU data transfer benchmarks
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
A Python framework for accelerated simulation, data generation and spatial computing.
A fast vectorized implementation of the XIELU activation function
Machine Learning Engineering Open Book
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Optimized primitives for collective multi-GPU communication
Convert PyPI entries to Spack package.py
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images
A Slurm cluster using docker-compose
Example codes from the book Parallel Programming With OpenACC
A powerful Python framework for writing and running portable regression tests and benchmarks for HPC systems.
A sequence of Jupyter notebooks featuring the "12 Steps to Navier-Stokes" http://lorenabarba.com/
MPI Cluster Automation Solution using Docker, based on Alpine Linux with MPICH (see IEEE paper)
A book-in-progress about the Linux kernel and its insides.