Stars
hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
Machine Learning Engineering Open Book
ONCache: A Cache-Based Low-Overhead Container Overlay Network
cluster data collected from production clusters in Alibaba for cluster management research
A fast and user-transparent parallel simulator implementation for ns-3
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Can large language models provide useful feedback on research papers? A large-scale empirical analysis.
P4 source code for ConWeave load balancing
NS3 simulator for RDMA load balancing
Stable Diffusion web UI
A NS-3 implementation of Poseidon congestion control algorithm (NSDI 2023).
A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
DeSiNe is a modular flow-level network simulator aimed at performance analysis and benchmarking of Quality of Service routing algorithms and traffic engineering extensions.
An online request replication and TCP stream replay tool, ideal for real testing, performance testing, stability testing, stress testing, load testing, smoke testing, and more.
High-performance In-browser LLM Inference Engine
Compilation of P4 exercises, examples, documentation, slides for learning or teaching
Multi-user h5 version, 3rd party ChatGPT web page. Uses OpenAPI official web API.
yinwaii / nccl
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
ns.py: a Pythonic Discrete-Event Network Simulator