Skip to content
View caopeirui's full-sized avatar

Block or report caopeirui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 555 58 Updated Apr 27, 2026

Machine Learning Engineering Open Book

Python 18,105 1,149 Updated May 18, 2026

ONCache: A Cache-Based Low-Overhead Container Overlay Network

C 21 7 Updated Jun 7, 2025
C++ 1 Updated Apr 8, 2024

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 2,087 467 Updated Jun 3, 2026

A fast and user-transparent parallel simulator implementation for ns-3

C++ 108 21 Updated Nov 4, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 16,925 2,396 Updated Sep 3, 2025

Can large language models provide useful feedback on research papers? A large-scale empirical analysis.

Python 533 53 Updated Jan 11, 2024

P4 source code for ConWeave load balancing

P4 28 14 Updated Oct 27, 2023

NS3 simulator for RDMA load balancing

Python 101 40 Updated Oct 20, 2024

Maximum MultiPath TCP (MMPTCP)

C++ 6 8 Updated Jul 22, 2019

LLM inference in C/C++

C++ 116,342 19,531 Updated Jun 13, 2026

Stable Diffusion web UI

Python 163,668 30,368 Updated Mar 2, 2026

A NS-3 implementation of Poseidon congestion control algorithm (NSDI 2023).

Python 34 6 Updated Jan 28, 2024

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future

285 20 Updated Aug 13, 2023

DeSiNe is a modular flow-level network simulator aimed at performance analysis and benchmarking of Quality of Service routing algorithms and traffic engineering extensions.

C++ 3 4 Updated Apr 18, 2016

An online request replication and TCP stream replay tool, ideal for real testing, performance testing, stability testing, stress testing, load testing, smoke testing, and more.

C 4,676 1,025 Updated Jun 18, 2025

High-performance In-browser LLM Inference Engine

TypeScript 18,180 1,309 Updated Jun 9, 2026

Compilation of P4 exercises, examples, documentation, slides for learning or teaching

Python 598 197 Updated Oct 9, 2023

《操作系统真象还原》源码及学习笔记(os-elephant)还原真相

C 476 143 Updated Jul 6, 2018

Linux kernel source tree

C 236,286 62,686 Updated Jun 13, 2026

Multi-user h5 version, 3rd party ChatGPT web page. Uses OpenAPI official web API.

JavaScript 128 44 Updated Feb 10, 2023

Optimized primitives for collective multi-GPU communication

C++ 1 Updated Jul 29, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,689 2,244 Updated Dec 1, 2025

ns.py: a Pythonic Discrete-Event Network Simulator

Python 161 34 Updated Apr 1, 2026
Next