pysheeet Public
Python Cheat Sheet
Awesome-ML-SYS-Tutorial Public
Forked from zhaochenyang20/Awesome-ML-SYS-Tutorial
My learning notes/codes for ML SYS.
Python Apache License 2.0 Updated Dec 13, 2025
hello-agents Public
Forked from datawhalechina/hello-agents
📚 "Building Agents from Scratch": a from-scratch tutorial on the principles and practice of agents
Python Other Updated Dec 13, 2025
Foundations-of-LLMs Public
Forked from ZJU-LLMs/Foundations-of-LLMs
A book for Learning the Foundations of LLMs
Other Updated Dec 12, 2025
🐹 Dig deep like a mole to optimize your Mac.
Shell MIT License Updated Dec 11, 2025
LeetCUDA Public
Forked from xlite-dev/LeetCUDA
📚 LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda GNU General Public License v3.0 Updated Dec 4, 2025
asyncio Public
Forked from netcan/asyncio
asyncio is a c++20 library to write concurrent code using the async/await syntax.
C++ MIT License Updated Nov 28, 2025
efa-dp-direct Public
Forked from amzn/efa-dp-direct
Elastic Fabric Adapter (EFA) data path interfaces and implementations
Cuda Apache License 2.0 Updated Nov 20, 2025
efa-rdma Public
High-performance GPU-to-GPU communication library using AWS EFA and RDMA for distributed deep learning
Cuda Updated Nov 8, 2025
pplx-garden Public
Forked from perplexityai/pplx-garden
Perplexity open source garden for inference technology
Rust MIT License Updated Nov 5, 2025
mscclpp Public
Forked from microsoft/mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
C++ MIT License Updated Nov 5, 2025
async-book Public
Forked from rust-lang/async-book
Asynchronous Programming in Rust
Shell MIT License Updated Nov 2, 2025
nofx Public
Forked from NoFxAiOS/nofx
NOFX: Defining the Next-Generation AI Trading Operating System. A multi-exchange AI trading platform (Binance/Hyperliquid/Aster) with multi-AI competition (deepseek/qwen/claude) self-evolution, and re…
Go Updated Nov 2, 2025
nvshmem-offical Public
Forked from NVIDIA/nvshmem
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
C++ Other Updated Oct 31, 2025
multi-gpu-programming-models Public
Forked from NVIDIA/multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Cuda BSD 3-Clause "New" or "Revised" License Updated Oct 23, 2025
torchcomms Public
Forked from meta-pytorch/torchcomms
torchcomms: a modern PyTorch communications API
C++ BSD 3-Clause "New" or "Revised" License Updated Oct 22, 2025
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamples
CUDA Library Samples
Cuda BSD 3-Clause "New" or "Revised" License Updated Oct 17, 2025
perftest Public
Forked from linux-rdma/perftest
Infiniband Verbs Performance Tests
C Other Updated Oct 5, 2025
TradingAgents-CN Public
Forked from hsliuping/TradingAgents-CN
A Chinese-language financial trading framework based on multi-agent LLMs: the enhanced Chinese edition of TradingAgents
Python Apache License 2.0 Updated Oct 4, 2025