Stars
Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Crack WPA/WPA2 Wi-Fi Routers with Airodump-ng and Aircrack-ng/Hashcat
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Machine Learning Engineering Open Book
A Easy-to-understand TensorOp Matmul Tutorial
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs
《Effective Modern C++》- 完成翻译
A Datacenter Scale Distributed Inference Serving Framework
The official GitHub page for the survey paper "A Survey of Large Language Models".
One second to read GitHub code with VS Code.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.