Stars
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
FlatBuffers: Memory Efficient Serialization Library
《Machine Learning Systems: Design and Implementation》- Chinese Version
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
Following the RISC-V IME extension standard, and reusing Vector register resources, these instructions can bring more than a tenfold performance improvement to AI applications at a very small hardw…
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
AddressSanitizer, ThreadSanitizer, MemorySanitizer
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Play with MLIR right in your browser
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Unofficial mirror of sourceware binutils-gdb repository. Updated daily.
Development repository for the Triton language and compiler
Backward compatible ML compute opset inspired by HLO/MHLO
A model compilation solution for various hardware
RISC-V cryptography extensions standardisation work.
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch