Starred repositories
C++ based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
GoogleTest - Google Testing and Mocking Framework
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
An open-source C++ library developed and used at Facebook.
Productive, portable, and performant GPU programming in Python.
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
FlashMLA: Efficient Multi-head Latent Attention Kernels
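A Chinese-language tutorial on C++ templates. Unlike the well-known book C++ Templates, this series teaches C++ templates as a Turing-complete language, aiming to help readers gain a thorough grasp of metaprogramming. (Work in progress)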
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
EASTL stands for Electronic Arts Standard Template Library. It is an extensive and robust implementation that has an emphasis on high performance.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
A collection of C++ HTTP libraries including an easy to use HTTP server.
Piccolo (formerly Pilot) – mini game engine for games104
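Source code for the second edition of 《剑指Offer》 (Coding Interviews: typical programming interview questions explained by interviewers from leading companies)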
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Optimized primitives for collective multi-GPU communication
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
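fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. On a dual-socket 9004/9005 server with a single GPU, the full-size, full-precision original DeepSeek model reaches 20 tps at single concurrency; the INT4-quantized model reaches 30 tps at single concurrency and 60+ tps with multiple concurrent requests.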
A machine learning compiler for GPUs, CPUs, and ML accelerators