SuperCB

🏠

Working from home

CuiBo SuperCB

🏠

Working from home

Learning Machine Learning System

48 followers · 176 following

rednote-hilab
Beijing

Achievements

Lists (1)

Sort

MLsys

Starred repositories

145 stars written in C++

Clear filter

amhndu / SimpleNES

An NES emulator in C++

C++ 5,028 1,128 Updated Oct 5, 2025

google / tcmalloc

C++ 4,977 530 Updated Nov 7, 2025

microsoft / SPTAG

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,951 602 Updated Nov 8, 2025

cmu-db / bustub

The BusTub Relational Database Management System (Educational)

C++ 4,693 1,961 Updated Oct 22, 2025

andreasfertig / cppinsights

C++ Insights - See your source code with the eyes of a compiler

C++ 4,410 258 Updated Jun 26, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,245 421 Updated Nov 10, 2025

ztxz16 / fastllm

fastllm是后端无依赖的高性能大模型推理库。同时支持张量并行推理稠密模型和混合模式推理MOE模型，任意10G以上显卡即可推理满血DeepSeek。双路9004/9005服务器+单显卡部署DeepSeek满血满精度原版模型，单并发20tps；INT4量化模型单并发30tps，多并发可达60+。

C++ 4,068 412 Updated Oct 28, 2025

dpilger26 / NumCpp

C++ implementation of the Python Numpy library

C++ 3,898 580 Updated Sep 30, 2025

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,451 789 Updated Nov 10, 2025

zjhellofss / KuiperInfer

校招、秋招、春招、实习好项目！带你从零实现一个高性能的深度学习推理库，支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 3,170 349 Updated Jun 22, 2025

greg7mdp / parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

C++ 3,076 296 Updated Oct 24, 2025

David-Haim / concurrencpp

Modern concurrency for C++. Tasks, executors, timers and C++20 coroutines to rule them all

C++ 2,696 238 Updated May 1, 2025

CVCUDA / CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,599 241 Updated May 21, 2025

Neargye / nameof

Nameof operator for modern C++, simply obtain the name of a variable, type, function, macro, and enum

C++ 2,252 120 Updated Oct 14, 2024

felixguendling / cista

Cista is a simple, high-performance, zero-copy C++ serialization & reflection library.

C++ 2,138 149 Updated Oct 20, 2025

alibaba / async_simple

Simple, light-weight and easy-to-use asynchronous components

C++ 2,018 288 Updated Oct 10, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,943 149 Updated Nov 8, 2025

ashvardanian / less_slow.cpp

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,871 75 Updated Sep 10, 2025

uTensor / uTensor

TinyML AI inference library

C++ 1,870 239 Updated May 10, 2025

flexflow / flexflow-train

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

max0x7ba / atomic_queue

C++14 lock-free queue.

C++ 1,757 203 Updated Nov 10, 2025

wangzhaode / mnn-llm

llm deploy project based mnn. This project has merged into MNN.

C++ 1,608 176 Updated Jan 20, 2025

dmlc / ps-lite

A lightweight parameter server interface

C++ 1,558 549 Updated Jan 11, 2023

microsoft / DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,543 342 Updated Nov 10, 2025

VcDevel / Vc

SIMD Vector Classes for C++

C++ 1,511 152 Updated Jun 6, 2024

gpgpu-sim / gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,486 584 Updated Feb 15, 2025