🏠
Working from home
  • rednote-hilab
  • Beijing

Starred repositories

145 stars written in C++

An NES emulator in C++

C++ 5,028 1,128 Updated Oct 5, 2025
C++ 4,977 530 Updated Nov 7, 2025

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,951 602 Updated Nov 8, 2025

The BusTub Relational Database Management System (Educational)

C++ 4,693 1,961 Updated Oct 22, 2025

C++ Insights - See your source code with the eyes of a compiler

C++ 4,410 258 Updated Jun 26, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,245 421 Updated Nov 10, 2025

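fastllm is a high-performance LLM inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. Deploying the original full-size, full-precision DeepSeek model on a dual-socket 9004/9005 server with a single GPU reaches 20 tps at single concurrency; the INT4-quantized model reaches 30 tps at single concurrency and 60+ tps with multiple concurrent requests.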

C++ 4,068 412 Updated Oct 28, 2025

C++ implementation of the Python Numpy library

C++ 3,898 580 Updated Sep 30, 2025

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,451 789 Updated Nov 10, 2025

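A great project for campus recruiting (autumn and spring hiring rounds) and internships! Implement a high-performance deep learning inference library from scratch, step by step, with support for inference on models such as llama2, Unet, Yolov5, and Resnet.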

C++ 3,170 349 Updated Jun 22, 2025

A family of header-only, very fast and memory-friendly hashmap and btree containers.

C++ 3,076 296 Updated Oct 24, 2025

Modern concurrency for C++. Tasks, executors, timers and C++20 coroutines to rule them all

C++ 2,696 238 Updated May 1, 2025

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,599 241 Updated May 21, 2025

Nameof operator for modern C++, simply obtain the name of a variable, type, function, macro, and enum

C++ 2,252 120 Updated Oct 14, 2024

Cista is a simple, high-performance, zero-copy C++ serialization & reflection library.

C++ 2,138 149 Updated Oct 20, 2025

Simple, light-weight and easy-to-use asynchronous components

C++ 2,018 288 Updated Oct 10, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,943 149 Updated Nov 8, 2025

Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

C++ 1,871 75 Updated Sep 10, 2025

TinyML AI inference library

C++ 1,870 239 Updated May 10, 2025

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,844 245 Updated Nov 4, 2025

C++14 lock-free queue.

C++ 1,757 203 Updated Nov 10, 2025

An LLM deployment project based on MNN. This project has been merged into MNN.

C++ 1,608 176 Updated Jan 20, 2025

A lightweight parameter server interface

C++ 1,558 549 Updated Jan 11, 2023

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search

C++ 1,543 342 Updated Nov 10, 2025

SIMD Vector Classes for C++

C++ 1,511 152 Updated Jun 6, 2024

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,486 584 Updated Feb 15, 2025

Async++ concurrency framework for C++11

C++ 1,413 202 Updated Oct 11, 2024

Portable header-only C++ low level SIMD library

C++ 1,291 130 Updated Aug 26, 2024

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,261 175 Updated Aug 19, 2025

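workspace is a lightweight asynchronous execution framework based on C++11. It supports general-purpose asynchronous concurrent task execution, priority task scheduling, an adaptive dynamic thread pool, an efficient static thread pool, and an exception-handling mechanism.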

C++ 1,220 184 Updated Jul 16, 2025