Skip to content
View marchNA's full-sized avatar

Block or report marchNA

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
16 stars written in C++
Clear filter

LLM inference in C/C++

C++ 102,533 16,563 Updated Apr 8, 2026

这是一个用于显示当前网速、CPU及内存利用率的桌面悬浮窗软件,并支持任务栏显示,支持更换皮肤。

C++ 43,849 3,642 Updated Apr 6, 2026

A library for efficient similarity search and clustering of dense vectors.

C++ 39,653 4,322 Updated Apr 8, 2026

Android real-time display control software

C++ 29,145 3,502 Updated Apr 3, 2026

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,809 5,982 Updated Apr 8, 2026

MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

C++ 14,825 2,273 Updated Apr 8, 2026

A distributed, fast open-source graph database featuring horizontal scalability and high availability

C++ 12,112 1,302 Updated Oct 22, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 9,280 555 Updated Jan 24, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,058 656 Updated Apr 8, 2026

Tesseract Open Source OCR Engine (main repository)

C++ 4,210 537 Updated Feb 20, 2026

Spot Micro Quadruped Project

C++ 2,047 486 Updated Mar 20, 2021

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,824 198 Updated Mar 17, 2026

Large-scale LLM inference engine

C++ 1,686 190 Updated Mar 12, 2026

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,545 207 Updated Jul 18, 2025

follow me to study modern c++

C++ 953 321 Updated May 29, 2024

ONNX Model Exporter for PaddlePaddle

C++ 911 193 Updated Mar 18, 2026