Jason-cs18

Organizations

@MLSysTeam

Starred repositories

98 stars written in C++

LLM inference in C/C++

C++ 94,500 14,784 Updated Feb 6, 2026

Things About C++

C++ 42,848 8,834 Updated Jun 14, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 39,006 4,218 Updated Feb 6, 2026

Caffe: a fast open framework for deep learning.

C++ 34,835 18,570 Updated Jul 31, 2024

OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

C++ 33,755 8,057 Updated Aug 3, 2024

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 27,973 8,843 Updated Feb 5, 2026

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 26,722 4,101 Updated Jun 19, 2025

A brief computer graphics / rendering course

C++ 23,192 2,209 Updated Nov 21, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,756 4,394 Updated Feb 5, 2026

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 14,093 2,187 Updated Feb 5, 2026

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,669 2,316 Updated Feb 4, 2026

Android NDK samples with Android Studio

C++ 10,466 4,257 Updated Oct 3, 2025

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

C++ 10,094 4,755 Updated May 15, 2024

A microbenchmark support library

C++ 9,995 1,738 Updated Feb 3, 2026

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 9,656 3,015 Updated Feb 6, 2026

cuDF - GPU DataFrame Library

C++ 9,481 1,004 Updated Feb 6, 2026

High-speed Large Language Model Serving for Local Deployment

C++ 8,636 482 Updated Jan 24, 2026

Cartographer is a system that provides real-time simultaneous localization and mapping (SLAM) in 2D and 3D across multiple platforms and sensor configurations.

C++ 7,772 2,325 Updated Jan 5, 2024

Implementation of popular deep learning networks with TensorRT network definition API

C++ 7,674 1,868 Updated Feb 2, 2026

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge

C++ 7,225 1,628 Updated May 22, 2025

Transformer related optimization, including BERT, GPT

C++ 6,392 929 Updated Mar 27, 2024

A Python-embedded modeling language for convex optimization problems.

C++ 6,091 1,149 Updated Feb 6, 2026

A C++ standalone library for machine learning

C++ 5,440 502 Updated Jan 12, 2026

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

C++ 5,036 825 Updated Jun 17, 2024

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its…

C++ 4,620 773 Updated May 9, 2025

Tengine is a lite, high-performance, modular inference engine for embedded devices

C++ 4,504 982 Updated Mar 6, 2025

An optimization-based multi-sensor state estimator

C++ 4,370 1,557 Updated May 23, 2024

🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

C++ 4,365 773 Updated Jan 18, 2026

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,339 579 Updated Feb 5, 2026

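fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and hybrid-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. On a dual-socket 9004/9005 server with a single GPU, the original full-size, full-precision DeepSeek model reaches 20 tps at single concurrency; the INT4-quantized model reaches 30 tps at single concurrency and 60+ tps with multiple concurrent requests.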

C++ 4,144 418 Updated Jan 29, 2026