shamuiscoding
🎯
Focusing

Highlights

  • Pro

16 results for source starred repositories written in C++

LLM inference in C/C++

C++ · 89,237 stars · 13,583 forks · Updated Nov 6, 2025

Truly independent web browser

C++ · 55,241 stars · 2,451 forks · Updated Nov 6, 2025

Port of OpenAI's Whisper model in C/C++

C++ · 44,294 stars · 4,894 forks · Updated Nov 1, 2025

MLX: An array framework for Apple silicon

C++ · 22,723 stars · 1,378 forks · Updated Nov 6, 2025

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ · 18,303 stars · 3,535 forks · Updated Nov 6, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ · 13,432 stars · 2,097 forks · Updated Nov 6, 2025

Tensor library for machine learning

C++ · 13,413 stars · 1,379 forks · Updated Nov 4, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ · 12,332 stars · 2,264 forks · Updated Sep 24, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ · 12,052 stars · 1,844 forks · Updated Nov 6, 2025

A distributed, fast open-source graph database featuring horizontal scalability and high availability

C++ · 11,796 stars · 1,271 forks · Updated Oct 22, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ · 9,442 stars · 957 forks · Updated Oct 24, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ · 3,657 stars · 679 forks · Updated Nov 6, 2025

Kernels & AI inference engine for phones

C++ · 3,637 stars · 214 forks · Updated Nov 6, 2025

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ · 2,699 stars · 559 forks · Updated Nov 5, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ · 1,935 stars · 148 forks · Updated Nov 5, 2025

Perceptual Quality Estimator for speech and audio

C++ · 827 stars · 140 forks · Updated May 17, 2025