Stars
- ONNX Runtime: a cross-platform, high-performance ML inferencing and training accelerator.
- MNN is a blazing-fast, lightweight deep learning framework, battle-tested by business-critical use cases at Alibaba. Full multimodal LLM Android app: [MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open-source components of TensorRT.
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
- A distributed, fast open-source graph database featuring horizontal scalability and high availability.
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
- A machine learning compiler for GPUs, CPUs, and ML accelerators.
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel.