akbar2habibullah's starred repositories: 30 written in C++

LLM inference in C/C++

C++ 89,180 13,577 Updated Nov 6, 2025

Port of OpenAI's Whisper model in C/C++

C++ 44,285 4,893 Updated Nov 1, 2025

MLX: An array framework for Apple silicon

C++ 22,721 1,379 Updated Nov 6, 2025

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,233 4,346 Updated Nov 6, 2025

Tensor library for machine learning

C++ 13,384 1,377 Updated Nov 4, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,049 1,842 Updated Nov 6, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,846 896 Updated Sep 30, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,442 957 Updated Oct 24, 2025

μWebSockets for Node.js back-ends 🤘

C++ 8,847 612 Updated Nov 5, 2025

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…

C++ 8,643 1,245 Updated Nov 5, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,377 450 Updated Aug 2, 2025

MariaDB server is a community developed fork of MySQL server. Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliver the most featureful, stab…

C++ 6,632 1,885 Updated Nov 6, 2025

The QuantLib C++ library

C++ 6,451 2,045 Updated Nov 5, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,225 420 Updated Nov 6, 2025

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,223 573 Updated Nov 4, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 3,859 299 Updated Nov 6, 2025

Local AI API Platform

C++ 2,761 181 Updated Jul 4, 2025

General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for…

C++ 2,373 176 Updated Oct 5, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,468 674 Updated Nov 6, 2025

Swift API for MLX

C++ 1,419 122 Updated Nov 5, 2025

Tuned OpenCL BLAS

C++ 1,154 208 Updated Sep 26, 2025

Universal model exchange and serialization format for decision tree forests

C++ 794 105 Updated Nov 5, 2025

OpenCL SDK

C++ 707 151 Updated Sep 2, 2025

Trainable fast and memory-efficient sparse attention

C++ 432 38 Updated Nov 6, 2025

ROCm Communication Collectives Library (RCCL)

C++ 397 188 Updated Nov 6, 2025

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

C++ 312 26 Updated May 22, 2025

SIMD quantization kernels

C++ 92 4 Updated Sep 7, 2025

Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X

C++ 71 5 Updated Nov 5, 2025

An Arduino-based six legged robot with extensive documentation.

C++ 10 Updated Jan 12, 2025