- Tokyo, Japan
- https://ahmedmustahid.github.io/html-cv
Stars
Port of OpenAI's Whisper model in C/C++
A library for efficient similarity search and clustering of dense vectors.
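PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)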
ncnn is a high-performance neural network inference framework optimized for the mobile platform
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
A SQLite extension for efficient vector search, based on Faiss!
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…
A bounded single-producer single-consumer wait-free and lock-free queue written in C++11
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
Customizable automatic UML diagram generator for C++ based on Clang.
Notes about modern C++, C++11, C++14 and C++17, Boost Libraries, ABI, foreign function interface and reference cards.
petermost / Sourcetrail
Forked from CoatiSoftware/Sourcetrail
Sourcetrail - free and open-source interactive source explorer
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
A Libtorch implementation of the YOLO v3 object detection algorithm
A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)
simple C++11 ring buffer implementation, allocated and evaluated at compile time
C++ High Performance Second Edition, published by Packt
Real time Fight Detection Based on 2D Pose Estimation and RNN Action Recognition