Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Protocol Buffers - Google's data interchange format
Cross-platform, customizable ML solutions for live and streaming media.
PlayStation 4 emulator for Windows, Linux and macOS written in C++
Official source code of FreeCAD, a free and opensource multiplatform 3D parametric modeler.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
FlashMLA: Efficient Multi-head Latent Attention Kernels
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
High-speed Large Language Model Serving for Local Deployment
JUCE is an open-source cross-platform C++ application framework for desktop and mobile applications, including VST, VST3, AU, AUv3, LV2 and AAX audio plug-ins.
🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.
lightweight, standalone C++ inference engine for Google's Gemma models.
Facebook AI Research's Automatic Speech Recognition Toolkit
Adds AMD FSR 3 Frame Generation to games by replacing Nvidia DLSS Frame Generation (nvngx_dlssg).
Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++
A lightweight library for portable low-level GPU computation using WebGPU.