Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
A flexible, high-performance 3D simulator for Embodied AI research.
OpenAL Soft is a software implementation of the OpenAL 3D audio API.
快速入门CMake,通过例程学习语法。在线阅读地址:https://sfumecjf.github.io/cmake-examples-Chinese/
llm deploy project based mnn. This project has merged into MNN.
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…
A high-quality speech analysis, manipulation and synthesis system
A lightweight and versatile audio player
Stable Diffusion in NCNN with c++, supported txt2img and img2img
an open-source implementation of sequence-to-sequence based speech processing engine
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
A collection of spatial audio related VST/LV2 plug-ins developed using JUCE and the Spatial_Audio_Framework
autocorrelation-based O(NlogN) pitch detection
📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.
SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be…
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
Benchmarking Neural Network Inference on Mobile Devices
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech