Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, L…

C++ 1,539 197 Updated Oct 20, 2025

Voine / ChatWaifu_Mobile

移动版二次元 AI 老婆聊天器

C++ 1,358 146 Updated Jun 19, 2023

mmorise / World

A high-quality speech analysis, manipulation and synthesis system

C++ 1,279 261 Updated Feb 21, 2025

YannickJadoul / Parselmouth

Praat in Python, the Pythonic way

C++ 1,205 129 Updated Nov 10, 2025

audacious-media-player / audacious

A lightweight and versatile audio player

C++ 1,080 134 Updated Oct 22, 2025

EdVince / Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

C++ 1,052 103 Updated Jul 3, 2023

athena-team / athena

an open-source implementation of sequence-to-sequence based speech processing engine

C++ 963 201 Updated Dec 2, 2022

alibaba / rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

C++ 923 122 Updated Nov 12, 2025

juandagilc / Audio-Effects

Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.

C++ 839 142 Updated Oct 16, 2020

google / visqol

Perceptual Quality Estimator for speech and audio

C++ 830 140 Updated May 17, 2025

leomccormack / SPARTA

A collection of spatial audio related VST/LV2 plug-ins developed using JUCE and the Spatial_Audio_Framework

C++ 682 50 Updated Nov 2, 2025

sevagh / pitch-detection

autocorrelation-based O(NlogN) pitch detection

C++ 630 74 Updated Jan 7, 2025

RapidAI / RapidASR

📣 商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.

C++ 588 69 Updated May 15, 2024

resonance-audio / resonance-audio

Resonance Audio Source Code

C++ 521 112 Updated Sep 8, 2022

huakunyang / SummerTTS

SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目，可以本地运行不需要网络，而且没有额外的依赖，一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be…

C++ 503 93 Updated Jul 10, 2025

KDE / kcachegrind

GUI to profilers such as Valgrind

C++ 485 48 Updated Nov 9, 2025

ROCm / composable_kernel

Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

C++ 483 249 Updated Nov 12, 2025

ehabets / RIR-Generator

Generating room impulse responses

C++ 473 150 Updated Oct 24, 2025

shichaog / WebRTC-audio-processing

webrtc audio processing

C++ 409 142 Updated May 10, 2020

XiaoMi / mobile-ai-bench

Benchmarking Neural Network Inference on Mobile Devices

C++ 383 58 Updated Apr 10, 2023

xanguera / BeamformIt

BeamformIt acoustic beamforming software

C++ 370 113 Updated May 19, 2020

jzi040941 / PercepNet

Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech

C++ 357 94 Updated Jan 22, 2023

Previous Next

nanless

Lists (32)

academic

acoustic echo cancellation

AIGC

audio codec

audio codecs

audio separation

audio tools

bandwidth extension

beamforming

computer vision

deep learning

diffusion

entertainments

hearing aid

LLM

mircophone array

music tools

noise reduction

packet loss compensation

programming related

simulation tools

singing voice tools

sound source localization

spatial audio

speaker recognition

speech dereverberation

speech diarization

speech frontend

speech recognition

speech separation

speech signal processing

speech voice tools

Starred repositories

LaTeX

noise-reduction