Lists (32)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
microphone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
A sound synthesis framework for Python, designed for clear and concise expression of musical ideas
Android compatible real-time voice changer and Altered Auditory Feedback (DAF + FAF) app
Python bindings of WebRTC Audio Processing
spatial signal processing toolkit a.k.a beamforming toolkit 2.0 (BTK2.0)
Impulse response generation based on state-of-the-art geometric sound propagation engine.
Open software for the numerical calculation of head-related transfer functions
Kaldi-compatible online fbank extractor without external dependencies
This repository provides plugins, tools and samples for integrating spatial audio and acoustics into your Unity 3D applications and games.
Spatial Audio 3DOF Head Tracker (requires Arduino Pro Micro + MPU-9250 / MPU-9150)
SummerAsr is a local Chinese automatic speech recognition project written in C++ that can be built standalone with almost no external dependencies.
A Python Room Spatial Impulse Response Ray-Tracing Toolkit
DiffSinger dataset processing tools, including audio processing, labeling.
libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻
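A parametric equalizer does not use fixed frequency points; instead, it divides the whole audio frequency range into several bands for equalization, and the center frequency of each band can be adjusted.
Reborn as an AI worker. In my previous life I was a nobody, coming and going in a hurry, not knowing where I would be born. Yet fate gave me a rare chance and let me be reborn as an AI worker.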
Spherical Microphone array Impulse Response generator (SMIRgen)
The signal generator is a mex-function for MATLAB that can be used to generate the response of a moving sound source and receiver in a reverberant environment.
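A simple demo showing how to use SIMD (Single Instruction, Multiple Data) to optimize and accelerate the FFT algorithm.
Audio/video intercom demo program for Windows (acoustic echo cancellation, noise suppression, voice activity detection, automatic gain control, adaptive jitter buffer)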
2-pass noise reduction, pulled out of Audacity
C++ Audio programming tutorials, focused on VST and JUCE.