Lists (32)
Sort Name ascending (A-Z)
academic
acoustic echo cancellation
AIGC
audio codec
audio codecs
audio separation
audio tools
bandwidth extension
beamforming
computer vision
deep learning
diffusion
entertainments
hearing aid
LLM
mircophone array
music tools
noise reduction
packet loss compensation
programming related
simulation tools
singing voice tools
sound source localization
spatial audio
speaker recognition
speech dereverberation
speech diarization
speech frontend
speech recognition
speech separation
speech signal processing
speech voice tools
Starred repositories
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
A collection of resources and papers on Diffusion Models
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
How do we integrate AI generation tools into actual work? | 关于 Ai 绘画的Wiki | Wiki about Ai painting | Prompts Engineering| 指南 Guide | Seeking Maintainer&Translator🙌
Download twitter media with only one-click.
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
A collection of datasets for the purpose of emotion recognition/detection in speech.
Acoustic Echo Cancellation with Nerual Kalman Filtering
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…
Coders way of wishing Happy Birthday
An Open-Source Project to Unify Audio Processing and Generation
Repository containing samples produced by the method proposed in "Multi-channel separation of dynamic speech and sound events" and presented at Interspeech 2023.
UniSE: A Unified Framework for Decoder-Only Autoregressive LM-Based Speech Enhancement
Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024