-
Tencent
- Shanghai
-
17:37
(UTC +08:00)
Stars
A toolkit for speaker diarization.
Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).
[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition"
A generative speech model for daily dialogue.
ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview
Expressive Anechoic Recordings of Speech (EARS)
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
A TTS model capable of generating ultra-realistic dialogue in one pass.
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.
Official Jax Implementation of MaskGIT
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
The personal information dashboard for your terminal
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
Speaker verification evaluation protocols simulating speaker diarisation
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)