-
AudioClassification-PaddlePaddle Public
Forked from yeyupiaoling/AudioClassification-PaddlePaddle
Audio classification implemented with PaddlePaddle, supporting models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods
Python Apache License 2.0 Updated Mar 2, 2025 -
s3prl Public
Forked from s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Python Apache License 2.0 Updated Oct 18, 2023 -
awesome-diarization Public
Forked from wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Apache License 2.0 Updated Jul 4, 2023 -
DS-TDNN Public
Forked from YChenL/DS-TDNN
Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
Python Updated Apr 20, 2023 -
swav Public
Forked from facebookresearch/swav
PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882
Python Other Updated Apr 13, 2023 -
sort-google-scholar Public
Forked from WittmannF/sort-google-scholar
Sorting Google Scholar search results based on the number of citations
Jupyter Notebook Updated Apr 6, 2023 -
DCA-PLDA Public
Forked from luferrer/DCA-PLDA
Discriminative Condition-Aware PLDA
-
tuning_playbook Public
Forked from google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Other Updated Feb 11, 2023 -
SpeechT5 Public
Forked from microsoft/SpeechT5
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Python MIT License Updated Feb 9, 2023 -
wespeaker Public
Forked from wenet-e2e/wespeaker
Research and Production Oriented Speaker Recognition Toolkit
Python Apache License 2.0 Updated Dec 6, 2022 -
ast Public
Forked from YuanGongND/ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Dec 2, 2022 -
audio Public
Forked from pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Python BSD 2-Clause "Simplified" License Updated Nov 11, 2022 -
PaddleSpeech Public
Forked from PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NA…
Python Apache License 2.0 Updated Nov 10, 2022 -
CSASR_Challenge Public
Forked from MagicHub-io/CSASR_Challenge
Chinese-English code-switching speech recognition
Shell Updated Sep 26, 2022 -
speech_dataset Public
Forked from double22a/speech_dataset
The dataset of Speech Recognition
Apache License 2.0 Updated Aug 19, 2022 -
pytorch-book Public
Forked from chenyuntc/pytorch-book
PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch:入门与实战》, "Deep Learning Framework PyTorch: Introduction and Practice")
Jupyter Notebook MIT License Updated Aug 14, 2022 -
open-speech-corpora Public
Forked from coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
MIT License Updated Jul 27, 2022 -
ECAPA-TDNN Public
Forked from TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Python MIT License Updated Jun 1, 2022 -
python_speech_features Public
Forked from jameslyons/python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
Python MIT License Updated Oct 20, 2021 -
lihang-code Public
Forked from fengdu78/lihang-code
Code implementation of 《统计学习方法》 ("Statistical Learning Methods")
Jupyter Notebook Updated May 31, 2021 -
AESRC2020 Public
Forked from R1ckShi/AESRC2020
Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Python Apache License 2.0 Updated Oct 9, 2020 -
kaldi Public
Forked from kaldi-asr/kaldi
This is the official location of the Kaldi project.
Shell Other Updated Jun 23, 2020 -
zhvoice Public
Forked from fighting41love/zhvoice
Chinese voice corpus with clearer, more natural speech; includes 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.
Updated Jun 12, 2020 -
speaker-recognition-py3 Public
Forked from crouchred/speaker-recognition-py3
Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)
Python Apache License 2.0 Updated Mar 13, 2019 -
voiceprint Public
Forked from RDShi/voiceprint
A simple model implemented with TensorFlow for voiceprint
Python Updated Dec 14, 2018