wht2020

2 followers · 11 following

AudioClassification-PaddlePaddle Public
Forked from yeyupiaoling/AudioClassification-PaddlePaddle

Audio classification implemented with PaddlePaddle, supporting models such as EcapaTdnn, PANNS, TDNN, Res2Net, and ResNetSE, along with multiple preprocessing methods

Python Apache License 2.0 Updated Mar 2, 2025
s3prl Public
Forked from s3prl/s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python Apache License 2.0 Updated Oct 18, 2023
awesome-diarization Public
Forked from wq2012/awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache License 2.0 Updated Jul 4, 2023
DS-TDNN Public
Forked from YChenL/DS-TDNN

Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch

Python Updated Apr 20, 2023
swav Public
Forked from facebookresearch/swav

PyTorch implementation of SwAV https://arxiv.org/abs/2006.09882

Python Other Updated Apr 13, 2023
sort-google-scholar Public
Forked from WittmannF/sort-google-scholar

Sorting Google Scholar search results based on the number of citations

Jupyter Notebook Updated Apr 6, 2023
DCA-PLDA Public
Forked from luferrer/DCA-PLDA

Discriminative Condition-Aware PLDA

Python 1 Other Updated Apr 4, 2023
tuning_playbook Public
Forked from google-research/tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

Other Updated Feb 11, 2023
SpeechT5 Public
Forked from microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python MIT License Updated Feb 9, 2023
wespeaker Public
Forked from wenet-e2e/wespeaker

Research and Production Oriented Speaker Recognition Toolkit

Python Apache License 2.0 Updated Dec 6, 2022
ast Public
Forked from YuanGongND/ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook BSD 3-Clause "New" or "Revised" License Updated Dec 2, 2022
audio Public
Forked from pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python BSD 2-Clause "Simplified" License Updated Nov 11, 2022
PaddleSpeech Public
Forked from PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including SOTA/Streaming ASR with punctuation, influential TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NA…

Python Apache License 2.0 Updated Nov 10, 2022
CSASR_Challenge Public
Forked from MagicHub-io/CSASR_Challenge

Chinese-English code-switching speech recognition

Shell Updated Sep 26, 2022
speech_dataset Public
Forked from double22a/speech_dataset

The dataset of Speech Recognition

Apache License 2.0 Updated Aug 19, 2022
pytorch-book Public
Forked from chenyuntc/pytorch-book

PyTorch tutorials and fun projects including neural talk, neural style, poem writing, anime generation (《深度学习框架PyTorch：入门与实战》)

Jupyter Notebook MIT License Updated Aug 14, 2022
open-speech-corpora Public
Forked from coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

MIT License Updated Jul 27, 2022
ECAPA-TDNN Public
Forked from TaoRuijie/ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python MIT License Updated Jun 1, 2022
python_speech_features Public
Forked from jameslyons/python_speech_features

This library provides common speech features for ASR including MFCCs and filterbank energies.

Python MIT License Updated Oct 20, 2021
AISHELL-4 Public
Forked from felixfuyihui/AISHELL-4

Python Apache License 2.0 Updated Jul 21, 2021
lihang-code Public
Forked from fengdu78/lihang-code

Code implementations for 《统计学习方法》 (Statistical Learning Methods)

Jupyter Notebook Updated May 31, 2021
AESRC2020 Public
Forked from R1ckShi/AESRC2020

Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).

Python Apache License 2.0 Updated Oct 9, 2020
kaldi Public
Forked from kaldi-asr/kaldi

This is the official location of the Kaldi project.

Shell Other Updated Jun 23, 2020
zhvoice Public
Forked from fighting41love/zhvoice

Chinese voice corpus with clearer, more natural speech, comprising 8 open-source datasets, 3,200 speakers, 900 hours of speech, and 13 million characters.

Updated Jun 12, 2020
speaker-recognition-py3 Public
Forked from crouchred/speaker-recognition-py3

Speech recognition based on MFCC and Gaussian mixture models (GMM)

Python Apache License 2.0 Updated Mar 13, 2019
voiceprint Public
Forked from RDShi/voiceprint

A simple model implemented with tensorflow for voiceprint

Python Updated Dec 14, 2018
