Stars
Build your neural network easy and fast, 莫烦Python中文教学
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Free course for Resume, 整理和搜集网络免费的项目实战课程,包括 Java 项目实战,Python 项目实战,C++ 项目实战等
A python package to analyze and compare voices with deep learning
2021年最新整理,5000道秋招/提前批/春招/常用面试题(含答案),包括leetcode,校招笔试题,面试题,算法题,语法题。
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
You can find the speech algorithms you want here
A Python wrapper for the high-quality vocoder "World"
Unsupervised Speech Decomposition Via Triple Information Bottleneck
speech emotion recognition using a convolutional recurrent networks based on IEMOCAP
A collection of datasets for the purpose of emotion recognition/detection in speech.
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch
End-to-End Automatic Speech Recognition on PyTorch
Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transformers, and everything in between
Implementation code of non-parallel sequence-to-sequence VC
TensorFlow binaries supporting AVX, FMA, SSE
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
A vocoder framework which had been widely used in research community since 1999.
Multi-modal Emotion detection from IEMOCAP on Speech, Text, Motion-Capture Data using Neural Nets.