A PyTorch repository for boosting sound-related deep learning tasks
Updated Jul 25, 2024 - Python
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna
Metadata tools for the VCTK Corpus (SQLite with speakers, accents, and transcripts).
A Cloudflare Worker API for retrieving random transcripts with speaker data from the VCTK corpus, featuring flexible exclusion filtering and comprehensive parameter validation.