-
National Taiwan University
- Taiwan
- @HsuBogi
Stars
Brevitas: neural network quantization in PyTorch
Speaker embedding (d-vector) trained with GE2E loss
UniSpeech - Large Scale Self-Supervised Learning for Speech
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Chinese text normalization for speech processing
Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.
Open source code for AlphaFold 2.
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
A PyTorch implementation of the universal neural vocoder
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
An evaluation toolkit for voice conversion models.
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
Real-Time High-Fidelity Speech Synthesis without GPU
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
A complete computer science study plan to become a software engineer.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Self-Supervised Speech Pre-training and Representation Learning Toolkit