Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,844 814 Updated Mar 25, 2026

AaronZ345 / StyleSinger

PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python 420 27 Updated Aug 15, 2025

ZhangShaozuo / FastSpeech2PromptGuidance

Python 7 Updated Feb 18, 2025

Choddeok / EmoSphere-TTS

[INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

Python 179 15 Updated May 20, 2025

Choddeok / EmoSpherepp

[TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

Python 130 13 Updated Sep 7, 2025

KunZhou9646 / Emovox

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

Python 95 13 Updated Feb 9, 2022

softmax1 / Flash-Attention-Softmax-N

CUDA and Triton implementations of Flash Attention with SoftmaxN.

Python 74 5 Updated May 26, 2024

zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion

This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion".

Python 21 3 Updated Sep 18, 2023

HolmesShuan / Zero-shot-Style-Transfer-via-Attention-Rearrangement

[CVPR2024] Official implementation of the paper "Z∗: Zero-shot Style Transfer via Attention Rearrangement" a.k.a. "Z∗: Zero-shot Style Transfer via Attention Reweighting"

Python 98 3 Updated Sep 29, 2024

winddori2002 / TriAAN-VC

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Python 147 13 Updated Jan 15, 2024

BrightGu / RLVC

Python 15 1 Updated Sep 22, 2023

hzlsaber / IPMix

The offical repository of "IPMix: Label-Preserving Data Augmentation Method for Training Robust Classifiers"

Python 15 1 Updated May 7, 2024

light1726 / SpeechTripleNet

The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"

Jupyter Notebook 34 2 Updated Nov 23, 2023

feymanpriv / DOLG

Pytorch Implementation of DOLG (ICCV 2021)

Python 66 12 Updated Jun 21, 2022

ConsistencyVC / ConsistencyVC-voive-conversion

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 154 23 Updated Oct 16, 2023

ai-dawang / PlugNPlay-Modules

Python 5,088 374 Updated Aug 5, 2025

quickvc / QuickVC-VoiceConversion

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Python 261 34 Updated Jul 13, 2023

wonjune-kang / lvc-vc

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Python 93 7 Updated Nov 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TengNN

Block or report TengNN

Stars

metavoiceio / metavoice-src

Anduin2017 / HowToCook

lorenzo2beretta / multi-swap-k-means-pp

rishikksh20 / FastSpeech2

gallilmaimon / DISSC

revsic / torch-nansypp

ayangweb / BongoCat

aceliuchanghong / FAQ_Of_LLM_Interview

Trikaldarshi / SCORE_Finetuning

vectominist / spin

zhaox0 / SMVC

jaejunL / HYFace

open-mmlab / Amphion