NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers.

Kai Shen, Zeqian Ju, Xu Tan, Yanqing Liu, Yichong Leng, Lei He, Tao Qin, Sheng Zhao, Jiang Bian. CoRR abs/2304.09116 (2023). https://doi.org/10.48550/arXiv.2304.09116