Skip to content
View TengNN's full-sized avatar

Block or report TengNN

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Foundational model for human-like, expressive TTS

Python 4,204 692 Updated Jul 30, 2024

Programmer's guide about how to cook at home.

100,855 11,007 Updated Jun 16, 2026

This repository contains the code used to run experiments on the multi-swap K-means++ algorithm from https://arxiv.org/pdf/2309.16384.

Python 5 Updated Oct 25, 2024

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Jupyter Notebook 237 54 Updated Jun 22, 2022

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730

Python 130 9 Updated Dec 8, 2023

NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis

Python 152 12 Updated Feb 11, 2023

🐱 跨平台互动桌宠 BongoCat,为桌面增添乐趣!

Vue 21,486 1,015 Updated Apr 28, 2026

大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"

Jupyter Notebook 1,937 133 Updated May 7, 2026

SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

Python 9 Updated Jun 17, 2024

Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering"

Python 65 6 Updated May 19, 2023
HTML 1 Updated Feb 23, 2026
Python 8 Updated Oct 22, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,844 814 Updated Mar 25, 2026

PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

Python 420 27 Updated Aug 15, 2025

[INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

Python 179 15 Updated May 20, 2025

[TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

Python 130 13 Updated Sep 7, 2025

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

Python 95 13 Updated Feb 9, 2022

CUDA and Triton implementations of Flash Attention with SoftmaxN.

Python 74 5 Updated May 26, 2024

This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion".

Python 21 3 Updated Sep 18, 2023

[CVPR2024] Official implementation of the paper "Z∗: Zero-shot Style Transfer via Attention Rearrangement" a.k.a. "Z∗: Zero-shot Style Transfer via Attention Reweighting"

Python 98 3 Updated Sep 29, 2024

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Python 147 13 Updated Jan 15, 2024
Python 15 1 Updated Sep 22, 2023

The offical repository of "IPMix: Label-Preserving Data Augmentation Method for Training Robust Classifiers"

Python 15 1 Updated May 7, 2024

The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"

Jupyter Notebook 34 2 Updated Nov 23, 2023

Pytorch Implementation of DOLG (ICCV 2021)

Python 66 12 Updated Jun 21, 2022

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 154 23 Updated Oct 16, 2023

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Python 261 34 Updated Jul 13, 2023

End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions

Python 93 7 Updated Nov 6, 2023
Next