gniToaH gnitoah

Stars

yongaifadian1 / MAGIC-TTS

MAGIC-TTS: Fine-Grained Controllable Speech Synthesis with Explicit Local Duration and Pause Control

Python 25 2 Updated Apr 28, 2026

HappyColor / DrawSpeech_PyTorch

Python 24 5 Updated Nov 25, 2025

OpenMOSS / MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 1,691 159 Updated Apr 13, 2026

facebookresearch / ConvNeXt-V2

Code release for ConvNeXt V2 model

Python 2,016 171 Updated Aug 14, 2024

ozspeech / OZSpeech

[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Jupyter Notebook 45 6 Updated Feb 9, 2025

idiap / knn-tts

Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model

Python 36 7 Updated Apr 29, 2025

flamed-tts / Flamed-TTS

This repository implements a novel zero-shot TTS framework, Flamed-TTS, focusing on efficient generation and dynamic pacing in speech synthesis.

Python 57 6 Updated Aug 9, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,192 6,062 Updated Aug 16, 2024

tarepan / SpeechMOS

Easy-to-Use Speech MOS predictors

Python 352 18 Updated Oct 24, 2023

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,281 192 Updated Apr 10, 2026

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,776 811 Updated Mar 25, 2026

FunAudioLLM / CV3-Eval

Python 181 16 Updated Aug 25, 2025

yangdongchao / SimpleSpeech

The open source code for SimpleSpeech series

Python 144 11 Updated Oct 8, 2024

justinlovelace / SESD

Python 61 3 Updated Oct 28, 2024

sh-lee-prml / HierSpeechpp

The official implementation of HierSpeech++

Python 1,238 151 Updated Feb 20, 2024

adelacvg / ttts

Train the next generation of TTS systems.

Python 170 17 Updated Sep 13, 2024

ERC-ITEA / MuduoLLM

82 4 Updated Oct 14, 2025

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,804 287 Updated Mar 31, 2026

mcf330 / efts2code

source code of EfficientTTS 2

Python 20 2 Updated Feb 18, 2024

BytedanceSpeech / seed-tts-eval

Python 1,554 145 Updated Jun 14, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,212 6,678 Updated Sep 30, 2025

IDRnD / redimnet

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 194 17 Updated Sep 24, 2025

chenqi008 / pymcd

Package pymcd

Python 40 3 Updated Sep 8, 2022

supertone-inc / super-monotonic-align

Python 167 10 Updated Sep 19, 2024

lingjzhu / charsiu

Charsiu: A neural phonetic aligner.

Jupyter Notebook 341 44 Updated Sep 19, 2022

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,592 1,955 Updated Apr 15, 2026