Skip to content
View gnitoah's full-sized avatar

Block or report gnitoah

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MAGIC-TTS: Fine-Grained Controllable Speech Synthesis with Explicit Local Duration and Pause Control

Python 25 2 Updated Apr 28, 2026

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python 1,691 159 Updated Apr 13, 2026

Code release for ConvNeXt V2 model

Python 2,016 171 Updated Aug 14, 2024

[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching

Jupyter Notebook 45 6 Updated Feb 9, 2025

Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model

Python 36 7 Updated Apr 29, 2025

This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in speech synthesis.

Python 57 6 Updated Aug 9, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 45,192 6,062 Updated Aug 16, 2024

Easy-to-Use Speech MOS predictors

Python 352 18 Updated Oct 24, 2023

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,281 192 Updated Apr 10, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,776 811 Updated Mar 25, 2026
Python 181 16 Updated Aug 25, 2025

The open source code for SimpleSpeech series

Python 144 11 Updated Oct 8, 2024
Python 61 3 Updated Oct 28, 2024

The official implementation of HierSpeech++

Python 1,238 151 Updated Feb 20, 2024

Train the next generation of TTS systems.

Python 170 17 Updated Sep 13, 2024
82 4 Updated Oct 14, 2025

Command line utility for forced alignment using Kaldi

Python 1,804 287 Updated Mar 31, 2026

source code of EfficientTTS 2

Python 20 2 Updated Feb 18, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,212 6,678 Updated Sep 30, 2025

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 194 17 Updated Sep 24, 2025

Package pymcd

Python 40 3 Updated Sep 8, 2022

Charsiu: A neural phonetic aligner.

Jupyter Notebook 341 44 Updated Sep 19, 2022

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,592 1,955 Updated Apr 15, 2026
Python 476 42 Updated May 19, 2025
Python 22 3 Updated Apr 6, 2025

A generative speech model for daily dialogue.

Python 39,167 4,249 Updated Apr 10, 2026

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 57,027 6,224 Updated Apr 19, 2026
Next