114 stars written in Python

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Python 1,162 364 Updated Apr 14, 2023

Command-line tools for speech and intent recognition on Linux

Python 1,106 67 Updated Mar 7, 2024

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,077 215 Updated Oct 23, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 1,011 119 Updated Aug 7, 2024

💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

Python 884 105 Updated Dec 15, 2024

🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection tool that works out-of-the-box.

Python 880 87 Updated Aug 20, 2024

An implementation of FastSpeech based on PyTorch.

Python 877 216 Updated Jul 6, 2023

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

Python 831 180 Updated Jul 26, 2021

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 698 155 Updated Jul 12, 2022

TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-lite, ONNX, and real-time audio processing support.

Python 661 169 Updated Jul 28, 2023

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 655 80 Updated Dec 27, 2023

Raspberry Pi surveillance

Python 648 100 Updated Sep 17, 2025

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python 594 164 Updated Jan 20, 2022

⏩ Generating speech in a single forward pass without any attention!

Python 578 111 Updated Jul 29, 2024

Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2

Python 559 109 Updated Jun 10, 2023

Automatically update text-to-speech (TTS) papers daily using GitHub Actions (updates every 12 hours)

Python 545 34 Updated Nov 7, 2025

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Python 532 70 Updated Aug 29, 2023

Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Python 463 67 Updated Nov 17, 2022

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 432 45 Updated Sep 13, 2024

Grapheme to phoneme conversion with deep learning.

Python 407 53 Updated Dec 8, 2023

Official repository for RawNet, RawNet2, and RawNet3

Python 389 57 Updated Mar 21, 2024

TensorFlow implementation of "Generalized End-to-End Loss for Speaker Verification"

Python 369 103 Updated Oct 9, 2021

My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundam…

Python 329 91 Updated Aug 31, 2021

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 327 44 Updated Nov 15, 2024

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 326 43 Updated Sep 24, 2022

Deep neural networks for extracting text-independent speaker embeddings, written in TensorFlow

Python 310 81 Updated Nov 19, 2018

Code for paper "SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows"

Python 288 36 Updated Feb 1, 2021

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model

Python 282 86 Updated Apr 16, 2019

Deye/Sunsynk Inverter Python library and Home Assistant OS Addon

Python 275 128 Updated Nov 6, 2025

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.

Python 265 68 Updated Nov 28, 2022