Skip to content
View hlp2819's full-sized avatar

Block or report hlp2819

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The Open Source Code of UniAudio

Python 604 40 Updated Jul 22, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,075 4,686 Updated Aug 19, 2024

A collection of resources and papers on Diffusion Models

HTML 12,308 1,018 Updated Aug 1, 2024

Text Normalization & Inverse Text Normalization

Python 751 100 Updated Feb 27, 2026

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,906 496 Updated Oct 12, 2024

A light-weight Python library for computing Kaldi-style acoustic features based on NumPy

Python 14 4 Updated Aug 17, 2020

List of speech synthesis papers.

1,071 123 Updated Jul 24, 2023

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,282 576 Updated Apr 14, 2023

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,167 615 Updated Oct 27, 2023

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,835 266 Updated Aug 19, 2025

The dataset of Speech Recognition

456 81 Updated Jan 4, 2026

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 30 4 Updated May 28, 2020

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,304 1,425 Updated Jun 12, 2024

A TensorFlow implementation of DeepMind's WaveNet paper

Python 5,434 1,274 Updated Jul 12, 2023

A clone of Darts (Double-ARray Trie System)

C++ 1 Updated Nov 17, 2018

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Python 1,834 430 Updated Jan 17, 2022

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,318 903 Updated Jul 6, 2023

This is now the official location of the Merlin project.

Python 1,321 431 Updated Mar 3, 2020

Blind Source Separation for Audio Recognition Tasks

C++ 2 Updated Jun 10, 2013

Chinese keras documents with more examples, explanations and tips.

1,560 277 Updated Apr 6, 2023