hlp2819


The Open Source Code of UniAudio

Python 605 39 Updated Jul 22, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 39,071 4,687 Updated Aug 19, 2024

A collection of resources and papers on Diffusion Models

HTML 12,300 1,018 Updated Aug 1, 2024

Text Normalization & Inverse Text Normalization

Python 739 101 Updated Feb 27, 2026

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,902 495 Updated Oct 12, 2024

A light-weight Python library for computing Kaldi-style acoustic features based on NumPy

Python 14 4 Updated Aug 17, 2020

List of speech synthesis papers.

1,070 123 Updated Jul 24, 2023

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Python 3,281 576 Updated Apr 14, 2023

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 2,163 616 Updated Oct 27, 2023

Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,830 267 Updated Aug 19, 2025

The dataset of Speech Recognition

456 81 Updated Jan 4, 2026

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 30 4 Updated May 28, 2020

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,304 1,426 Updated Jun 12, 2024

A TensorFlow implementation of DeepMind's WaveNet paper

Python 5,435 1,276 Updated Jul 12, 2023

A clone of Darts (Double-ARray Trie System)

C++ 1 Updated Nov 17, 2018

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Python 1,834 430 Updated Jan 17, 2022

DeepMind's Tacotron-2 TensorFlow implementation

Python 2,319 905 Updated Jul 6, 2023

This is now the official location of the Merlin project.

Python 1,321 431 Updated Mar 3, 2020

Blind Source Separation for Audio Recognition Tasks

C++ 2 Updated Jun 10, 2013

Chinese Keras documentation with more examples, explanations, and tips.

1,560 277 Updated Apr 6, 2023