Skip to content
View Cortexelus's full-sized avatar

Block or report Cortexelus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
156 stars written in Python
Clear filter

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python 1,981 485 Updated Dec 19, 2023

Speedy Wavenet generation using dynamic programming ⚡

Python 1,770 307 Updated Jun 20, 2017

Pytorch library for fast transformer implementations

Python 1,748 188 Updated Mar 23, 2023

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,611 160 Updated Nov 4, 2025

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,593 193 Updated Aug 12, 2020

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Python 1,569 204 Updated Jun 23, 2025

Python audio and music signal processing library

Python 1,516 258 Updated Aug 25, 2024

This is now the official location of the Merlin project.

Python 1,320 437 Updated Mar 3, 2020

Question answering dataset featured in "Teaching Machines to Read and Comprehend

Python 1,295 246 Updated Apr 26, 2017
Python 1,187 146 Updated Sep 29, 2022

Differentiable Vector Graphics Rasterization

Python 1,152 188 Updated May 17, 2025

NumPy interface with mixed backend execution

Python 1,103 109 Updated Feb 19, 2018

Keras WaveNet implementation

Python 1,055 217 Updated Mar 24, 2023

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 1,025 215 Updated Aug 28, 2023

an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Python 1,022 167 Updated May 26, 2025

Chainer implementation of Deep Convolutional Generative Adversarial Network

Python 936 185 Updated Jul 6, 2020

A collection of links and notes on forced alignment tools

Python 932 88 Updated Nov 10, 2021

kapre: Keras Audio Preprocessors

Python 931 149 Updated Oct 26, 2025

Implementation of the Wave-U-Net for audio source separation

Python 915 182 Updated Mar 24, 2023

Generating faces with deconvolution networks

Python 891 129 Updated Jun 8, 2021

A method to generate speech across multiple speakers

Python 873 154 Updated Mar 21, 2019

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 869 128 Updated Mar 26, 2024

Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)

Python 848 101 Updated Sep 30, 2021

Speech Enhancement Generative Adversarial Network in TensorFlow

Python 848 282 Updated Mar 24, 2023

My implementation of Few-Shot Adversarial Learning of Realistic Neural Talking Head Models (Egor Zakharov et al.).

Python 831 191 Updated Jun 21, 2022

Multilayer LSTM and Mixture Density Network for modelling path-level SVG Vector Graphics data in TensorFlow

Python 810 127 Updated Oct 25, 2018

GRUV is a Python project for algorithmic music generation.

Python 800 162 Updated Nov 28, 2020

Neural Style Transfer For Chinese Characters

Python 775 122 Updated Apr 7, 2017

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Python 698 155 Updated Jul 12, 2022