Stars
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
š Text-Prompted Generative Audio Model
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllableā¦
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.