Stars
An extremely fast Python linter and code formatter, written in Rust.
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Official PyTorch implementation of BigVGAN (ICLR 2023)
Repository for Open Source Reinforcement Learning Framework JORLDY
Code for paper "Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions"
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Quill is a modern WYSIWYG editor built for compatibility and extensibility
Pytorch library for fast transformer implementations
Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equations
Google Research
PyTorch implementation of the paper "NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity." (NeurIPS 2020)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
Adversarially Trained End-to-end Korean SInging Voice Synthesis System
A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio" (ICML 2020)