Skip to content
View chomeyama's full-sized avatar

Block or report chomeyama

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 229 32 Updated Dec 24, 2024

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 135 9 Updated Mar 31, 2026

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,285 220 Updated Apr 13, 2026

Pytorch implementation of 2D Discrete Wavelet (DWT) and Dual Tree Complex Wavelet Transforms (DTCWT) and a DTCWT based ScatterNet

Python 1,170 158 Updated Aug 2, 2023

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python 405 60 Updated Oct 1, 2024

Unsupervised Rhythm Modeling for Voice Conversion

Python 85 9 Updated Aug 3, 2023

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 1,129 129 Updated Aug 7, 2024

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 1,224 144 Updated Sep 5, 2024

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

Python 116 10 Updated Nov 25, 2023

Pytorch implementation of stochastically quantized variational autoencoder (SQ-VAE)

Python 196 25 Updated Jul 20, 2022

Magenta: Music and Art Generation with Machine Intelligence

Python 19,795 3,775 Updated Jan 6, 2026

A high-quality speech analysis, manipulation and synthesis system

C++ 1,321 265 Updated Feb 18, 2026

The Implementation of FastSpeech based on pytorch.

Python 883 214 Updated Jul 6, 2023

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained …

Roff 293 42 Updated Apr 6, 2023

Neural network-based singing voice synthesis library for research

Python 743 83 Updated Oct 9, 2023

AIを使ったリアルタイムボイスチェンジャー(Trainer)

Jupyter Notebook 931 79 Updated Nov 22, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,353 552 Updated Jul 27, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,866 1,391 Updated Dec 6, 2023

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 888 131 Updated Mar 26, 2024

Pytorch implementation of MixNMatch

Python 967 187 Updated Jul 7, 2020

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 10,215 1,515 Updated Apr 24, 2024

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,642 350 Updated Apr 22, 2024

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Python 46 6 Updated Oct 29, 2022

Pytorch implementation of the CREPE pitch tracker

Python 516 79 Updated May 16, 2025