GUI for a Vocal Remover that uses Deep Neural Networks
Comprehensive Gradio WebUI for audio processing
1 min voice data can also be used to train a good TTS model
Repo of Qwen2-Audio chat & pretrained large audio language model
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Dia-1.6B generates lifelike English dialogue and vocal expressions