Docker for multiple TTS Engines with a Gradio interface
Updated Aug 29, 2024 - Jupyter Notebook
This case study uses multimodal generative AI (text, image, audio, video) to create a complete, professional digital marketing campaign for a small bakery, demonstrating a cost-effective content creation process.
Voice-controlled robotic assistant with natural language processing, command validation, and speech synthesis. Built with a microservices architecture.
Fine-tuned Parler-TTS (600M) for Hinglish language, Indian accent, and emotion-conditioned speech synthesis. Published at arXiv:2506.16310.
Generate podcast-style audio locally with multiple free TTS engines. Supports edge-tts, Bark and Parler-TTS. Inspired by Google NotebookLM.
Open source multilingual voice OS — 22 Indian languages via AI4Bharat (IndicTrans2, IndicConformer, Parler-TTS, IndicXlit) + faster-whisper. Jarvis-style desktop assistant.
This repository serves as the official open-source evaluation hub for a premium, high-fidelity Conversational Female Monologue Dataset. This data addresses the critical shortage of natural human velocity, spontaneous breath placement, and unscripted vocal cadence in traditional training corpora.