Stars

TTS

27 repositories

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 4 1 Updated Jul 27, 2025

An interface that converts any PDF document you give it, offline, into an interview between two people discussing it.

Python 16 1 Updated Dec 8, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,627 915 Updated Dec 12, 2025

podcastfy.ai gradio demo app

Python 334 33 Updated Nov 30, 2024

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 5,763 675 Updated Dec 9, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,094 644 Updated Aug 10, 2024

Open-source, event-driven, AI-powered softphone

JavaScript 148 21 Updated Jul 15, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,946 5,857 Updated Aug 16, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,194 832 Updated Nov 20, 2025
JavaScript 23 12 Updated Oct 30, 2025
Shell 1 Updated May 9, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,258 485 Updated Apr 15, 2025

AI-powered voice calling agent using OpenAI's Realtime API and Twilio

Python 36 19 Updated Dec 26, 2024

Run LLMs with MLX

Python 3,091 332 Updated Dec 19, 2025

A Full-Duplex Open-Domain Dialogue Agent with Continuous Turn-Taking Behavior

Python 36 Updated Oct 27, 2023

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,844 4,674 Updated Aug 19, 2024

Turn detection for full-duplex dialogue communication

Python 493 31 Updated Nov 11, 2025

True full-duplex acoustic echo cancellation.

C++ 26 12 Updated Oct 8, 2021

This repository is a real-time Voice-to-Voice AI Assistant that enables full duplex conversations using speech and large language models. Built with Python, Streamlit, SpeechRecognition, GTTS, and …

Python 2 Updated Apr 20, 2025

SOTA Open Source TTS

Python 24,369 2,002 Updated Dec 1, 2025

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,418 132 Updated Apr 24, 2024

An open-source music processing toolkit

Python 320 44 Updated Jun 25, 2023

Inworld TTS

Python 583 55 Updated Sep 19, 2025

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 3,003 322 Updated Dec 20, 2025

On-device TTS model by Neuphonic

Python 4,275 450 Updated Dec 15, 2025

A Conversational Speech Generation Model

Python 14,368 1,458 Updated May 27, 2025

Optimized Whisper models for streaming and on-device use

Python 768 54 Updated Dec 17, 2025