AI-Powered Podcast Generator: A Python-based tool that converts text scripts into realistic audio podcasts using Google's Generative AI API. This project leverages advanced text-to-speech technolog…

Python 50 7 Updated Dec 16, 2024

M-Taghizadeh / Persian_Question_Answering_Voice2Voice_AI

This repository hosts BonyadAI, a Persian question answering AI Model. We developed an initial web crawler and scraper to gather the dataset. The second phase involved building a machine learning m…

Jupyter Notebook 11 5 Updated Jul 7, 2024

Aefyr / SAI

Android split APKs installer

Java 3,189 287 Updated Jun 3, 2024

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,901 3,914 Updated Nov 5, 2025

exaloop / codon

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 15,992 557 Updated Nov 1, 2025

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 1,184 98 Updated Jun 29, 2025

kyutai-labs / hibiki

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1,313 103 Updated Apr 15, 2025

bryan-brancotte / subtitle_to_speech

convert subtitle (.srt) to speech (.wav) using google API

Python 44 23 Updated Jan 5, 2022

ripienaar / free-for-dev

A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev

HTML 114,607 11,725 Updated Nov 3, 2025

SadeghKrmi / pertts-streamlit

Persian text-to-speech streamlit interface

Python 42 8 Updated Dec 9, 2024

MohammadJavadArdestani / NLP-persian-poet-identification

NLP-persian-poet-identification

Python 4 Updated Sep 8, 2022

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 32,769 4,608 Updated Nov 24, 2024

mohbadar / pashto-text-dataset

Text Dataset for Pashto Language

Jupyter Notebook 13 4 Updated Mar 3, 2019

marytts / marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Java 2,549 769 Updated Jan 17, 2025

katanemo / archgw

The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrails, zero-code logs and traces, unified access to LLMs from Op…

Rust 4,284 238 Updated Oct 31, 2025

Masoud Azizi mablue

Lists (1)

Proxy

Starred repositories

motion-detection

cloudflare-workers

Game engine

LaTeX