Skip to content
View karim23657's full-sized avatar

Block or report karim23657

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Soprano: Instant, Ultra-Realistic Text-to-Speech

Python 1,218 108 Updated Jan 15, 2026

SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

Python 20 1 Updated May 20, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 20,509 2,351 Updated Mar 16, 2026

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of…

Python 60,452 7,711 Updated Apr 11, 2026

Helloworld for agentic frameworks, minimial but runnable! LangGraph, Agno, AutoGen, Smolagents, OpenAI Agents, etc.

Python 59 13 Updated Aug 8, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 223 17 Updated Feb 19, 2026

Run Orpheus 3B Locally With LM Studio

Python 528 115 Updated Mar 20, 2025

Towards Human-Sounding Speech

Python 6,074 518 Updated Dec 5, 2025

NeMo text processing for ASR and TTS

Python 451 155 Updated Apr 8, 2026

Automatically create a crew and tasks for CrewAI

Python 190 35 Updated Mar 5, 2024
Python 1 Updated Dec 8, 2024

Phonetisaurus G2P

Shell 516 129 Updated Jun 1, 2024

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Python 1,235 192 Updated Dec 7, 2025
TypeScript 27 9 Updated Aug 10, 2024
Jupyter Notebook 12,424 917 Updated Oct 25, 2025

Repository for research project about watermarkng audio

Python 6 1 Updated Dec 30, 2024

A lightweight end-to-end text-to-speech model

Python 126 18 Updated Feb 23, 2025

Download YouTube video (or supply your own) and generate dual languange subtitles with OpenAI Whisper and translation API (GPT) 下载 YouTube 视频(或提供您自己的视频)并使用 Whisper 和翻译API (GPT) 生成双语字幕

Jupyter Notebook 116 21 Updated Jun 4, 2024

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 5,017 735 Updated Jan 21, 2025

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

Python 4,193 1,644 Updated Mar 22, 2024

Learn Python with Colaboratory (colab.research.google.com)

Jupyter Notebook 4 2 Updated Oct 23, 2025

ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

Jupyter Notebook 49 5 Updated Jul 12, 2025

Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant,…

Python 545 39 Updated Apr 21, 2025

A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!

Python 323 30 Updated Nov 21, 2024

Noise removal/ reducer from the audio file in python. De-noising is done using Wavelets and thresholding is done by VISU Shrink thresholding technique

Python 209 24 Updated Apr 30, 2023

Simple, fast unsupervised word aligner

C++ 770 163 Updated Jul 19, 2022

A neural word aligner based on multilingual BERT

Python 375 61 Updated Mar 10, 2022

A Telegram Bot that automatically reacts to posts in Telegram Channels, groups, and private messages, developed as a server-less application.✨

JavaScript 112 203 Updated Feb 2, 2026

Fine-Tuning your VITS model using a pre-trained model

Python 551 86 Updated May 2, 2023
Next