Organizations

@Coda-SVS

96 stars written in Python

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,319 9,411 Updated Dec 15, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,811 3,406 Updated Feb 7, 2026

Open-Sora: Democratizing Efficient Video Production for All

Python 28,500 2,885 Updated Apr 30, 2025

SOTA Open Source TTS

Python 24,838 2,066 Updated Feb 2, 2026

Open-Source Frontier Voice AI

Python 22,980 2,503 Updated Feb 7, 2026

SoTA open-source TTS

Python 22,419 2,933 Updated Feb 3, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,743 2,033 Updated Jan 13, 2026

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,089 1,664 Updated Nov 19, 2025

Let's make video diffusion practical!

Python 16,607 1,640 Updated Oct 16, 2025

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 10,451 1,266 Updated Aug 4, 2025

End-to-End Speech Processing Toolkit

Python 9,720 2,378 Updated Feb 5, 2026

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,685 794 Updated May 27, 2025

State-of-the-art TTS model under 25MB 😻

Python 9,595 499 Updated Feb 2, 2026

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,816 1,389 Updated Dec 6, 2023

Sharp Monocular View Synthesis in Less Than a Second

Python 7,475 518 Updated Dec 19, 2025
Python 6,070 470 Updated Aug 29, 2025

Event-driven networking engine written in Python.

Python 5,939 1,207 Updated Jan 19, 2026

Fully automatic censorship removal for language models

Python 4,707 452 Updated Feb 2, 2026

The most powerful local music generation model that outperforms most commercial alternatives

Python 4,543 467 Updated Feb 7, 2026

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,404 501 Updated Feb 6, 2026

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,993 807 Updated Jul 5, 2024

Noise suppression using deep filtering

Python 3,817 400 Updated Oct 17, 2024

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,295 266 Updated Sep 6, 2023

A Flow-based Generative Network for Speech Synthesis

Python 2,335 536 Updated Oct 19, 2023

A sketch extractor for anime/illustration.

Python 2,086 173 Updated Aug 16, 2023

YouTube Full Text Search - Search all of YouTube from the command line

Python 1,788 97 Updated Jan 22, 2026
Python 1,776 78 Updated Dec 16, 2025

Kakao Hangul Analyzer III

Python 1,447 302 Updated Sep 1, 2025

The official implementation of HierSpeech++

Python 1,241 151 Updated Feb 20, 2024

SincNet is a neural architecture for efficiently processing raw audio samples.

Python 1,228 271 Updated Apr 28, 2021