Skip to content
View SpenserCai's full-sized avatar

Block or report SpenserCai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI Voice

13 repositories

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,993 5,863 Updated Aug 16, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,691 3,978 Updated Apr 19, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 53,480 5,854 Updated Dec 25, 2025

a comfyui custom node for CosyVoice

Python 281 38 Updated Sep 10, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,298 2,039 Updated Dec 23, 2025

SOTA Open Source TTS

Python 24,409 2,007 Updated Dec 1, 2025

SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.

Python 208 22 Updated Feb 5, 2024

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

Python 401 34 Updated Sep 11, 2023
Python 337 24 Updated Mar 17, 2025

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,631 1,122 Updated Sep 14, 2024

ComfyUI nodes for LivePortrait

Python 2,109 165 Updated Aug 5, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,155 1,466 Updated Dec 24, 2025