Starred repositories
Easily train a good VC model with voice data <= 10 mins!
A TTS model capable of generating ultra-realistic dialogue in one pass.
Let's make video diffusion practical!
litagin02 / Style-Bert-VITS2
Forked from fishaudio/Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
AivisSpeech: AI Voice Imitation System - Text to Speech Software
A list of Free Software network services and web applications which can be hosted on your own servers
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A feature-rich command-line audio/video downloader
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
kaldi-asr/kaldi is the official location of the Kaldi project.
Robust Speech Recognition via Large-Scale Weak Supervision
A popular & widely deployed Open Source Container Native Storage platform for Stateful Persistent Applications on Kubernetes.
An open-source, low-code machine learning library in Python
Easily check your clusters for use of deprecated APIs
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
Machine Learning Pipelines for Kubeflow
A repository to host extended examples and tutorials
Python packaging and dependency management made easy