Stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Train transformer language models with reinforcement learning.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
⚡ Machine Learning in Action (Python3): kNN, decision trees, Bayes, logistic regression, SVM, linear regression, tree regression
Google Drive Public File Downloader when Curl/Wget Fails
Muzic: Music Understanding and Generation with Artificial Intelligence
Align Anything: Training All-modality Model with Feedback
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Hidden Markov Models in Python, with scikit-learn like API
Core Engine of Singing Voice Conversion & Singing Voice Clone
Self-Supervised Speech Pre-training and Representation Learning Toolkit
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
A fundamental toolkit designed for music, song, and audio generation
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
Neural network-based singing voice synthesis library for research
Unified automatic quality assessment for speech, music, and sound.
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
D's Machine Learning is a machine learning toolkit for Python, focused on correctness rather than efficiency
Speech self-supervised representations
A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi