Lists (1)
Sort Name ascending (A-Z)
Stars
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
High-Quality Voice Cloning TTS for 600+ Languages
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
Training code and dataset cleasing with Sidon
anon-uscf / uscf
Forked from kamperh/linearvcUniversal Speech Content Factorization
🌋LavaSR: Fast Speech restoration and enhancement
A state-of-the-art, open-source deepfake detection system built with PyTorch and EfficientNet-B0, featuring a user-friendly web interface for real-time image and video analysis.
A list of tools, papers and code related to Fake Audio Detection.
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
A Neovim plugin that provides VSCode-style diff rendering with two-tier highlighting (line + character level) in side-by-side and inline layouts, using VSCode's algorithm implemented in C.
Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"
Multilingual Voice Understanding Model
zero-shot voice conversion & singing voice conversion, with real-time support
[ICASSP'23] Online speaker clustering
A Conversational Speech Generation Model
A TTS model capable of generating ultra-realistic dialogue in one pass.
This repository contains the code and experiments for the paper "Exploring Flan-T5 for Post-ASR Error Correction".
View HTTP/HTTPS requests made by any Linux program
A simple reader/parser for Matrix Market (.mtx) files to represent sparse matrix in text format.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
Avoids race condition when acquiring GPUs in exclusive mode
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications