-
espnet Public
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
-
-
-
-
-
Noresqa Public
Forked from shimhz/NoresqaThis github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.
Python Other UpdatedJul 5, 2025 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedJul 5, 2025 -
ssl-singer-identity Public
Forked from SonyCSLParis/ssl-singer-identity -
emotion2vec Public
Forked from ddlBoJack/emotion2vec[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Python UpdatedJun 5, 2025 -
vox-profile-release Public
Forked from tiantiaf0627/vox-profile-releaseVox-Profile Benchmark
Python Apache License 2.0 UpdatedMay 30, 2025 -
s3prl Public
Forked from s3prl/s3prlSelf-Supervised Speech Pre-training and Representation Learning Toolkit.
-
SongEval Public
Forked from ASLP-lab/SongEvalA song aesthetic evaluation toolkit trained on SongEval.
Python Apache License 2.0 UpdatedMay 17, 2025 -
UTMOSv2 Public
Forked from sarulab-speech/UTMOSv2UTokyo-SaruLab MOS Prediction System
-
pysepm Public
Forked from schmiph2/pysepmPython implementation of performance metrics in Loizou's Speech Enhancement book
Python GNU General Public License v3.0 UpdatedApr 23, 2025 -
DiscreteSpeechMetrics Public
Forked from Takaaki-Saeki/DiscreteSpeechMetricsReference-aware automatic speech evaluation toolkit
Python MIT License UpdatedApr 14, 2025 -
audiobox-aesthetics Public
Forked from facebookresearch/audiobox-aestheticsUnified automatic quality assessment for speech, music, and sound.
Python Creative Commons Attribution 4.0 International UpdatedFeb 28, 2025 -
ParallelWaveGAN Public
Forked from kan-bayashi/ParallelWaveGANUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
-
fadtk Public
Forked from microsoft/fadtkA simple library for Fréchet Audio Distance (FAD) calculation
Python MIT License UpdatedDec 15, 2024 -
scoreq Public
Forked from alessandroragano/scoreqSCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
-
shinjiwlab.github.io Public
Forked from wavlab-speech/shinjiwlab.github.ioJavaScript MIT License UpdatedNov 22, 2024 -
CLAP Public
Forked from microsoft/CLAPLearning audio concepts from natural language supervision
Python MIT License UpdatedOct 30, 2024 -
sheet Public
Forked from unilight/sheetSpeech Human Evaluation Estimation Toolkit (SHEET)
Python MIT License UpdatedOct 16, 2024 -
speech_evaluation Public
A toolkit dedicate for speech evaluation.
-
-
WARP-Q Public
Forked from wjassim/WARP-QThis code is to run the WARP-Q speech quality metric.
Python Apache License 2.0 UpdatedAug 27, 2024 -
fairseq-1 Public
Forked from espnet/fairseqPython code for Fairseq maintained by ESPnet
Python MIT License UpdatedAug 26, 2024 -
SpeechMOS Public
Forked from tarepan/SpeechMOSEasy-to-Use Speech MOS predictors
Python MIT License UpdatedJun 5, 2024 -
-
-
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-PatchThe official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Python Apache License 2.0 UpdatedMay 16, 2024