Skip to content
View ftshijt's full-sized avatar
🏠
Working from home
🏠
Working from home

Organizations

@SJTMusicTeam

Block or report ftshijt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
74 stars written in Python
Clear filter

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi

Python 353 53 Updated Dec 25, 2020

🤗 R1-AQA Model: mispeech/r1-aqa

Python 321 29 Updated Mar 28, 2025

An opensource music processing toolkit

Python 319 45 Updated Jun 25, 2023

An example starter repo for Python projects

Python 311 58 Updated Jun 16, 2025

UTokyo-SaruLab MOS Prediction System

Python 308 30 Updated Apr 2, 2026

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 288 23 Updated Mar 17, 2026

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。

Python 286 21 Updated Apr 8, 2026

Official implementation of compound word transformer (AAAI'21)

Python 278 45 Updated Nov 27, 2023

A pure python module for reading and writing kaldi ark files

Python 268 36 Updated Mar 6, 2025

ESPnet Model Zoo

Python 259 44 Updated Jul 9, 2023

A simple library for Fréchet Audio Distance (FAD) calculation

Python 255 24 Updated Aug 22, 2025
Python 226 16 Updated Dec 29, 2022

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Python 220 13 Updated Feb 28, 2025

Reference-aware automatic speech evaluation toolkit

Python 181 15 Updated Dec 5, 2024

State-of-the-art pretrained music models for training, evaluation, inference

Python 172 18 Updated Jan 20, 2026

Onnx wrapper for espnet infrernce model

Python 169 25 Updated Aug 11, 2025

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 134 9 Updated Mar 31, 2026

Python implementation of the SRMR toolbox

Python 129 45 Updated Jun 17, 2024

SimulEval: A General Evaluation Toolkit for Simultaneous Translation

Python 124 40 Updated Sep 13, 2024

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…

Python 118 5 Updated Oct 24, 2025

INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"

Python 117 12 Updated Jan 26, 2024

A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating generative audio.

Python 97 9 Updated Jun 12, 2025

A system works on singing voice synthesis

Python 79 19 Updated Jan 11, 2023

Extracting character conversations in Genshin Project

Python 75 8 Updated Feb 6, 2025

Vox-Profile Benchmark

Python 75 12 Updated Feb 16, 2026

logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source separation systems.

Python 46 1 Updated Jan 29, 2026

原神多语言文本搜索工具,可按关键字搜索所有文本、语音,可用于外语学习,剧情考据,模型训练等用途

Python 46 4 Updated Sep 3, 2024

Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.

Python 43 3 Updated Jan 15, 2026

logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.

Python 38 3 Updated Jun 24, 2025

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

Python 37 2 Updated Feb 24, 2023