🏠
Working from home
  • Carnegie Mellon University
  • Pittsburgh, U.S.A.

Organizations

@SJTMusicTeam

Python 28 2 Updated Nov 4, 2025

A project that extracts Honkai: Star Rail text corpus

Python 33 2 Updated Jul 12, 2024

Extracting character conversations in Genshin Project

Python 70 9 Updated Feb 6, 2025

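A multilingual text search tool for Genshin Impact: search all text and voice lines by keyword. Useful for foreign-language learning, story research, model training, and similar purposes.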

Python 35 2 Updated Sep 3, 2024
Python 41 3 Updated Dec 4, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 309 27 Updated Mar 28, 2025

Train transformer language models with reinforcement learning.

Python 16,684 2,365 Updated Dec 17, 2025

Align Anything: Training All-modality Model with Feedback

Python 4,605 507 Updated Nov 27, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,387 319 Updated Jun 21, 2025

Official Repository for "SingFake: Singing Voice Deepfake Detection"

JavaScript 63 8 Updated Feb 26, 2024

Your faithful, impartial partner for audio evaluation: honest evaluation, so you know yourself and your rivals.

Python 182 9 Updated Dec 4, 2025
JavaScript 1 5 Updated Aug 6, 2025

Vox-Profile Benchmark

Python 58 10 Updated Sep 12, 2025

Open-source framework for the research and development of foundation models.

HTML 664 67 Updated Dec 17, 2025

An example starter repo for Python projects

Python 305 50 Updated Jun 16, 2025

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Python 216 12 Updated Feb 28, 2025

A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating generative audio.

Python 92 8 Updated Jun 12, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 649 48 Updated Jun 5, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,310 3,237 Updated Dec 17, 2025

Code repository for the INTERSPEECH 2024 paper - IndicMOS: Multilingual MOS Prediction for 7 Indian languages

Python 5 Updated Apr 20, 2025

Python implementation of the SRMR toolbox

Python 124 43 Updated Jun 17, 2024

A fundamental toolkit designed for music, song, and audio generation

Python 1,260 129 Updated May 20, 2025

A simple library for Fréchet Audio Distance (FAD) calculation

Python 240 24 Updated Aug 22, 2025

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 270 23 Updated Dec 6, 2025

UTokyo-SaruLab MOS Prediction System

Python 272 28 Updated Dec 10, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,190 74 Updated Aug 13, 2025

Versatile Evaluation of Speech and Audio

Python 365 46 Updated Dec 9, 2025

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 127 10 Updated Oct 2, 2025

Google Drive CLI Client

Rust 1,962 127 Updated Aug 3, 2024
Python 26 2 Updated Nov 29, 2025