Skip to content
View ftshijt's full-sized avatar
🏠
Working from home
🏠
Working from home

Organizations

@SJTMusicTeam

Block or report ftshijt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🏛️ 三省六部制 · OpenClaw Multi-Agent Orchestration System — 9 specialized AI agents with real-time dashboard, model config, and full audit trails

Python 14,853 1,551 Updated Apr 9, 2026

Warcraft III Peon voice notifications (+ more!) for Claude Code, Codex, IDEs, and any AI agent. Stop babysitting your terminal. Employ a Peon today.

Shell 4,396 307 Updated Apr 8, 2026
Python 1 Updated Feb 23, 2026
Python 29 2 Updated Nov 4, 2025

A project that extracts Honkai: Star Rail text corpus

Python 37 2 Updated Jul 12, 2024

Extracting character conversations in Genshin Project

Python 75 8 Updated Feb 6, 2025

原神多语言文本搜索工具,可按关键字搜索所有文本、语音,可用于外语学习,剧情考据,模型训练等用途

Python 46 4 Updated Sep 3, 2024

Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.

Python 43 3 Updated Jan 15, 2026

🤗 R1-AQA Model: mispeech/r1-aqa

Python 321 29 Updated Mar 28, 2025

Train transformer language models with reinforcement learning.

Python 17,995 2,633 Updated Apr 10, 2026

Align Anything: Training All-modality Model with Feedback

Python 4,646 507 Updated Nov 27, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,558 343 Updated Jun 21, 2025

Official Repository for "SingFake: Singing Voice Deepfake Detection"

JavaScript 63 8 Updated Feb 26, 2024

Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。

Python 286 21 Updated Apr 8, 2026
JavaScript 1 5 Updated Aug 6, 2025

Vox-Profile Benchmark

Python 75 12 Updated Feb 16, 2026

Open-source framework for the research and development of foundation models.

Python 843 103 Updated Apr 10, 2026

An example starter repo for Python projects

Python 311 58 Updated Jun 16, 2025

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction

Python 220 13 Updated Feb 28, 2025

A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating generative audio.

Python 97 9 Updated Jun 12, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 706 51 Updated Jun 5, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,068 3,402 Updated Apr 10, 2026

Code reporsitory for the INTERSPEECH 2024 paper - IndicMOS: Multilingual MOS Prediction for 7 Indian languages

Python 7 Updated Apr 20, 2025

Python implementation of the SRMR toolbox

Python 129 45 Updated Jun 17, 2024

A fundamental toolkit designed for music, song, and audio generation

Python 1,332 136 Updated May 20, 2025

A simple library for Fréchet Audio Distance (FAD) calculation

Python 255 24 Updated Aug 22, 2025

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 288 23 Updated Mar 17, 2026

UTokyo-SaruLab MOS Prediction System

Python 308 30 Updated Apr 2, 2026

Awesome speech/audio LLMs, representation learning, and codec models

1,219 73 Updated Apr 4, 2026

Versatile Evaluation of Speech and Audio

Python 398 45 Updated Dec 9, 2025
Next