Skip to content
View ftshijt's full-sized avatar
🏠
Working from home
🏠
Working from home
  • Carnegie Mellon University
  • Pittsburgh, U.S.A.

Organizations

@SJTMusicTeam

Block or report ftshijt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 16,694 2,366 Updated Dec 18, 2025

Open-source framework for the research and development of foundation models.

HTML 667 67 Updated Dec 18, 2025

UTokyo-SaruLab MOS Prediction System

Python 272 28 Updated Dec 18, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,319 3,238 Updated Dec 18, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,740 12,615 Updated Dec 17, 2025

End-to-End Speech Processing Toolkit

Python 9,646 2,363 Updated Dec 16, 2025

FAIR Sequence Modeling Toolkit 2

Python 1,099 132 Updated Dec 16, 2025

A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.

Python 643 59 Updated Dec 15, 2025

A PyTorch-based Speech Toolkit

Python 10,946 1,615 Updated Dec 15, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,473 216 Updated Dec 15, 2025

The uncompromising Python code formatter

Python 41,227 2,690 Updated Dec 12, 2025

Versatile Evaluation of Speech and Audio

Python 365 46 Updated Dec 9, 2025

Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.

Python 270 23 Updated Dec 6, 2025
Python 41 3 Updated Dec 4, 2025

Your faithful, impartial partner for audio evaluation — know yourself and your rivals.真实评测,知己知彼。

Python 183 9 Updated Dec 4, 2025
Python 26 2 Updated Nov 29, 2025

Align Anything: Training All-modality Model with Feedback

Python 4,605 507 Updated Nov 27, 2025

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,295 232 Updated Nov 19, 2025
Python 28 2 Updated Nov 4, 2025

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementa…

Python 117 4 Updated Oct 24, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 939 100 Updated Oct 24, 2025

State-of-the-art pretrained music models for training, evaluation, inference

Python 147 14 Updated Oct 9, 2025

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 127 10 Updated Oct 2, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,269 5,368 Updated Sep 22, 2025

Vox-Profile Benchmark

Python 58 10 Updated Sep 12, 2025

Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository

HTML 7 12 Updated Aug 26, 2025

A simple library for Fréchet Audio Distance (FAD) calculation

Python 240 24 Updated Aug 22, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,190 74 Updated Aug 13, 2025

Google Drive Public File Downloader when Curl/Wget Fails

Python 5,005 400 Updated Aug 12, 2025
Next