ftshijt
🏠 Working from home
  • Carnegie Mellon University
  • Pittsburgh, U.S.A.

Organizations

@SJTMusicTeam


🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,909 5,853 Updated Aug 16, 2024

The uncompromising Python code formatter

Python 41,229 2,690 Updated Dec 12, 2025

The official Meta Llama 3 GitHub site

Python 29,138 3,502 Updated Jan 26, 2025

Train transformer language models with reinforcement learning.

Python 16,692 2,366 Updated Dec 18, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,315 3,237 Updated Dec 18, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,269 5,368 Updated Sep 22, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,739 12,616 Updated Dec 17, 2025

pathogen.vim: manage your runtimepath

Vim Script 12,144 1,155 Updated Aug 24, 2022

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,726 1,168 Updated Nov 14, 2024

A PyTorch-based Speech Toolkit

Python 10,947 1,616 Updated Dec 15, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,214 863 Updated Jul 6, 2024

⚡ Machine Learning in Action (Python3): kNN, decision trees, Bayes, logistic regression, SVM, linear regression, tree regression

Python 10,131 5,108 Updated Jul 12, 2024

End-to-End Speech Processing Toolkit

Python 9,646 2,363 Updated Dec 16, 2025

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,663 1,367 Updated Jan 20, 2024

Google Drive Public File Downloader when Curl/Wget Fails

Python 5,005 400 Updated Aug 12, 2025

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

Align Anything: Training All-modality Model with Feedback

Python 4,605 507 Updated Nov 27, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,387 319 Updated Jun 21, 2025

Hidden Markov Models in Python, with scikit-learn like API

Python 3,329 748 Updated Oct 31, 2024

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,841 923 Updated Apr 23, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,487 520 Updated Jun 13, 2025

Google Drive CLI Client

Rust 1,962 127 Updated Aug 3, 2024

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,630 349 Updated Apr 22, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,473 216 Updated Dec 15, 2025

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,295 232 Updated Nov 19, 2025

A fundamental toolkit designed for music, song, and audio generation

Python 1,261 130 Updated May 20, 2025

Awesome speech/audio LLMs, representation learning, and codec models

1,190 74 Updated Aug 13, 2025

FAIR Sequence Modeling Toolkit 2

Python 1,099 132 Updated Dec 16, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 939 100 Updated Oct 24, 2025

Mingus is a music package for Python

Python 919 170 Updated Apr 21, 2024