ftshijt

🏠

Working from home

Jiatong ftshijt

🏠

Working from home

165 followers · 45 following

Carnegie Mellon University
Pittsburgh, U.S.A.
shijt.site

Achievements

x3 x2 x3

Achievements

x3 x2 x3

Organizations

Stars

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,909 5,853 Updated Aug 16, 2024

psf / black

The uncompromising Python code formatter

Python 41,229 2,690 Updated Dec 12, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,138 3,502 Updated Jan 26, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,692 2,366 Updated Dec 18, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,315 3,237 Updated Dec 18, 2025

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,269 5,368 Updated Sep 22, 2025

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 14,739 12,616 Updated Dec 17, 2025

tpope / vim-pathogen

pathogen.vim: manage your runtimepath

Vim Script 12,144 1,155 Updated Aug 24, 2022

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,726 1,168 Updated Nov 14, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 10,947 1,616 Updated Dec 15, 2025

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,214 863 Updated Jul 6, 2024

Jack-Cherish / Machine-Learning

⚡机器学习实战（Python3）：kNN、决策树、贝叶斯、逻辑回归、SVM、线性回归、树回归

Python 10,131 5,108 Updated Jul 12, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,646 2,363 Updated Dec 16, 2025

bentrevett / pytorch-seq2seq

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,663 1,367 Updated Jan 20, 2024

wkentaro / gdown

Google Drive Public File Downloader when Curl/Wget Fails

Python 5,005 400 Updated Aug 12, 2025

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,887 496 Updated Oct 12, 2024

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 4,605 507 Updated Nov 27, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,387 319 Updated Jun 21, 2025

hmmlearn / hmmlearn

Hidden Markov Models in Python, with scikit-learn like API

Python 3,329 748 Updated Oct 31, 2024

PlayVoice / whisper-vits-svc

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,841 923 Updated Apr 23, 2024

s3prl / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,487 520 Updated Jun 13, 2025

glotlabs / gdrive

Google Drive CLI Client

Rust 1,962 127 Updated Aug 3, 2024

kan-bayashi / ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,630 349 Updated Apr 22, 2024

alibaba / Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,473 216 Updated Dec 15, 2025

k2-fsa / k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,295 232 Updated Nov 19, 2025

FunAudioLLM / FunMusic

A fundamental toolkit designed for music, song, and audio generation

Python 1,261 130 Updated May 20, 2025

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

1,190 74 Updated Aug 13, 2025

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 1,099 132 Updated Dec 16, 2025

X-LANCE / SLAM-LLM

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 939 100 Updated Oct 24, 2025

bspaans / python-mingus

Mingus is a music package for Python

Python 919 170 Updated Apr 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiatong ftshijt

Achievements

Achievements

Organizations

Block or report ftshijt

Stars

coqui-ai / TTS

psf / black

meta-llama / llama3

huggingface / trl

NVIDIA-NeMo / NeMo

kaldi-asr / kaldi

alshedivat / al-folio

tpope / vim-pathogen

facebookresearch / seamless_communication

speechbrain / speechbrain

AIGC-Audio / AudioGPT

Jack-Cherish / Machine-Learning

espnet / espnet

bentrevett / pytorch-seq2seq

wkentaro / gdown

microsoft / muzic

PKU-Alignment / align-anything

MoonshotAI / Kimi-Audio

hmmlearn / hmmlearn

PlayVoice / whisper-vits-svc

s3prl / s3prl

glotlabs / gdrive

kan-bayashi / ParallelWaveGAN

alibaba / Pai-Megatron-Patch

k2-fsa / k2

FunAudioLLM / FunMusic

ga642381 / speech-trident

facebookresearch / fairseq2

X-LANCE / SLAM-LLM

bspaans / python-mingus