Ph.D. Student at TTIC | Speech & NLP | NTU Speech Lab.
- Chicago, IL
- https://ming024.github.io/
NeMo Public
Forked from NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python | Apache License 2.0 | Updated Sep 7, 2024
dynamic-superb Public
Forked from dynamic-superb/dynamic-superb
The official repository of Dynamic-SUPERB.
Python | Updated Aug 20, 2024
SpeechLLM_Survey Public
Codebase for benchmarking several open-sourced SpeechLLM models
layerwise-analysis Public
Forked from ankitapasad/layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
Python | Updated Mar 6, 2024
FastSpeech2 Public
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
SpeechLM_finetuning Public
Forked from microsoft/SpeechT5
Final project for TTIC 31120, Spring 2023
fairseq Public
Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python | MIT License | Updated Apr 12, 2023