Wei-Ting Chen (queen2891)

MS student @ NTHU EECS. Obsessed with making speech models smaller and faster. Currently poking at codec LMs and streaming ASR.

Highlights

stream-whisper (Public)

Real-time streaming ASR built on Whisper with VAD-based chunking and WebSocket support

Python

speech-data-pipe (Public)

End-to-end data preparation pipeline for speech LLM training: diarization, filtering, codec tokenization, and manifest generation

Python

codec-eval (Public)

Evaluation toolkit for comparing neural speech codecs (EnCodec, DAC, SpeechTokenizer) across quality, speaker, and ASR metrics

Python

queen2891 (Public)

Profile README

PSIVG (Public)

Forked from MarkHershey/PSIVG

[CVPR 2026] Physical Simulator In-the-Loop Video Generation

Python

SoulNexus (Public)

Forked from LingByte/SoulNexus

TypeScript