Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning
UESTC PhD, TJU Master's
Starred repositories
5 stars written in C
Python interface to the WebRTC Voice Activity Detector
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
daanzu / py-webrtcvad-wheels
Forked from wiseman/py-webrtcvad
Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]