#
🎯
Focusing
Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning
-
UESTC PhD, TJU Master's
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
1
result
for source starred repositories
written in PHP
Clear filter
A full-stack web application that integrates AV-HuBERT, a state-of-the-art audio-visual speech recognition model developed by Meta, to enable lip-reading and transcription from uploaded videos.