#
🎯
Focusing
Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning
-
UESTC PhD, TJU Master's
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
6
results
for forked starred repositories
Clear filter
This is an official implementation for "Video Swin Transformers".
daanzu / py-webrtcvad-wheels
Forked from wiseman/py-webrtcvadPython interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]
YosukeHiguchi / espnet
Forked from espnet/espnetEnd-to-End Speech Processing Toolkit
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
edgarGracia / av_hubert
Forked from facebookresearch/av_hubertA self-supervised learning framework for audio-visual speech