-
UESTC PhD, TJU Master's
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
π Geometric Computer Vision Library for Spatial AI
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
π Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Wav2Lip version 288 and pipeline to train
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Implementation of ViViT: A Video Vision Transformer
π spafe: Simplified Python Audio Features Extraction
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Pytorch implementation of deep audio embedding calculation
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
Audio-Visual Speech Recognition using Sequence to Sequence Models
PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)