Skip to content
View LindgeW's full-sized avatar
🎯
Focusing
🎯
Focusing
  • UESTC PhD, TJU Master's

Block or report LindgeW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

17 results for sponsorable starred repositories written in Python
Clear filter

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 20,998 2,856 Updated Oct 21, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,660 1,979 Updated Oct 21, 2025

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,840 1,068 Updated Nov 10, 2025

View model summaries in PyTorch!

Python 2,878 131 Updated Nov 3, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,183 333 Updated Sep 10, 2025

πŸ”“ Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Python 1,893 331 Updated Nov 7, 2022

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,085 186 Updated Dec 22, 2023

Wav2Lip version 288 and pipeline to train

Python 637 160 Updated Aug 13, 2025

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 633 194 Updated May 27, 2023

Implementation of ViViT: A Video Vision Transformer

Python 554 68 Updated Jun 21, 2021

πŸ”‰ spafe: Simplified Python Audio Features Extraction

Python 476 79 Updated Mar 20, 2025

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 264 47 Updated Jul 15, 2025

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python 114 33 Updated May 22, 2019

Pytorch implementation of deep audio embedding calculation

Python 106 11 Updated Jul 23, 2023

A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.

Python 93 22 Updated Jul 23, 2025

Audio-Visual Speech Recognition using Sequence to Sequence Models

Python 83 28 Updated Jul 10, 2020

PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)

Python 26 1 Updated Mar 5, 2021