Skip to content
View LindgeW's full-sized avatar
🎯
Focusing
🎯
Focusing
  • UESTC PhD, TJU Master's

Block or report LindgeW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

22 results for sponsorable starred repositories
Clear filter

Python interface to the WebRTC Voice Activity Detector (VAD) [released with binary wheels!]

C 33 2 Updated Oct 27, 2025

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Python 633 194 Updated May 27, 2023

PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)

Python 25 1 Updated Mar 5, 2021

A repository that will hold my experiments with various variational models

Jupyter Notebook 14 2 Updated Aug 28, 2024

Wav2Lip version 288 and pipeline to train

Python 637 160 Updated Aug 13, 2025

View model summaries in PyTorch!

Python 2,876 131 Updated Nov 3, 2025

🐍 Geometric Computer Vision Library for Spatial AI

Python 10,826 1,065 Updated Nov 5, 2025

Implementation of ViViT: A Video Vision Transformer

Python 554 68 Updated Jun 21, 2021

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,590 1,968 Updated Oct 21, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 20,966 2,848 Updated Oct 21, 2025

Pytorch implementation of deep audio embedding calculation

Python 104 10 Updated Jul 23, 2023

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 264 47 Updated Jul 15, 2025

🔉 spafe: Simplified Python Audio Features Extraction

Python 476 79 Updated Mar 20, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,182 333 Updated Sep 10, 2025

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python 114 33 Updated May 22, 2019

Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained

Jupyter Notebook 4,551 1,092 Updated Jul 1, 2021

Audio-Visual Speech Recognition using Sequence to Sequence Models

Python 83 28 Updated Jul 10, 2020

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 1,082 186 Updated Dec 22, 2023

🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures

Python 1,893 331 Updated Nov 7, 2022

《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation

Java 118,238 14,517 Updated Oct 30, 2025

🔥Highlighting the top ML papers every week.

12,051 741 Updated Jul 20, 2025

A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.

Python 93 22 Updated Jul 23, 2025