LindgeW

🎯

Focusing

Lam Chi LindgeW

Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning

32 followers · 75 following

UESTC PhD, TJU Master's

Achievements

Lists (6)

AVSE

AVSR

29 repositories

Lip2Speech/Speech2Lip

PaperReading

Super Star

Mark some fundamental multimodal repos

14 repositories

VAE

Starred repositories

596 stars written in Python

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python · 13,542 stars · 1,987 forks · Updated Nov 3, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python · 13,358 stars · 1,354 forks · Updated Oct 1, 2025

OpenTalker / SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python · 13,333 stars · 2,548 forks · Updated Jun 26, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python · 12,898 stars · 1,193 forks · Updated Nov 4, 2025

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python · 12,564 stars · 2,729 forks · Updated Jun 22, 2025

allenai / allennlp

An open-source NLP research library, built on PyTorch.

Python · 11,882 stars · 2,242 forks · Updated Nov 22, 2022

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python · 10,929 stars · 949 forks · Updated Nov 7, 2025

openai / DALL-E

PyTorch package for the discrete VAE used for DALL·E.

Python · 10,875 stars · 1,905 forks · Updated Jan 31, 2024

kkroening / ffmpeg-python

Python bindings for FFmpeg - with complex filtering support

Python · 10,837 stars · 929 forks · Updated Aug 4, 2024

kornia / kornia

🐍 Geometric Computer Vision Library for Spatial AI

Python · 10,828 stars · 1,065 forks · Updated Nov 6, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python · 10,740 stars · 1,597 forks · Updated Nov 6, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python · 10,682 stars · 1,139 forks · Updated Apr 9, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python · 9,568 stars · 2,343 forks · Updated Nov 5, 2025

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python · 8,888 stars · 2,439 forks · Updated Oct 28, 2025

facebookresearch / ImageBind

ImageBind: One Embedding Space to Bind Them All

Python · 8,852 stars · 827 forks · Updated Oct 3, 2025

lucidrains / imagen-pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python · 8,386 stars · 791 forks · Updated Oct 7, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System

Python · 8,283 stars · 1,908 forks · Updated Sep 6, 2025

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python · 7,965 stars · 789 forks · Updated Feb 11, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python · 7,726 stars · 1,382 forks · Updated Dec 6, 2023

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Python · 7,596 stars · 1,699 forks · Updated Apr 25, 2024

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python · 7,419 stars · 1,176 forks · Updated Mar 21, 2025

1adrianb / face-alignment

🔥 2D and 3D face alignment library built using PyTorch

Python · 7,414 stars · 1,380 forks · Updated Aug 30, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python · 6,889 stars · 642 forks · Updated Aug 15, 2025

facebookresearch / ConvNeXt

Code release for ConvNeXt model

Python · 6,179 stars · 725 forks · Updated Jan 8, 2023

lucidrains / DALLE-pytorch

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

Python · 5,629 stars · 646 forks · Updated Feb 17, 2024

mozillazg / python-pinyin

Convert Chinese characters to pinyin (pypinyin)

Python · 5,192 stars · 624 forks · Updated Oct 6, 2025

timesler / facenet-pytorch

Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models

Python · 5,033 stars · 1,000 forks · Updated Sep 16, 2025

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python · 4,889 stars · 1,165 forks · Updated Nov 3, 2025

hojonathanho / diffusion

Denoising Diffusion Probabilistic Models

Python · 4,805 stars · 446 forks · Updated Aug 29, 2023

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python · 4,801 stars · 1,317 forks · Updated Aug 14, 2024

Starred topics

vector-quantization

speaker-embedding

language-modelling

beam-search

seq2seq

Machine learning

variational-inference

information-bottleneck

listen-attend-and-spell

chinese-speech-recognition
