Lam Chi LindgeW

🎯

Focusing

Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning

32 followers · 75 following

UESTC PhD, TJU Master's

Starred repositories

596 stars written in Python

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 179,535 46,105 Updated Nov 7, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,192 31,066 Updated Nov 6, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,467 11,330 Updated Sep 8, 2025

fighting41love / funNLP

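Chinese and English sensitive words, language detection, Chinese and international mobile/phone number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation library, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English simulation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …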

Python 77,043 15,056 Updated May 10, 2024

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,158 6,506 Updated Sep 19, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 58,779 9,373 Updated Sep 23, 2025

ageitgey / face_recognition

The world's simplest facial recognition api for Python and the command line

Python 55,703 13,700 Updated Aug 21, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,065 8,220 Updated Dec 9, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,332 5,742 Updated Aug 16, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,625 4,613 Updated Nov 7, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,430 3,116 Updated Nov 7, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,113 4,133 Updated Jul 6, 2025

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,921 6,623 Updated Sep 30, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,511 6,477 Updated Nov 7, 2025

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 26,963 5,813 Updated Sep 27, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python 25,721 4,103 Updated Nov 5, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,271 1,763 Updated Oct 13, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,364 3,428 Updated Oct 28, 2025

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,346 5,817 Updated Aug 14, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,909 2,659 Updated Aug 12, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,816 2,665 Updated Jul 3, 2025

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 20,976 2,852 Updated Oct 21, 2025

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,657 1,640 Updated Sep 30, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 18,897 1,570 Updated Oct 31, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,614 1,975 Updated Oct 21, 2025

state-spaces / mamba

Mamba SSM architecture

Python 16,350 1,481 Updated Oct 10, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,051 3,181 Updated Nov 6, 2025

microsoft / Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Lists (6)

AVSE

AVSR

Lip2Speech/Speech2Lip

PaperReading

Super Star

VAE

Starred repositories

vector-quantization

speaker-embedding

language-modelling

beam-search

seq2seq

Machine learning

variational-inference

information-bottleneck

listen-attend-and-spell

chinese-speech-recognition