LindgeW

Lam Chi LindgeW

Research Interests: audio-visual speech recognition, lip-reading, NLP, deep learning

32 followers · 75 following

UESTC PhD, TJU Master's

Lists (6)

AVSE

AVSR

29 repositories

Lip2Speech/Speech2Lip

PaperReading

Super Star

Mark some fundamental multimodal repos

14 repositories

VAE

Starred repositories

747 results for source starred repositories

practical-tutorials / project-based-learning

Curated list of project-based tutorials

249,102 stars · 32,581 forks · Updated Aug 15, 2024

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python · 179,530 stars · 46,105 forks · Updated Nov 6, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python · 152,187 stars · 31,066 forks · Updated Nov 6, 2025

krahets / hello-algo

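"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; English version in translation.

Java · 118,260 stars · 14,521 forks · Updated Oct 30, 2025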

excalidraw / excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript · 109,799 stars · 11,429 forks · Updated Nov 6, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python · 90,462 stars · 11,328 forks · Updated Sep 8, 2025

fighting41love / funNLP

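Chinese and English sensitive words, language detection, Chinese and international mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character-decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, job title lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, continuous English text segmentation, various Chinese word vectors, company name list, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …

Python · 77,039 stars · 15,054 forks · Updated May 10, 2024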

CompVis / stable-diffusion

A latent text-to-image diffusion model

Jupyter Notebook · 71,762 stars · 10,516 forks · Updated Jun 18, 2024

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python · 64,155 stars · 6,506 forks · Updated Sep 19, 2025

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python · 58,779 stars · 9,373 forks · Updated Sep 23, 2025

ageitgey / face_recognition

The world's simplest facial recognition api for Python and the command line

Python · 55,703 stars · 13,700 forks · Updated Aug 21, 2024

facebookresearch / segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook · 52,394 stars · 6,132 forks · Updated Sep 18, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python · 49,060 stars · 8,216 forks · Updated Dec 9, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 43,330 stars · 5,740 forks · Updated Aug 16, 2024

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python · 40,625 stars · 4,613 forks · Updated Nov 6, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python · 40,429 stars · 3,116 forks · Updated Nov 7, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python · 38,110 stars · 4,133 forks · Updated Jul 6, 2025

exacity / deeplearningbook-chinese

Deep Learning Book Chinese Translation

TeX · 36,865 stars · 9,169 forks · Updated Dec 3, 2019

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python · 31,921 stars · 6,623 forks · Updated Sep 30, 2025

google-ai-edge / mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

C++ · 31,845 stars · 5,600 forks · Updated Nov 6, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python · 31,510 stars · 6,476 forks · Updated Nov 6, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook · 31,444 stars · 3,821 forks · Updated Jul 23, 2024

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python · 26,961 stars · 5,814 forks · Updated Sep 27, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python · 25,717 stars · 4,104 forks · Updated Nov 5, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python · 25,271 stars · 1,763 forks · Updated Oct 13, 2025

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python · 24,361 stars · 3,427 forks · Updated Oct 28, 2025

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python · 24,346 stars · 5,816 forks · Updated Aug 14, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python · 23,908 stars · 2,659 forks · Updated Aug 12, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python · 21,816 stars · 2,665 forks · Updated Jul 3, 2025

HqWu-HITCS / Awesome-Chinese-LLM

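A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are relatively cheap to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.

21,638 stars · 2,052 forks · Updated May 19, 2025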

Starred topics

vector-quantization

speaker-embedding

language-modelling

beam-search

seq2seq

Machine learning

variational-inference

information-bottleneck

listen-attend-and-spell

chinese-speech-recognition
