Skip to content
View AlvinZheng's full-sized avatar

Block or report AlvinZheng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
48 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,275 32,575 Updated Mar 22, 2026

The agent engineering platform

Python 130,681 21,526 Updated Mar 23, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,460 11,914 Updated Dec 15, 2025

🏡 Open source home automation that puts local control and privacy first.

Python 85,729 37,060 Updated Mar 23, 2026

Models and examples built with TensorFlow

Python 77,693 45,217 Updated Mar 17, 2026

scikit-learn: machine learning in Python

Python 65,511 26,836 Updated Mar 22, 2026

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Python 57,073 17,441 Updated Mar 18, 2026

The world's simplest facial recognition api for Python and the command line

Python 56,224 13,708 Updated Aug 21, 2024

Deepfakes Software For All

Python 55,060 13,414 Updated Mar 21, 2026

Ultralytics YOLO 🚀

Python 54,832 10,534 Updated Mar 23, 2026

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 44,897 6,008 Updated Aug 16, 2024

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 42,263 5,094 Updated Feb 6, 2026

结巴中文分词

Python 34,815 6,714 Updated Aug 21, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,129 6,865 Updated Mar 23, 2026

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Python 25,538 11,696 Updated Jun 7, 2024

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,169 1,875 Updated Mar 7, 2026

Python scraper based on AI

Python 23,095 2,021 Updated Mar 18, 2026

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 22,583 4,045 Updated Mar 23, 2026

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 15,782 2,216 Updated Jul 24, 2024

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 14,305 3,841 Updated Feb 18, 2025

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,891 2,795 Updated Jun 22, 2025

StyleGAN2 - Official TensorFlow Implementation

Python 11,182 2,514 Updated May 18, 2024

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,912 1,094 Updated Aug 29, 2025

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,514 698 Updated Feb 11, 2026

Stable Diffusion built-in to Blender

Python 8,130 439 Updated Aug 26, 2024

BoxMOT: Pluggable SOTA multi-object tracking modules with support for axis-aligned and oriented bounding boxes

Python 8,068 1,893 Updated Mar 19, 2026

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,230 1,068 Updated Aug 5, 2024

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,898 743 Updated Feb 4, 2026

NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Python 6,177 1,098 Updated Aug 8, 2024

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 4,436 515 Updated Feb 18, 2026
Next