Skip to content
View webliupeng's full-sized avatar

Block or report webliupeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
14 stars written in Python
Clear filter

The world's simplest facial recognition api for Python and the command line

Python 56,112 13,715 Updated Aug 21, 2024

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,880 5,251 Updated Jan 7, 2026

We write your reusable computer vision tools. 💜

Python 36,478 3,097 Updated Feb 11, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,440 2,729 Updated Aug 12, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 20,086 2,140 Updated Feb 10, 2026

微信机器人 / 可能是最优雅的微信个人号 API ✨✨

Python 14,294 2,392 Updated Jul 14, 2019

Generate 3D objects conditioned on text or images

Python 12,205 1,062 Updated Jun 22, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,713 995 Updated Aug 12, 2024
Python 7,846 528 Updated Apr 14, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,724 449 Updated May 29, 2024

中文古诗自动作诗机器人,x炸天,基于tensorflow1.10 api,正在积极维护升级中,快star,保持更新!

Python 3,648 932 Updated Apr 7, 2024

Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced Reranking and Noise-injected Training.

Python 91 5 Updated Nov 15, 2024