Skip to content
View enjoyteach's full-sized avatar
🌴
假期中
🌴
假期中

Block or report enjoyteach

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
147 stars written in Python
Clear filter

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 12,566 2,730 Updated Jun 22, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,339 1,947 Updated Oct 20, 2025

Generate 3D objects conditioned on text or images

Python 12,127 1,048 Updated Jun 22, 2024

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

Python 12,118 1,676 Updated Nov 8, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,062 1,138 Updated Jul 13, 2024
Python 11,584 1,521 Updated Nov 1, 2025

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,872 1,107 Updated Aug 29, 2025

A PyTorch-based Speech Toolkit

Python 10,748 1,598 Updated Nov 7, 2025

🎮 Chinese DOS games collections.

Python 9,865 1,201 Updated Aug 7, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,323 889 Updated Aug 28, 2025

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Python 8,809 962 Updated Aug 29, 2025

FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU needed. The user can ask a question and the system will make a mu…

Python 8,726 913 Updated Apr 18, 2024

CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)

Python 8,676 667 Updated Aug 13, 2024

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,432 1,050 Updated Jun 26, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,367 734 Updated Aug 13, 2024

Simultaneous speech-to-text model

Python 8,318 781 Updated Nov 8, 2025

Fast Segment Anything

Python 8,137 745 Updated Jul 30, 2024

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 8,015 831 Updated Aug 21, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,726 1,382 Updated Dec 6, 2023

CodeGeeX2: A More Powerful Multilingual Code Generation Model

Python 7,610 536 Updated Jul 10, 2024

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,148 1,057 Updated Aug 5, 2024

Multilingual Voice Understanding Model

Python 6,897 641 Updated Aug 15, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,690 445 Updated May 29, 2024

支持更多游戏规则,让SSTap成为真正的“网游加速器”

Python 6,468 1,199 Updated Nov 7, 2025

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 6,165 1,222 Updated Aug 4, 2025

yolo3+ocr

Python 6,106 1,723 Updated Aug 29, 2022

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,044 631 Updated Aug 10, 2024

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,753 451 Updated Sep 26, 2024

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,671 651 Updated Jun 4, 2025