ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

Python 12,090 1,666 Updated Nov 6, 2025

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,054 1,161 Updated Mar 14, 2025

magic-research / magic-animate

[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"

Python 10,869 1,107 Updated Aug 29, 2025

jianchang512 / clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

Python 8,807 961 Updated Aug 29, 2025

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,603 1,245 Updated Nov 4, 2025

YaoFANGUK / video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,405 1,047 Updated Jun 26, 2025

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,965 789 Updated Feb 11, 2024

microsoft / UFO

The Desktop AgentOS.

Python 7,695 937 Updated Sep 5, 2025

jianchang512 / ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 7,394 905 Updated Aug 29, 2025

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,148 1,057 Updated Aug 5, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 6,885 641 Updated Aug 15, 2025

sczhou / ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Python 6,342 737 Updated Feb 19, 2025

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,035 629 Updated Aug 10, 2024

Hillobar / Rope

GUI-focused roop

Python 5,232 924 Updated May 28, 2024

modelscope / FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

Python 5,106 607 Updated Jul 11, 2025

Previous Next

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chong5u

Block or report chong5u

Stars

mlc-ai / mlc-llm

1Panel-dev / MaxKB

SYSTRAN / faster-whisper

iperov / DeepFaceLab

KwaiVGI / LivePortrait

FunAudioLLM / CosyVoice

lllyasviel / FramePack

jianchang512 / pyvideotrans

index-tts / index-tts

camel-ai / camel

SWivid / F5-TTS

OpenTalker / SadTalker

LibreTranslate / LibreTranslate

FujiwaraChoki / MoneyPrinter

agent0ai / agent-zero

Comfy-Org / ComfyUI-Manager