136 stars written in Python

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

Python 12,103 1,670 Updated Nov 7, 2025

An automated management tool for NAS media libraries

Python 9,812 1,204 Updated Nov 6, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,314 888 Updated Aug 28, 2025

A sound cloning tool with a web interface that uses your own voice or any other sound to record audio

Python 8,808 961 Updated Aug 29, 2025

vits2 backbone with multilingual-bert

Python 8,603 1,245 Updated Nov 4, 2025

AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution. Runs locally; no third-party API required.

Python 8,417 1,049 Updated Jun 26, 2025

An acceleration project for GitHub releases, archives, and project files

Python 8,397 2,289 Updated Oct 9, 2025

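Daily check-in scripts based on Docker / Qinglong Panel / Synology (multi-account support). Check-in list: |iQIYI|WeSing|Youdao Note|Baidu Tieba|Bilibili|V2EX|AcFun|SMZDM|Aliyun Drive|i Moutai purchase application|Mi Fit|Baidu Search Resource Platform|Enshan Forum|Aola Star|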

Python 8,045 1,303 Updated Sep 29, 2025

A simple local web interface that uses ChatTTS to synthesize text into speech, and also exposes an API for external use.

Python 7,395 905 Updated Aug 29, 2025

Automated video uploading to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili

Python 7,165 1,271 Updated Nov 3, 2025

Play music on a XiaoAi speaker; the music is downloaded with yt-dlp.

Python 6,927 693 Updated Nov 6, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 6,905 769 Updated Nov 7, 2025

A state-of-the-art open visual language model (multimodal pretrained model)

Python 6,690 445 Updated May 29, 2024

Macast is a cross-platform application that uses mpv as a DLNA Media Renderer.

Python 6,632 420 Updated Jan 26, 2023

SD-Trainer: LoRA and Dreambooth training scripts and a GUI that use kohya-ss's trainer, for diffusion models.

Python 5,776 659 Updated Sep 8, 2025

Search plugins for qBittorrent search feature

Python 5,669 532 Updated Sep 27, 2025

One-stop Proxies Crawling and Aggregation Platform

Python 5,610 5,109 Updated Nov 3, 2025

Merged ad-blocking rules, updated every 8 hours.

Python 5,566 358 Updated Nov 7, 2025

Inference and training library for high-quality TTS models.

Python 5,464 581 Updated Dec 10, 2024

QD [v20240210]: a framework for automatically running scheduled HTTP request tasks, based on HAR Editor and Tornado Server

Python 5,235 622 Updated Aug 16, 2025

GUI-focused roop

Python 5,232 924 Updated May 28, 2024

This repo is a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion

Python 4,985 738 Updated Jan 21, 2025

A music tag editor for editing the metadata of local music files.

Python 4,846 334 Updated Nov 4, 2025

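A collection of requirements and details that universities across China do not state in their admissions materials but that materially affect the quality of student life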

Python 4,670 705 Updated Aug 21, 2025

An AV metadata scraper that aggregates data from multiple sites

Python 4,576 398 Updated Feb 4, 2025

The offline version of CapsWriter, a handy voice input tool for PC

Python 4,342 373 Updated Jul 10, 2024

[WIP] Layer Diffusion for WebUI (via Forge)

Python 4,096 350 Updated Aug 30, 2024

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 4,093 450 Updated Aug 5, 2025

Understand Human Behavior to Align True Needs

Python 4,017 389 Updated Aug 13, 2025