Stars
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
The official Python SDK for Model Context Protocol servers and clients
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
TikTok 发布/喜欢/合辑/直播/视频/图集/音乐;抖音发布/喜欢/收藏/收藏夹/视频/图集/实况/直播/音乐/合集/评论/账号/搜索/热榜数据采集工具/下载工具
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
小红书(XiaoHongShu、RedNote)链接提取/作品采集工具:提取账号发布、收藏、点赞、专辑作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书作品文件
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Multilingual Document Layout Parsing in a Single Vision-Language Model
A practical Douyin downloader for both single-item and profile batch downloads, with progress display, retries, SQLite deduplication, and browser fallback support. 抖音批量下载工具,去水印,支持视频、图集、合集、音乐(原声)。免费…
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.
A Model Context Protocol server for Excel file manipulation
英语字典 英语词库 字典词库 四级单词 六级单词 考研单词 雅思 托福 SAT GMAT TOEFL GRE
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
Upload videos to Youtube from the command line
Thai natural language processing in Python
👏Download all douyin videos of user(including favorites) , 下载指定用户的所有抖音视频以及收藏的视频(无水印)
小红书自动化,自动登录、可选择Cookie登录、支持上传图文、视频并自动发布
自动搬运youtube视频上传到B站脚本 bilibiliuploader