Stars
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
A natural language interface for computers
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Official Code for DragGAN (SIGGRAPH 2023)
Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
Open-Sora: Democratizing Efficient Video Production for All
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成(QQ / Telegram / 企微 / 飞书 / 钉钉等),强大易用的插件系统,支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体
Automate Creation of YouTube Shorts using MoviePy.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
可循环值守和多人录制的直播录制软件,支持抖音、TikTok、Youtube、快手、虎牙、斗鱼、B站、小红书、pandatv、sooplive、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、17Live、Twitch、Acfun、CHZZK、shopee等40+平台直播录制
该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic.
Use LLMs to track and extract websites, RSS feeds, and social media
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
抖音批量下载工具,去水印,支持视频、图集、合集、音乐(原声)。免费!免费!免费!
an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网
[CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)