Stars
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platfo…
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构…
Free OpenAI-compatible AI API with 16,000+ models, image generation, tool calling, and Discord key signup.
Fast passive subdomain enumeration tool.
FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction, LLM-powered planning, and precise tool orches…
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai
The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track.
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]
Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022]
Skill for Agent automating JianYing (CapCut Chinese version) video editing.
Python library for finding similar content in videos.
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
DiffuEraser is a diffusion model for video Inpainting, you can use it in ComfyUI
A powerful ComfyUI workflow for removing watermarks from videos using DiffuEraser technology. Originally designed for Sora 2 generated content, this workflow can be adapted and used for removing an…
DiffuEraser is a diffusion model for video inpainting, which performs great content completeness and temporal consistency while maintaining acceptable efficiency.
S-Trespassing/Trae-Account-Creator 的复活+小幅改进版。
一键生成数字人视频系统| 批处理AI视频自动生成系统|基于文本自动生成AI图像生成/数字人视频|TTS|音色克隆|ASR|OCR|支持水印和logo添加转场效果|字幕自动生成|元素叠加|剪辑合并|ffmpeg
一个智能自媒体管理平台,支持头条号、公众号、抖音、小红书、哔哩哔哩一键发布,定时发布,阅读状态,用户活动等智能管理,集成DeepSeek、Qwen3、O3、Gemini2.0、 Claude 4等主流大模型,实现一句话创作爆款自媒体文章和视频。
🚀 爆款内容生成器 v3.0 - 全平台内容创作系统(图文/短视频/长视频)
一键产出爆款视频:1.自动提取对标文案 2.自动进行文案仿写 3.自动根据文案声音克隆 4.自动生成数字人口播 5.自动添加字幕 6.自动添加背景音乐 7.自动添加视频标题 8.自动生成视频封面 9.自动将视频发布到各平台
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
AI-Powered Watermark Remover using Florence-2 and LaMA: Remove watermarks from images and videos, including AI-generated content from Sora, Runway, and others. Features a modern PyWebview GUI.
Multi-platform auto-proxy client, supporting Sing-box, X-ray, TUIC, Hysteria, Reality, Trojan, SSH etc. It’s an open-source, secure and ad-free.
[ICCV 2025] SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree
🎬 seedance2接入 开源本地 AI 短剧 & 漫剧生成工具 —— 从故事到成片一站式完成,数据不出本机,短剧工作流管理平台,高灵活度,AI真人剧,AI漫剧本地搞定。 Open-source local AI short drama maker: story → storyboard → video, fully offline, your data stays yours. 纳米流水线