Stars
抖音弹幕姬,只需输入房间号,即可实时获取对应直播间的弹幕信息,并可将其转发到自己的后端服务
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
An innovative open-source project that revolutionizes short drama production in the AIGC era. This comprehensive platform integrates cutting-edge AI technologies to democratize video creation, enab…
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
DistroAV (formerly OBS-NDI): NDI integration for OBS Studio
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
This project is to remove the watermark from the sora2 generated videos, with best quality.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Learning English through the method of constructing sentences with conjunctions
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
可循环值守和多人录制的直播录制软件,支持抖音、TikTok、Youtube、快手、虎牙、斗鱼、B站、小红书、pandatv、sooplive、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、17Live、Twitch、Acfun、CHZZK、shopee等40+平台直播录制
Gemini polling proxy service (gemini轮询代理服务)
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
An open-source AI agent that brings the power of Gemini directly into your terminal.
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
OBS Studio filter for applying an arbitrary shader to a source.
智能闲鱼客服机器人系统:专为闲鱼平台打造的AI值守解决方案,实现闲鱼平台7×24小时自动化值守,支持多专家协同决策、智能议价和上下文感知对话。
An easy and fast way to create a Python GUI 🐍
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…
一个用于解析小说并通过TTS转换为语音的JavaFX软件(A novel parser and converter use TTS)
小白自建代理神器!ArgoSBX一键无交互小钢炮脚本💣:Sing-box、Xray、Argo三内核自动搭配;支持VPS、Docker、容器多环境部署;套CDN的4大方案+套WARP的15种组合;已支持协议:AnyTLS、Any-reality、Vless-xhttp-reality-vision-enc、Vless-tcp-reality-vision、Vless-xhttp-vision-…