Stars
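微舆: a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Built from scratch, with no dependency on any framework.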
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
✨ WithAnyone is capable of generating high-quality, controllable, and ID-consistent images
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Chrome DevTools for coding agents
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
An open-source AI agent that brings the power of Gemini directly into your terminal.
AI prompt engineering workbench for crafting, testing, and systematically evaluating prompts with powerful analysis tools.
Protocol Buffers for JavaScript & TypeScript.
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
It's like v0 but in your Cursor/WindSurf/Cline. 21st dev Magic MCP server for working with your frontend like Magic
[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
🦭 Video/Audio Downloader for Android, based on yt-dlp
Let's make video diffusion practical!
A Conversational Speech Generation Model
No fortress, purely open ground. OpenManus is Coming.
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.
Model Context Protocol Servers