Starred repositories
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
ByteDance's All-in-One Video Generation Model for Human-Object Interaction Video Generation
ERNIE-Image is an open text-to-image generation model developed by the ERNIE-Image team at Baidu. It is built on a single-stream Diffusion Transformer (DiT), with only 8B DiT parameters, it reaches…
Web dashboard for Hermes Agent — multi-platform AI chat, session management, scheduled jobs, usage analytics & channel configuration (Telegram, Discord, Slack, WhatsApp)
JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet,…
A Cinematic audio dubbing, Cloning and voice generation studio
Hermes WebUI: The best way to use Hermes Agent from the web or from your phone!
The agent that grows with you
OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.
🚀 An 800KB RAM ultra-lightweight Cloudflare WARP SOCKS5 proxy in Docker. 仅需 800KB 内存的纯内核态 Cloudflare WARP 代理 - Docker
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
tradingagents-cn中文增强版二次开发,增加多股批量分析功能,超强ai多智能体多维度分析股票,全面支持A股,极大的节省分析股票的精力和时间
TradingAgents-cn-PLUS版,增加股票批量分析,增加会员管理功能,增加会员点数功能,可部署到服务器对外进行运营收费,部署方法同原版。测试账号密码:user,财力有限,仅有20个测试点数,开通批量分析,模型推荐使用deepseek,且用且珍惜
基于多智能体LLM的中文金融交易框架 - TradingAgents中文增强版
Wav2Lip version 288 and pipeline to train
A comprehensive AI-powered video production studio. Features local batch processing for automated dubbing (XTTS), smart audio censorship (Whisper), and visual NSFW blurring (NudeNet) wrapped in a m…
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
real time face swap and one-click video deepfake with only a single image
High-Quality Voice Cloning TTS for 600+ Languages