Stars
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
OpenChat: Advancing Open-source Language Models with Imperfect Data
📷 EasyPhoto | Your Smart AI Photo Generator.
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Talk to any LLM with hands-free voice interaction, voice interruption, and a Live2D talking face, running locally across platforms
[ECCV 2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"
Ikaros-521 / AI-Vtuber
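Forked from sandboxdream/AI-Vtuber
AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qwen/kimi/ollama] that can interact with viewers in real time during live streams on [Bilibili/Douyin/Kuaishou/WeChat Channels/Pinduoduo/Douyu/YouTube/twitch/TikTok], or chat directly on the local machine…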
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
BleachBit system cleaner for Windows and Linux
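WiFi password brute-force cracking tool with a graphical interface; supports WPA/WPA2/WPA3, concurrent multi-instance operation, automatic cracking, custom password lists, and automatic password dictionary generation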
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Generative models for conditional audio generation
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
Core Engine of Singing Voice Conversion & Singing Voice Clone
faster_whisper GUI with PySide6
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
A Web UI for easy subtitle generation using the Whisper model.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Helps you discover excellent English-language projects without being distracted by those in other languages.
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension
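An extremely simple tool for separating vocals and background music, operated through a fully local web interface with no internet connection required, using 2stems/4stems/5stems models
Automatically collected IPv4 hotel TV live-stream sources, with automatic playback speed testing and automatic daily updates. Includes CCTV and satellite channels plus some local channels, with smooth playback. Can also run in Docker on OpenWrt or Synology.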