Stars
Stable Diffusion web UI
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Generate high-definition short videos with one click using large AI models.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Easily train a good VC model with voice data <= 10 mins!
Generative Models by Stability AI
DeepFaceLab is the leading software for creating deepfakes.
Avatars for Zoom, Skype and other video-conferencing apps.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
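Streamer-Sales (Sales Champion): a sales-streamer LLM 🛒🎁 that presents a product based on its given features, from the angle of stimulating users' desire to buy. 🚀⭐ Includes a detailed data generation workflow ❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG retrieval-augmented generation 📚, TTS text-to-speech 🔊, digital human generation 🦸, Agent web search for real-time information 🌐, ASR speech-to-text 🎙️, a Vue-based frontend 🍍, FastAPI …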
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Building large language models that understand social etiquette | Covers tutorials on prompt engineering, RAG, Agents, and LLM fine-tuning
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
A Python module for configuration of block devices