模型
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
A generative speech model for daily dialogue.
🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
A Conversational Speech Generation Model