Stars
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Controllable and fast Text-to-Speech for over 7000 languages!
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
Generate audiobooks from e-books, voice cloning & 1107+ languages!
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Face Swapping on Image and Video With Deep Fake Methods
AI voice assistant made with Streamlit python and powered by Gemini, Mistral and PHI-3. This is a virtual assistant application built in Python that can understand voice commands and complete tasks…
a python based voice assistant that uses voice recognition, speech synthesis, and natural language processing (NLP) to provide a service through a particular application. This project provides an …
Virtual Voice Assistant is a project that utilizes machine learning and natural language processing to enable users to control their devices using voice commands. Technologies used include TensorFl…