Stars
Real-time face swap and one-click video deepfake with only a single image
The definitive Web UI for local AI, with powerful features and easy setup.
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required)
Realtime Voice Changer
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally; no third-party API required.
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating SRT files. Text recognition runs locally; no third-party API required. A deep-learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
A Web UI for easy subtitle generation using the Whisper model.
🏆 📚 A list of awesome MkDocs projects and plugins.
A GUI tool for generating subtitles from video and audio and saving them as SRT files. Speech-to-text runs locally; no third-party API required. A Transformer-based video subtitle generation framework.
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Video subtitle translation: takes an SRT file as input and generates a translated SRT file. A deep-learning-based video subtitle translation framework. No third-party API required; subtitle translation runs locally…