Stars
Stable Diffusion web UI
Command-line program to download videos from YouTube.com and other video sites
Robust Speech Recognition via Large-Scale Weak Supervision
SoftVC VITS Singing Voice Conversion
Realtime Voice Changer
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Python client for Baidu Yun / Baidu Netdisk (Personal Cloud Storage)
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Largest list of models for Core ML (for iOS 11+)
Perceptual video quality assessment based on multi-method fusion.
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
A series of large language models developed by Baichuan Intelligent Technology
Addon scripts, plugins, and skins for XBMC Media Center, specifically for the Chinese language.
Misc; latest version of waifu2x; 2D video to stereo 3D video conversion
GitHub Sensitive Information Leakage Monitoring
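This project implements Wav2lip video lip-sync synthesis based on SadTalkers. It generates lip shapes driven by speech, taking a video file as input, and applies a configurable enhancement method to the facial region to sharpen the synthesized lip (face) area and improve the clarity of the generated lips. It uses the DAIN deep-learning frame interpolation algorithm to add frames to the generated video, filling in the motion transitions of the synthesized lips between frames so that they look smoother, more realistic, and more natural.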
WeChat workflow for Alfred: quickly send messages & open chat windows & view chat history & more…
A PT cross-seeding tool. An easy-to-use cross-site seeding tool; this repo contains the project's backend code.
Learning English and writing tools
Real-Time end-to-end 2D-to-3D Video Conversion, based on deep learning.
Class Decompile is a python script for Hopper Disassembler. This script can export pseudo code of the classes.
A simple pitch shifting script (Time-Domain Pitch-Synchronous Overlap and Add)
Python script for converting NetEase Cloud Music ncm files; recursively processes all files in a folder