Stars
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Robust Speech Recognition via Large-Scale Weak Supervision
Interact with your documents using the power of GPT, 100% privately, no data leaks
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Generate high-definition short videos with one click using large AI models.
🤱🏻 Turn any webpage into a desktop app with one command.
A generative speech model for daily dialogue.
A programmer's guide to living longer
Easily train a good VC model with voice data <= 10 mins!
SoftVC VITS Singing Voice Conversion
Industry leading face manipulation platform
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
Rembg is a tool to remove image backgrounds
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Let's make video diffusion practical!
This repository contains the source code for the paper First Order Motion Model for Image Animation
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Ultra-lightweight Chinese OCR with support for vertical text recognition and ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M.
Official implementation of AnimateDiff.
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
The first industrial-grade, full-pipeline AI film and video production platform. A professional AI agent platform for controllable film and video production, from shorts to live-action, with Hollywood-standard workflows.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos, producing output files at the original resolution. Runs locally, with no third-party API required.
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!
Automate video uploads to social media: Douyin, Xiaohongshu, WeChat Channels, TikTok, YouTube, Bilibili