Lists (1)
Sort Name ascending (A-Z)
Stars
We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.
The homepage of LongCat-Video-Avatar
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
🏂 Training-Free Human Mesh Recovery from Videos, based on SAM-3, Diffusion-VAS, and SAM-3D-Body.
A collection of models designed for compression artifact removal
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
Force Remove Copilot, Recall and More in Windows 11
Ultra-Realistic Portrait Animation Studio Transform still portraits into lifelike, animated videos using the power of AI. PresentaPulse combines LivePortrait for sophisticated facial animation and …
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
TTS model capable of streaming conversational audio in realtime.
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Real-time streaming voice anonymization & voice conversion
Advanced RVC Inference for quicker and effortless model downloads
Gausian - Rust-based local video editor for AI video production
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
real time face swap and one-click video deepfake with only a single image
Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions
lihaoyun6 / FlashVSR_plus
Forked from OpenImagingLab/FlashVSRTowards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
A beginner-friendly inference to finetune & run inference on open TTS models 🗣️