Lists (3)
Sort Name ascending (A-Z)
Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
all of the workflows of n8n i could find (also from the site itself)
State-of-the-art 2D and 3D Face Analysis Project
Turns Data and AI algorithms into production-ready web applications in no time.
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Wan: Open and Advanced Large-Scale Video Generative Models
StyleGAN2 - Official TensorFlow Implementation
Noise supression using deep filtering
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA …
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
Uses machine learning to denoise audio containing speech