Starred repositories
Segment Anything for Stable Diffusion WebUI
roop extension for StableDiffusion web-ui
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Character Animation (AnimateAnyone, Face Reenactment)
DeepFill v1/v2 with Contextual Attention and Gated Convolution, CVPR 2018, and ICCV 2019 Oral
Awesome work on hand pose estimation/tracking
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
FILM: Frame Interpolation for Large Motion, in ECCV 2022.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Unofficial Implementation of Animate Anyone
Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Image Deblurring using Generative Adversarial Networks
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis
PyTorch implementation of "Deep Flow-Guided Video Inpainting" (CVPR'19)
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
VideoSys: An easy and efficient system for video generation
Implementation of Make-A-Video, new SOTA text-to-video generator from Meta AI, in PyTorch
A CNN-based PyTorch implementation of facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) on FER2013 and 94.64% on the CK+ dataset
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"
MobileNetV3 in PyTorch, with pre-trained models provided
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code