Stars
A feature-rich command-line audio/video downloader
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A PyTorch implementation of EfficientNet
🔥 2D and 3D Face alignment library build using pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
StyleGAN2-ADA - Official PyTorch implementation
StarGAN v2 - Official PyTorch Implementation (CVPR 2020)
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.
Schedule-Free Optimization in PyTorch
Speech Recognition using DeepSpeech2.
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
Code for the paper: Detecting Photoshopped Faces by Scripting Photoshop
PyTorch implementation of SimSiam https//arxiv.org/abs/2011.10566
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
Pure PyTorch Implementation of NVIDIA paper on Instant Training of Neural Graphics primitives: https://nvlabs.github.io/instant-ngp/
This is a implementation of the 3D FLAME model in PyTorch
Wrapper of 50+ image matching models with a unified interface
A 3DMM fitting framework using Pytorch.
Custom shape predictor model trained to find 81 facial feature landmarks given any image
This repository contains a pytorch implementation of "HeadNeRF: A Real-time NeRF-based Parametric Head Model (CVPR 2022)".
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)
The code of multi-attention deepfake detection
Audio-Visual Speech Separation with Cross-Modal Consistency
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)