Official code for "AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation" (CVPR2023)

Python 260 26 Updated Apr 26, 2023

[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 4,244 485 Updated Jul 10, 2024

An open source, self-hosted implementation of the Shotstack API backend

Go 46 5 Updated Apr 19, 2024

Python SDK for Shotstack, the cloud video editing API

Python 25 4 Updated Jul 19, 2024

Code and dataset for photorealistic Codec Avatars driven from audio

Python 2,847 280 Updated Sep 15, 2024

Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video

Python 486 56 Updated Aug 6, 2023

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,449 2,575 Updated Jun 26, 2024

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 3,267 581 Updated Aug 16, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,031 6,637 Updated Sep 30, 2025

Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)

Python 1,970 139 Updated Dec 1, 2025

This repository contains the code for my master's thesis on Emotion-Aware Facial Animation

Jupyter Notebook 147 29 Updated Dec 8, 2022

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,932 206 Updated Nov 15, 2024

Audio-Visual Speech Separation with Cross-Modal Consistency

Python 243 38 Updated Jul 25, 2023

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python 433 92 Updated Oct 23, 2023

A PyTorch CUDA extension implementation of instant-ngp (SDF and NeRF), with a GUI.

Python 2,210 287 Updated Nov 10, 2023

A Blender add-on for importing a sequence of OBJ meshes as frames

Python 727 53 Updated Sep 2, 2025

Resources of Neural Rendering

2,334 214 Updated Sep 20, 2025

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

Python 1,281 218 Updated Jun 19, 2023

StudioGAN is a PyTorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.

Python 3,485 347 Updated Aug 9, 2024

A repository listing resources to help you prepare for a Data Science/Machine Learning interview. New resources added frequently.

3,259 743 Updated Aug 7, 2024

TensorFlow's Visualization Toolkit

TypeScript 7,068 1,687 Updated Dec 19, 2025

VIP cheatsheets for Stanford's CS 230 Deep Learning

6,860 1,437 Updated May 20, 2020

A curated list of different papers and datasets in various areas of audio-visual processing

752 66 Updated Jan 30, 2024

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs

Python 12,695 2,752 Updated Jun 22, 2025

Machine Learning and Computer Vision Engineer - Technical Interview Questions

4,301 694 Updated May 20, 2025

Deep neural networks for voice conversion (voice style transfer) in TensorFlow

Python 3,937 840 Updated Sep 30, 2022

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Python 148 42 Updated Jul 6, 2023

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 898 174 Updated Jul 6, 2023

FSGAN - Official PyTorch Implementation

Jupyter Notebook 776 150 Updated Nov 13, 2025