🧲
Focusing on new projects. I may be slow to respond.


Out of time: automated lip sync in the wild

Python 852 184 Updated Jan 23, 2024

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 9,196 1,193 Updated Apr 2, 2024

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

Rust 8,254 673 Updated Dec 19, 2025
Python 1,452 152 Updated Nov 15, 2025

How to use our public wav2vec2 dimensional emotion model

Jupyter Notebook 531 50 Updated May 22, 2023

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,901 1,502 Updated Dec 17, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,365 407 Updated Jun 28, 2024

MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models

Python 180 8 Updated Jul 21, 2025

[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving

Python 259 12 Updated Aug 4, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,113 63 Updated Aug 7, 2025

Official repository for LTX-Video

Python 8,914 834 Updated Oct 25, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,762 103 Updated Nov 4, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,196 105 Updated Oct 15, 2025

[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Python 1,605 125 Updated Aug 20, 2025

Let's make video diffusion practical!

Python 16,357 1,592 Updated Oct 16, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,869 289 Updated Dec 11, 2025

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python 507 19 Updated Dec 11, 2024

GLaDOS Checkin Automatically

Python 1 Updated Apr 27, 2025

Scalable and memory-optimized training of diffusion models

Python 1,312 144 Updated Jun 4, 2025

"Swimmin' in the money, come and find me, Nemo"

Jupyter Notebook 6 1 Updated Oct 8, 2025

wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting

Python 308 38 Updated Nov 24, 2025

Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars

Python 94 11 Updated Nov 4, 2024

Diffusion-based Portrait and Animal Animation

Python 846 83 Updated Dec 9, 2025

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 391 25 Updated Apr 8, 2025
Python 1,558 198 Updated Dec 15, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,129 54 Updated Mar 5, 2025

Enjoy the magic of Diffusion models!

Python 11,173 1,054 Updated Dec 19, 2025
Python 6,052 467 Updated Aug 29, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 614 74 Updated Dec 17, 2025