Skip to content
View benf22's full-sized avatar

Block or report benf22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of "Learn-to-Steer: Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image" (WACV 2026)

Jupyter Notebook 4 Updated Mar 19, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,303 2,892 Updated Mar 5, 2026

Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs, LAION-CLAP, MS-CLAP, DeSync

Python 74 8 Updated Feb 14, 2026

AblationBench is evaluation framework for language models on ablation planning in empricial AI research

Python 10 1 Updated Feb 2, 2026

[ISMIR 2025] A curated list of vision-to-music generation: methods, datasets, evaluation and challenges.

124 6 Updated Aug 9, 2025

Source for https://fullstackdeeplearning.com

HTML 1,337 215 Updated May 12, 2026

Official implementation of "Single Image Iterative Subject-driven Generation and Editing".

Python 99 5 Updated May 30, 2025

ImageBind One Embedding Space to Bind Them All

Python 9,046 843 Updated Nov 21, 2025

[InterSpeech 2023] The official PyTorch implementation of: "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation"

Python 89 6 Updated May 18, 2026