View filipstrand's full-sized avatar


[NeurIPS 2025 Spotlight] LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation

Python 83 5 Updated Dec 16, 2025

Taming large-scale full-parameter few-step training with self-adversarial flows! 👏🏻

Python 308 17 Updated Dec 15, 2025

Official SeedVR2 Video Upscaler for ComfyUI

Python 1,642 118 Updated Dec 13, 2025
Python 7,723 455 Updated Dec 14, 2025

Official inference repo for FLUX.2 models

Python 1,260 64 Updated Dec 1, 2025

Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)

Python 849 50 Updated Jul 2, 2025
Python 329 20 Updated Aug 28, 2025

FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.

Python 287 13 Updated Dec 4, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,059 41 Updated Dec 22, 2025

[ICCV 2025] Enhancing spatial understanding in text-to-image diffusion models

Python 89 7 Updated Sep 11, 2025

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 426 31 Updated Sep 1, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,468 365 Updated Dec 23, 2025

Solve puzzles. Improve your PyTorch.

Jupyter Notebook 3,853 348 Updated Jul 15, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,843 909 Updated Sep 1, 2024

Making Flux go brrr on GPUs.

Python 158 16 Updated Jul 18, 2025

Official implementation of "Normalized Attention Guidance"

Jupyter Notebook 175 8 Updated Jul 1, 2025

A course on LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,570 246 Updated Dec 18, 2025

ConceptAttention: A method for interpreting multi-modal diffusion transformers.

Jupyter Notebook 401 26 Updated Nov 13, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,048 114 Updated Dec 19, 2025
Python 2,483 239 Updated Jul 16, 2025

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

Python 1,076 53 Updated Nov 3, 2025

An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX.

Python 391 33 Updated Aug 15, 2025

[CVPR 2025] Diffusion Self-Distillation for Zero-Shot Customized Image Generation

Python 461 41 Updated Mar 18, 2025

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

Jupyter Notebook 808 73 Updated Jul 29, 2025

Training-free Regional Prompting for Diffusion Transformers 🔥

Python 690 32 Updated Nov 28, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,841 321 Updated Dec 21, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 890 43 Updated Dec 23, 2025

🚀 Cross attention map tools for huggingface/diffusers

Python 374 27 Updated Jan 18, 2025

[🚀ICML 2025] "Taming Rectified Flow for Inversion and Editing" Using FLUX and HunyuanVideo for image and video editing!

Python 604 15 Updated May 1, 2025