FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

This repository is the official implementation of FlowAlign, an inversion & training free image editing algorithm.

Abstract

💡 Recent inversion-free, flow-based editors leverage models like Stable Diffusion 3 to enable text-driven image editing via ODE integration.

🤔 However, skipping latent inversion often leads to unstable trajectories and poor source consistency.

🚀 FlowAlign addresses this by introducing a flow-matching loss—a simple yet effective regularizer that ensures smooth, semantically aligned, and structurally consistent edits.

🌟 Thanks to its ODE-based formulation, FlowAlign naturally supports reverse editing, highlighting its reversible and robust transformation capability.

Requirements

Clone this repo:

git clone https://github.com/FlowAlign/FlowAlign.git
cd FlowAlign

To install requirements:

conda create -n flowalign python==3.11
conda activate flowalign
pip install torch==2.1.2+cu118 torchvision==0.16.2+cu118 --extra-index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

Quick Start

For the text-based image editing, run:

Examples 1

python run_edit.py \
  --img_path "samples/bicycle.jpg" \
  --src_prompt "a slanted mountain bicycle on the road in front of a building" \
  --tgt_prompt "a slanted rusty mountain bicycle on the road in front of a building"

The expected result:

Example 2

python run_edit.py \
  --img_path "samples/cat.jpg" \
  --src_prompt "a opened eyes cat sitting on wooden floor" \
  --tgt_prompt "a closed eyes cat sitting on wooden floor"

The expected result:

How to choose editing methods

You can freely change the editing method using arguments:

method : dual / sdedit / flowedit / flowalign

Efficient inference

If you use --efficient_memory, text encoder will pre-compute text embeddings and is removed from the GPU.

This allows us to run image editing with a single GPU with VRAM 24GB.

Reproducibility

All edited images were generated on a single NVIDIA RTX 3090 GPU, using a fixed random seed of 123 and a Classifier-Free Guidance (CFG) scale of 13.5.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
assets		assets
diffusion		diffusion
samples		samples
utils		utils
README.md		README.md
requirements.txt		requirements.txt
run_edit.py		run_edit.py
run_t2i.py		run_t2i.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

Abstract

Requirements

Quick Start

How to choose editing methods

Efficient inference

Reproducibility

About

Uh oh!

Releases

Packages

Uh oh!

Languages

FlowAlign/FlowAlign

Folders and files

Latest commit

History

Repository files navigation

FlowAlign: Trajectory-Regularized, Inversion-Free Flow-based Image Editing

Abstract

Requirements

Quick Start

How to choose editing methods

Efficient inference

Reproducibility

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages