Skip to content

Henry-Lee-real/StableI2I

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

StableI2I

Official implementation of StableI2I: Spotting Unintended Changes in Image-to-Image Transition (ICML 2026)

Any questions can be consulted -> (Email:lijiayang.cs@gmail.com)

Looking forward to your ⭐!

📌 TODOs

  • release code
  • release ckpt
  • release pip-pkg
  • release arxiv
  • ICML version paper

HuggingFace Project Page arXiv 2605.04453

Core Concept:

In most real-world image-to-image (I2I) scenarios, existing evaluations primarily focus on instruction following and the perceptual quality or aesthetics of the generated images. However, they largely fail to assess whether the output image preserves the semantic correspondence and spatial structure of the input image. To address this limitation, we propose StableI2I, a unified and dynamic evaluation framework that explicitly measures content fidelity and pre--post consistency across a wide range of I2I tasks without requiring reference images, including image editing and image restoration. In addition, we construct StableI2I-Bench, a benchmark designed to systematically evaluate the accuracy of MLLMs on such fidelity and consistency assessment tasks. Extensive experimental results demonstrate that StableI2I provides accurate, fine-grained, and interpretable evaluations of content fidelity and consistency, with strong correlations to human subjective judgments. Our framework serves as a practical and reliable evaluation tool for diagnosing content consistency and benchmarking model performance in real-world I2I systems.

Environment Setting:

Install dependencies:

pip install -r requirements.txt

The specific environment is consistent with that of Qwen3-VL.

APP Usage:

image

app.py is the local web demo and API entry. Running it starts a FastAPI service with a browser UI.

Example:

set MODEL_PATH=path/to/ckpt
set GPU_ID=0
set HOST=127.0.0.1
set PORT=10004
python app.py

Then open:

http://127.0.0.1:10004

The demo supports:

  • built-in examples
  • inference by local image path
  • inference by image upload
  • summarized semantic / structure / low-level results

Inference

See infer.md.

Training

Recommended official references:

Notes:

  • For SFT, start from the official Qwen3-VL finetuning workflow.
  • For GRPO and related alignment training, use Swift.

Citation

If you find our work helpful for your research, please consider citing our work.

@article{li2026stablei2i,
  title={StableI2I: Spotting Unintended Changes in Image-to-Image Transition},
  author={Li, Jiayang and Cao, Shuo and Li, Xiaohui and Zhang, Zhizhen and Zhu, Kaiwen and Duan, Yule and Qiao, Yu and Zhang, Jian and Liu, Yihao},
  journal={arXiv preprint arXiv:2605.04453},
  year={2026}
}

About

Official implementation of StableI2I (ICML 2026)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages