UMCFuse: A Unified Multiple Complex Scenes Infrared and Visible Image Fusion Framework (IEEE-TIP 2025)
π Paper: [IEEE Xplore]
π Paper: [Arxiv]
UMCFuse is a unified framework for infrared and visible image fusion (IVIF) in complex scenes, including haze, rain, snow, overexposure, fire, and noisy environments. It effectively integrates multi-modal information, preserves fine details, suppresses noise, and enhances scene understanding for downstream tasks such as object detection, semantic segmentation, and salient object detection. UMCFuse can be applied to fusion scenarios regardless of whether degradation is present or absent.
To the best of our knowledge, it is the π₯ first study π₯ addressing unified infrared and visible image fusion in complex scenes.
-
Unified Framework for Complex Scenes
Handles multiple challenging environmental conditions within a single model, eliminating the need for scene-specific tuning. -
Transmission Map Guided Decomposition
Estimates light transmission for robust separation of contrast and structure layers, improving fusion quality in adverse conditions. -
Adaptive High-Frequency Denoising
Preserves fine details while suppressing noise in high-frequency components. -
Multi-Scale Low-Frequency Extraction
Effectively captures energy information in high-contrast regions for robust structural fusion.
UMCFuse leverages transmission maps to decompose images into contrast and structure layers, separating degraded pixels from original content.
- Rainy scenes: Raindrops scatter and refract light, creating blurred regions.
- Snow scenes: Snowflakes scatter light, reducing contrast and capturing snow features.
- Overexposed areas: Excessive light produces nearly uniform regions, highlighting saturated pixels.
Benefit: Separating degradation reduces interference during feature extraction and improves the preservation of useful details.
UMCFuse consistently preserves infrared luminance, visible textures, and scene details while suppressing noise across complex conditions.
Fusion comparison on normal, noisy, and overexposed scenes.
Fusion comparison on haze, rain, and blur scenes.
Fusion performance in snow, rain+noise, and fire conditions.
We quantitatively evaluated UMCFuse against 11 state-of-the-art methods across seven datasets using metrics like
-
Overall Comparison: Compared to the average of 11 competing methods, UMCFuse improves 13.64% in
$Q_{MI}$ ,$Q_{NCIE}$ ,$Q_G$ ,$Q_{abf}$ and 34.06% in$Q_{CV}$ ,$VIF$ ,$EN$ ,$SSIM$ .
UMCFuse demonstrates cross-domain generalization on medical images (50 SPECT-MRI pairs from Harvard Medical School). Compared with CDDFuse, SwinFusion, PRRGAN, and MDHU, UMCFuse consistently ranks top two across five metrics, highlighting robust feature extraction capability.
UMCFuse enhances feature representation for tasks such as:
- Object Detection: Higher detection rates and confidence.
- Semantic Segmentation: Improved segmentation accuracy under adverse conditions.
- Salient Object Detection: Accurate target localization with fewer artifacts.
- Depth Estimation: Better depth maps by reducing interference from noise and haze.
The following demonstrates how to run UMCFuse on infrared and visible image pairs using the provided MATLAB demo.
- MATLAB R2023a or other
- Image Processing Toolbox
- GPU recommended for faster computation
Choice your source images
Open demo.m in MATLAB (or any script)
If you find our work useful in your research, please consider citing:
@article{li2025umcfuse,
title={UMCFuse: A Unified Multiple Complex Scenes Infrared and Visible Image Fusion Framework},
author={Li, Xilai and Li, Xiaosong and Tan, Tianshu and Li, Huafeng and Ye, Tao},
journal={IEEE Transactions on Image Processing},
year={2025},
publisher={IEEE}
}