1 November 2024 Saliency-guided convolution neural network–transformer fusion network for no-reference image quality assessment
Lipeng Wu, Ziguan Cui, Zongliang Gan, Guijin Tang, Feng Liu
Author Affiliations +
Abstract

Convolution neural networks (CNNs) and transformers are good at extracting local and global features, respectively, whereas both local and global features are important for the no-reference image quality assessment (NR-IQA) task. Therefore, we innovatively propose a CNN–transformer dual-stream parallel fusion network for NR-IQA that can simultaneously extract local and global hierarchical features related to image quality. In addition, considering the importance of saliency in NR-IQA, a saliency-guided CNN and transformer feature fusion module is proposed to fuse and optimize the hierarchical features extracted by the dual-stream network. Finally, the high-level features of the dual-stream network are fused through the local and global cross-attention module to better model the interaction relationship between local and global information in the image, and the quality prediction module containing evaluation and weight branches is used to obtain the quality score of distorted images. To comprehensively evaluate the performance of our model, we conducted experiments on six standard image quality assessment datasets, and the experimental results showed that our model has better quality prediction performance and generalization ability than previous representative NR-IQA models.

© 2024 SPIE and IS&T
Lipeng Wu, Ziguan Cui, Zongliang Gan, Guijin Tang, and Feng Liu "Saliency-guided convolution neural network–transformer fusion network for no-reference image quality assessment," Journal of Electronic Imaging 33(6), 063001 (1 November 2024). https://doi.org/10.1117/1.JEI.33.6.063001
Received: 5 April 2024; Accepted: 14 October 2024; Published: 1 November 2024
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Image quality

Transformers

Feature extraction

Neural networks

Data modeling

Distortion

Performance modeling

Back to Top