Abstract
Salient object detection is a fundamental problem in image processing and computer vision. Many saliency detection algorithms based on the background and frequencydomain are used to extract salient object clues. However, the former causes the real object to be submerged in the detected object areas, especially in complex or small object scenes. While the latter will lead to the loss of some object information when detecting large objects. To solve these problems and achieve better object detection results, we propose a fusion framework for salient object detection by fusing background and frequencydomain features. The background features of the image are extracted by an improved background model. This model represents the spatial layout of the image area with respect to the image boundaries. Meanwhile, we present a new frequency-domain processing method to obtain multiscale frequency-domain features and mark the saliency of the object at different scales. Within our framework, inspired by human visual attention, we use the idea of a self-attention mechanism to capture the intrinsic relation between background
and multiscale frequency-domain features. In addition, this fusion framework provides a three-dimensional Gaussian convolution kernel, which expands two-dimensional local information to three dimensions for feature fusion, thus producing more accurate salient objects. Experiment results demonstrate that the proposed method consistently outperforms eleven state-of-the-art methods on five challenging and complicated datasets in terms of four evaluation metrics.
and multiscale frequency-domain features. In addition, this fusion framework provides a three-dimensional Gaussian convolution kernel, which expands two-dimensional local information to three dimensions for feature fusion, thus producing more accurate salient objects. Experiment results demonstrate that the proposed method consistently outperforms eleven state-of-the-art methods on five challenging and complicated datasets in terms of four evaluation metrics.
Original language | English |
---|---|
Pages (from-to) | 53-71 |
Number of pages | 19 |
Journal | Information Sciences |
Volume | 618 |
Early online date | 3 Nov 2022 |
DOIs | |
Publication status | Published (in print/issue) - 31 Dec 2022 |
Bibliographical note
Funding Information:This work was supported by the National Science Foundation of China (No. U1803261 and 62261053) and the International Science and Technology Cooperation Project of the Ministry of Education of the People’s Republic of China (No. 2016–2196).
Funding Information:
This work was supported by the National Science Foundation of China under Grant 62261053 and Grant U1803261 and the International Science and Technology Cooperation Project of the Ministry of Education of the People’s Republic of China (No. 2016–2196).
Publisher Copyright:
© 2022 Elsevier Inc.
Keywords
- Background feature Frequency-domain feature Fusion framework Salient object detection Three-dimensional Gaussian convolution kernel
- Three-dimensional Gaussian convolution kernel
- Frequency-domain feature
- Fusion framework
- Salient object detection
- Background feature