Most existing attention mechanisms have no supervised signal during the training phase, which limits the model feature learning capability. To solve this problem, we propose a novel parameter-free attention mechanism based on class activation mapping. Attention mechanisms usually consist of spatial attention and channel attention, which indicates that 'where' and 'what' is more meaningful, respectively. Our attention also contains both types of attention. For Spatial Attention, we use class activation mapping as a supervision signal to guide the generation of it directly in space. Thus our spatial attention can pay more attention to the informative pedestrian parts of the scene and reduce background interference. For Channel Attention, the importance of each channel is obtained by the similarity between the aforementioned spatial attention and the feature map of each channel. In this manner, our channel attention is indirectly guided by class activation mapping. In addition, our attention is parameter-free, which reduces the risk of over-fitting. Finally, we conduct extensive evaluations on three popular benchmark datasets including Market1501, DukeMTMC-reID, and MSMT17, demonstrating the effectiveness of our approach on discriminative person representations.
|Number of pages||5|
|Journal||IEEE Signal Processing Letters|
|Early online date||24 Jun 2022|
|Publication status||Published (in print/issue) - 19 Jul 2022|
Bibliographical noteFunding Information:
The work was supported in part by the National Natural Science Foundation of China under Grants 61973066 and 61471110 and in part by the Major Science and Technology Projects of Liaoning Province under Grant 2021JH1/10400049.
© 1994-2012 IEEE.
- attention mechanism
- class activation mapping
- Person re-identification