Wang et al. [27] propose the Residual Attention Network, which uses an encoder-decoder style attention module. By refining the feature maps, the network not only performs well but is also robust to noisy inputs. Instead of directly computing the 3D attention map, we decompose the process into steps that learn channel attention and spatial attention separately. This separate attention generation process for a 3D feature map incurs far less computational and parameter overhead, and the module can therefore be used as a plug-and-play component in pre-existing base CNN architectures.
Closer to our work, Hu et al. [28] introduce a compact module to exploit the inter-channel relationship. In their Squeeze-and-Excitation module, they use global average-pooled features to compute channel-wise attention. However, we show that these are suboptimal features for inferring fine channel attention, and we suggest using max-pooled features as well. Their module also misses spatial attention, which plays an important role in deciding ‘where’ to focus, as shown in [29]. In our CBAM, we exploit both spatial and channel-wise attention based on an efficient architecture and empirically verify that exploiting both is superior to using only channel-wise attention as in [28]. Moreover, we empirically show that our module is effective in detection tasks (MS COCO and VOC). In particular, we achieve state-of-the-art performance on the VOC 2007 test set simply by placing our module on top of an existing one-shot detector [30].
Concurrently, BAM [31] takes a similar approach, decomposing 3D attention map inference into channel and spatial attention. They place the BAM module at every bottleneck of the network, whereas we plug ours into every convolutional block.
3 Convolutional Block Attention Module
Given an intermediate feature map $\mathbf{F} \in \mathbb{R}^{C \times H \times W}$ as input, CBAM sequentially infers a 1D channel attention map $\mathbf{M}_c \in \mathbb{R}^{C \times 1 \times 1}$ and a 2D spatial attention map $\mathbf{M}_s \in \mathbb{R}^{1 \times H \times W}$, as illustrated in Fig. 1. The overall attention process can be summarized as:
\[
\mathbf{F}' = \mathbf{M}_c(\mathbf{F}) \otimes \mathbf{F}, \qquad
\mathbf{F}'' = \mathbf{M}_s(\mathbf{F}') \otimes \mathbf{F}',
\tag{1}
\]
where $\otimes$ denotes element-wise multiplication. During multiplication, the attention values are broadcasted (copied) accordingly: channel attention values are broadcasted along the spatial dimension, and vice versa. $\mathbf{F}''$ is the final refined output. Fig. 2 depicts the computation process of each attention map. The following describes the details of each attention module.
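As a concrete reference point before the per-module details, the snippet below is a minimal PyTorch-style sketch of the sequential process in Eq. (1). The average- and max-pooling choices mirror the aggregation discussed in this section, while the shared-MLP reduction ratio and the 7×7 convolution kernel size are assumptions of this sketch, not prescribed by the equation above.

```python
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """Infers M_c in R^{C x 1 x 1} from a feature map of shape (B, C, H, W)."""

    def __init__(self, channels: int, reduction: int = 16):  # reduction ratio is an assumption
        super().__init__()
        # Shared MLP applied to both the average- and max-pooled descriptors.
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        avg = self.mlp(x.mean(dim=(2, 3)))   # squeeze spatial dims by averaging
        mx = self.mlp(x.amax(dim=(2, 3)))    # squeeze spatial dims by max
        return torch.sigmoid(avg + mx).view(b, c, 1, 1)


class SpatialAttention(nn.Module):
    """Infers M_s in R^{1 x H x W} by pooling along the channel axis."""

    def __init__(self, kernel_size: int = 7):  # kernel size is an assumption
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        avg = x.mean(dim=1, keepdim=True)    # (B, 1, H, W)
        mx = x.amax(dim=1, keepdim=True)     # (B, 1, H, W)
        return torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1)))


class CBAM(nn.Module):
    """Eq. (1): F' = M_c(F) * F, then F'' = M_s(F') * F'."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.channel_att = ChannelAttention(channels, reduction)
        self.spatial_att = SpatialAttention()

    def forward(self, f: torch.Tensor) -> torch.Tensor:
        # Attention values are broadcast during the element-wise product:
        # M_c over spatial positions, M_s over channels.
        f1 = self.channel_att(f) * f
        f2 = self.spatial_att(f1) * f1
        return f2
```

For example, `CBAM(channels=64)` maps a tensor of shape `(2, 64, 32, 32)` to a refined tensor of the same shape, which is what allows the module to be dropped after a convolutional block without changing the surrounding architecture.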
Channel attention module. We produce a channel attention map by exploiting the inter-channel relationship of features. As each channel of a feature map is considered a feature detector [32], channel attention focuses on ‘what’ is meaningful given an input image. To compute the channel attention efficiently, we squeeze the spatial dimension of the input feature map. For aggregating spatial information, average-pooling has been commonly adopted so far. Zhou et al.