should be coupled to capsule $j$.

$$c_{ij} = \frac{\exp(b_{ij})}{\sum_k \exp(b_{ik})} \qquad (3)$$
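The routing softmax of Eq. 3 can be sketched in NumPy. This is an illustrative sketch, not the authors' code; the layout of the logit array (lower-level capsules as rows, upper-level capsules as columns) is an assumption:

```python
import numpy as np

def coupling_coefficients(b):
    """Eq. 3: c_ij = exp(b_ij) / sum_k exp(b_ik).

    b: routing logits, assumed shape (num_lower, num_upper); the softmax
    is taken over the upper-layer capsules k for each lower capsule i.
    """
    b = b - b.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(b)
    return e / e.sum(axis=1, keepdims=True)

b = np.zeros((3, 4))            # initial logits b_ij = 0
c = coupling_coefficients(b)
print(c[0])                     # uniform: each c_ij = 1/4 when all logits are zero
```

With all logits initialized to zero, every lower-level capsule spreads its output uniformly over the capsules above it, which is the starting point that routing then refines.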
The log priors can be learned discriminatively at the same time as all the other weights. They depend on the location and type of the two capsules but not on the current input image². The initial coupling coefficients are then iteratively refined by measuring the agreement between the current output $v_j$ of each capsule $j$ in the layer above and the prediction $\hat{u}_{j|i}$ made by capsule $i$. The agreement is simply the scalar product $a_{ij} = v_j \cdot \hat{u}_{j|i}$. This agreement is treated as if it were a log likelihood and is added to the initial logit $b_{ij}$ before computing the new values for all the coupling coefficients linking capsule $i$ to higher-level capsules.
In convolutional capsule layers, each capsule outputs a local grid of vectors to each type of capsule
in
the layer above using different transformation matrices for each member of the grid as well as
for
each type of capsule.
Procedure 1 Routing algorithm.
1: procedure ROUTING($\hat{u}_{j|i}$, $r$, $l$)
2:   for all capsule $i$ in layer $l$ and capsule $j$ in layer $(l+1)$: $b_{ij} \leftarrow 0$.
3:   for $r$ iterations do
4:     for all capsule $i$ in layer $l$: $c_i \leftarrow \mathrm{softmax}(b_i)$    ▷ softmax computes Eq. 3
5:     for all capsule $j$ in layer $(l+1)$: $s_j \leftarrow \sum_i c_{ij}\hat{u}_{j|i}$
6:     for all capsule $j$ in layer $(l+1)$: $v_j \leftarrow \mathrm{squash}(s_j)$    ▷ squash computes Eq. 1
7:     for all capsule $i$ in layer $l$ and capsule $j$ in layer $(l+1)$: $b_{ij} \leftarrow b_{ij} + \hat{u}_{j|i} \cdot v_j$
   return $v_j$
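Procedure 1 can be sketched end to end in NumPy. This is an illustrative implementation under stated assumptions, not the authors' code: the prediction tensor layout `(num_i, num_j, dim_j)` is an assumed convention, and `squash` implements the squashing nonlinearity of Eq. 1:

```python
import numpy as np

def squash(s, axis=-1, eps=1e-9):
    """Eq. 1: shrink short vectors toward 0, long vectors toward unit length."""
    sq_norm = np.sum(s**2, axis=axis, keepdims=True)
    return (sq_norm / (1.0 + sq_norm)) * s / np.sqrt(sq_norm + eps)

def routing(u_hat, r):
    """Procedure 1: dynamic routing.

    u_hat: predictions u_hat_{j|i}, assumed shape (num_i, num_j, dim_j).
    r:     number of routing iterations.
    Returns the output vectors v_j, shape (num_j, dim_j).
    """
    num_i, num_j, _ = u_hat.shape
    b = np.zeros((num_i, num_j))                      # step 2: b_ij <- 0
    for _ in range(r):                                # step 3
        c = np.exp(b - b.max(axis=1, keepdims=True))  # step 4: c_i <- softmax(b_i)
        c /= c.sum(axis=1, keepdims=True)
        s = (c[:, :, None] * u_hat).sum(axis=0)       # step 5: s_j <- sum_i c_ij u_hat_{j|i}
        v = squash(s)                                 # step 6: v_j <- squash(s_j)
        b = b + (u_hat * v[None, :, :]).sum(axis=-1)  # step 7: b_ij += u_hat_{j|i} . v_j
    return v

rng = np.random.default_rng(0)
v = routing(rng.standard_normal((6, 10, 16)), r=3)
print(v.shape)  # (10, 16)
```

Note that the logits $b_{ij}$ are re-initialized to zero for each forward pass, so the coupling coefficients are determined by the agreement on the current input rather than being learned parameters.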
3 Margin loss for digit existence
We are using the length of the instantiation vector to represent the probability that a capsule’s entity
exists. We would like the top-level capsule for digit class k to have a long instantiation vector if and
only if that digit is present in the image. To allow for multiple digits, we use a separate margin loss $L_k$ for each digit capsule $k$:
$$L_k = T_k \,\max(0,\, m^+ - \|v_k\|)^2 + \lambda\, (1 - T_k)\, \max(0,\, \|v_k\| - m^-)^2 \qquad (4)$$
where $T_k = 1$ iff a digit of class $k$ is present³ and $m^+ = 0.9$ and $m^- = 0.1$. The $\lambda$ down-weighting of the loss for absent digit classes stops the initial learning from shrinking the lengths of the activity vectors of all the digit capsules. We use $\lambda = 0.5$. The total loss is simply the sum of the losses of all digit capsules.
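Eq. 4 is straightforward to express in NumPy. A minimal sketch, not the authors' code (the example capsule lengths are made up for illustration):

```python
import numpy as np

def margin_loss(v_lengths, targets, m_plus=0.9, m_minus=0.1, lam=0.5):
    """Eq. 4, summed over all digit capsules.

    v_lengths: ||v_k|| for each digit capsule, shape (num_classes,).
    targets:   T_k, 1 if digit class k is present in the image, else 0.
    """
    present = targets * np.maximum(0.0, m_plus - v_lengths) ** 2
    absent = lam * (1 - targets) * np.maximum(0.0, v_lengths - m_minus) ** 2
    return np.sum(present + absent)

lengths = np.array([0.95, 0.05, 0.2])  # hypothetical capsule lengths
targets = np.array([1.0, 0.0, 0.0])    # only class 0 is present
print(margin_loss(lengths, targets))
```

In this example only the third capsule is penalized: its length 0.2 exceeds $m^- = 0.1$ even though its class is absent, while the present class already exceeds $m^+$ and the second capsule is below $m^-$.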
4 CapsNet architecture
A simple CapsNet architecture is shown in Fig. 1. The architecture is shallow with only two
convolutional layers and one fully connected layer. Conv1 has 256 9 × 9 convolution kernels with a
stride of 1 and ReLU activation. This layer converts pixel intensities to the activities of local
feature
detectors that are then used as inputs to the primary capsules.
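The resulting feature-map size follows from standard "valid" convolution arithmetic. A quick check, assuming a 28 × 28 MNIST input (the input size is not stated in this excerpt):

```python
def conv_out(size, kernel, stride):
    """Spatial size of a 'valid' (no padding) convolution output."""
    return (size - kernel) // stride + 1

# Conv1: 9x9 kernels at stride 1 on an assumed 28x28 input.
h1 = conv_out(28, 9, 1)
print(h1)  # 20 -> Conv1 output is 20 x 20 with 256 channels
```

The same helper applies to the stride-2 9 × 9 capsule convolutions described below.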
The primary capsules are the lowest level of multi-dimensional entities and, from an inverse
graphics
perspective, activating the primary capsules corresponds to inverting the rendering
process. This is a
very different type of computation from piecing instantiated parts together to
make familiar wholes,
which is what capsules are designed to be good at.
The second layer (PrimaryCapsules) is a convolutional capsule layer with 32 channels of
convolutional
8D capsules (i.e. each primary capsule contains 8 convolutional units with a 9 × 9
kernel and a stride
of 2). Each primary capsule output sees the outputs of all 256 × 81 Conv1