An Interpretation of Forward-Propagation and Back-Propagation of DNN
Fig. 1. (a) The normal DNN training procedure contains two steps, forward (blue) and backward (red), which form two networks sharing weights. (b) The network fp-DNN represents the forward pass, extracting features. (c) The network bp-DNN has the inverted structure of fp-DNN but shares the same parameters; it transports the gradients (or label information) from the top to the bottom. (Color figure online)
2 Formulation of Deep Neural Networks
Classification is a basic task in machine learning. In this paper, we use a DNN to model the classification task and analyze how the DNN is trained. We assume a classification task with $C$ classes and a training data set $\{x_i, y_i\}_{i=1}^{N}$ that contains $N$ training samples, where $x \in \mathbb{R}^S$ is the input signal and $y \in \{0,1\}^C$ is the class label of $x$, with $y_c = 1$ if $x$ belongs to the $c$-th class and $y_i = 0$ for $i \neq c$ otherwise. The classification task for this data set is to train a DNN to predict the conditional distribution $p(y|x) = f_\Theta(x)$, where $f_\Theta(x)$ is the function of the DNN. For convenience, we denote $p = [p_1, p_2, \dots, p_C]^T \in \mathbb{R}^C$ as the output of $f_\Theta(x)$, with $p_i = p(y_i|x)$.
To solve this classification task, we construct a deep neural network with $L$ hidden layers and formulate it as [1] (Fig. 1(a)):

$$\mathrm{DNN} = \begin{cases} \ell_\Theta(x, y) = -\sum_{i=1}^{C} y_i \log p_i \\[4pt] p_i = \dfrac{e^{z_{L,i}}}{\sum_{j=1}^{C} e^{z_{L,j}}} \\[4pt] z_1 = W_1 x + b_1 \\ z_l = W_l\,\sigma(z_{l-1}) + b_l, \quad 2 \le l \le L \end{cases} \tag{1}$$
where $W_l \in \mathbb{R}^{C_l \times C_{l-1}}$ and $b_l \in \mathbb{R}^{C_l}$ are the parameters of the $l$-th layer of the DNN, and $\ell_\Theta(x, y)$ is the softmax loss function. $\Theta$ denotes all the parameters of the DNN. $z_L = [z_{L,1}, z_{L,2}, \dots, z_{L,C}]^T$ is the linear output of the DNN, $z_l \in \mathbb{R}^{C_l}$ is the linear output of the $l$-th hidden layer, and $p = [p_1, p_2, \dots, p_C]^T$ is the final prediction of this DNN.
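A single forward pass under Eq. (1) computes the hidden activations layer by layer and then the softmax loss. The following NumPy sketch is illustrative only: the layer sizes, the random initialization, and the choice of a sigmoid for $\sigma$ are assumptions, since the paper leaves $\sigma$ generic.

```python
import numpy as np

def sigma(z):
    # Elementwise nonlinearity; a sigmoid is assumed here for illustration.
    return 1.0 / (1.0 + np.exp(-z))

def dnn_forward(x, Ws, bs):
    # z_1 = W_1 x + b_1;  z_l = W_l sigma(z_{l-1}) + b_l for 2 <= l <= L.
    z = Ws[0] @ x + bs[0]
    for W, b in zip(Ws[1:], bs[1:]):
        z = W @ sigma(z) + b
    return z  # z_L, the linear output of the DNN

def softmax_loss(z_L, y):
    # p_i = exp(z_{L,i}) / sum_j exp(z_{L,j});  loss = -sum_i y_i log p_i.
    e = np.exp(z_L - z_L.max())          # shift for numerical stability
    p = e / e.sum()
    return p, -np.sum(y * np.log(p + 1e-12))

# Toy shapes (assumed): S = 4 inputs, one hidden layer of 5 units, C = 3 classes.
rng = np.random.default_rng(0)
Ws = [rng.standard_normal((5, 4)), rng.standard_normal((3, 5))]
bs = [np.zeros(5), np.zeros(3)]
x = rng.standard_normal(4)
y = np.array([0.0, 1.0, 0.0])            # one-hot label, c-th entry set to 1
p, loss = softmax_loss(dnn_forward(x, Ws, bs), y)
```

Note that the prediction $p$ always sums to one by construction of the softmax, and the loss is the cross-entropy between $p$ and the one-hot label $y$.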
We define two network structures corresponding to the training process with forward and back-propagation:

$$\text{fp-DNN} = \begin{cases} z_0 = x \\ z_1 = W_1 z_0 + b_1 \\ z_l = W_l\,\sigma(z_{l-1}) + b_l, \quad 2 \le l \le L \end{cases} \tag{2}$$
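The fp-DNN of Eq. (2) is the same forward chain viewed as a feature extractor: it retains every intermediate $z_l$ rather than only the final output. A minimal sketch, again with an assumed sigmoid $\sigma$ and toy layer shapes not taken from the paper:

```python
import numpy as np

def sigma(z):
    # Assumed sigmoid nonlinearity; the paper leaves sigma generic.
    return 1.0 / (1.0 + np.exp(-z))

def fp_dnn(x, Ws, bs):
    # Eq. (2): z_0 = x;  z_1 = W_1 z_0 + b_1;
    # z_l = W_l sigma(z_{l-1}) + b_l for 2 <= l <= L.
    zs = [x]                              # z_0 = x
    z = Ws[0] @ x + bs[0]
    zs.append(z)
    for W, b in zip(Ws[1:], bs[1:]):
        z = W @ sigma(z) + b
        zs.append(z)
    return zs                             # [z_0, z_1, ..., z_L]

rng = np.random.default_rng(1)
Ws = [rng.standard_normal((5, 4)), rng.standard_normal((3, 5))]
bs = [np.zeros(5), np.zeros(3)]
zs = fp_dnn(rng.standard_normal(4), Ws, bs)
```

Collecting the list of activations is what distinguishes this view from the plain forward pass: each $z_l$ is a feature representation at depth $l$.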