A Brief Didactic Theoretical Review on Convolutional Neural Networks, Deep Belief Networks and Stacked Auto-Encoders

MSc. Rômulo Fernandes da Costa, MSc. Sarasuaty Megume Hayashi Yelisetty, Dr. Johnny Cardoso Marques, Dr. Paulo Marcelo Tasinaffo

Rômulo Fernandes da Costa, Graduate Program in Electronic Engineering and Computer Science, ITA, São José dos Campos, Brazil.
Sarasuaty Megume Hayashi Yelisetty, Graduate Program in Electronic Engineering and Computer Science, ITA, São José dos Campos, Brazil.
Johnny Cardoso Marques, Computer Science Division, ITA, São José dos Campos, Brazil.
Paulo Marcelo Tasinaffo, Computer Science Division, ITA, São José dos Campos, Brazil.
This work was funded by the Brazilian National Council for Scientific and Technological Development (CNPq), in the form of funding for the first author.
Abstract—This paper presents a brief theoretical review of deep neural network architectures, deep learning procedures, as well as some of their possible applications. The paper focuses on the most common network structures: the Convolutional Neural Network (CNN), the Deep Belief Network (DBN) and Stacked Auto-encoders (SA). The building blocks which enabled the construction of deeper networks, such as the Rectified Linear Unit (ReLU) and softmax activation functions, convolution filters, restricted Boltzmann machines and autoencoders, are explained in the beginning and middle sections of the paper. A few examples of hybrid systems are also presented in the last sections of the paper. The paper concludes with some considerations on the state-of-the-art work and on the possible future applications enabled by deep neural networks.
Index Terms—Autoencoder, Boltzmann Machine, Convolutional Neural Network, Deep Learning Review.
I. INTRODUCTION
Neural network algorithms have been applied to a wide variety of complex tasks, in areas such as computer vision, speech recognition, text translation, and system identification and control, among others.
The greatest advantage of these algorithms lies in their ability to learn from a set of examples, without the need to define explicit rules for a given task. After learning how to solve a given problem, an artificial neural network generally performs at the same level as, or better than, a rule-based algorithm for the same task, especially for highly abstract problems such as those in computer vision.
While neural networks have been shown to be theoretically able to represent any nonlinear function [1], in practice they were limited in depth and by long training times. What allowed neural networks to achieve the high level of performance seen today was the development, over the past decade, of a series of techniques for training deep networks. This set of techniques is what is now known as deep learning.
This paper presents a brief theoretical review of deep neural network structures and training procedures, and enumerates some of their possible applications. The paper focuses on presenting a general description of the inner workings of the most common deep architectures, namely Deep Belief Networks (DBN), Stacked Autoencoders (SA) and Convolutional Neural Networks (CNN).
In terms of structure, these three topologies can be decomposed into fundamental blocks, such as the ReLU and
softmax activation functions, convolution filters, restricted
Boltzmann machines and autoencoders. These blocks, along
with the associated architectures, are described in the middle
sections of the paper.
A few examples of hybrid systems are also presented in later sections of the paper. The paper concludes with some considerations on the state-of-the-art work and on the possible future applications enabled by deep neural networks.
II. BASIC CONCEPTS
A. Artificial Neuron Structure
An Artificial Neural Network (ANN) is a parallel computational structure loosely inspired by real neural networks, capable of learning from large amounts of data. The network is trained to generate a set of outputs from the inputs present in the training data. Thus, an ANN can act as a universal approximator of nonlinear functions [1].
These networks are composed of several small units, called neurons or nodes, grouped in multiple sequential layers. Each neuron in a layer receives signals from neurons located in other layers or from the network's input itself. The neuron then responds by emitting a signal of its own, propagating the information forward to the next layers in the network.
The output signal $y_n$ fired by a neuron in response to an input vector $\mathbf{x}_n$ is described by:

$$y_n = f(\mathbf{w}_n^T \mathbf{x}_n + b_n) \qquad (1)$$

Here, $\mathbf{w}_n$ and $b_n$ are the connection weight vector and the activation bias, respectively. The mathematical function $f(\cdot)$ is a nonlinear function called the "activation function" and describes the response of the neuron to its collective input.
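To make (1) concrete, the short sketch below computes the response of a single neuron as a weighted sum followed by a nonlinearity. It is a minimal NumPy illustration rather than code from any particular library; the function names, the choice of a sigmoid activation and the numerical values of the weights and bias are purely illustrative.

import numpy as np

def sigmoid(v):
    # Sigmoid activation: squashes the weighted input into the range (0, 1).
    return 1.0 / (1.0 + np.exp(-v))

def neuron_output(x, w, b, activation=sigmoid):
    # Equation (1): y_n = f(w_n^T x_n + b_n)
    return activation(np.dot(w, x) + b)

# Example with three inputs and arbitrary illustrative parameters.
x = np.array([0.5, -1.0, 2.0])   # input vector x_n
w = np.array([0.8, 0.1, -0.4])   # connection weight vector w_n
b = 0.2                          # activation bias b_n
print(neuron_output(x, w, b))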
Historically, $f(\cdot)$ used to be a simple threshold function (such as in the original perceptron [2]) or a sigmoid function, but with the popularization of deeper networks, less computationally expensive options such as the Rectified Linear Unit (ReLU) started to be employed. Fig. 1 shows a plot of some of the commonly used activation functions.
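For reference, the activation functions discussed in this paper can each be written in a few lines. The sketch below gives minimal NumPy versions of the ReLU, sigmoid and softmax functions; the function names and the example input vector are illustrative only.

import numpy as np

def relu(v):
    # ReLU: zero for negative inputs, identity for non-negative inputs.
    return np.maximum(0.0, v)

def sigmoid(v):
    # Sigmoid: smooth squashing of the input into (0, 1).
    return 1.0 / (1.0 + np.exp(-v))

def softmax(v):
    # Softmax: converts a vector of scores into a probability distribution.
    # Subtracting the maximum score improves numerical stability.
    e = np.exp(v - np.max(v))
    return e / np.sum(e)

scores = np.array([-1.0, 0.0, 2.0])
print(relu(scores))     # [0. 0. 2.]
print(sigmoid(scores))  # values in (0, 1)
print(softmax(scores))  # values summing to 1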