卷积神经网络详解：特征提取与自动不变性

4星 · 超过85%的资源需积分: 10 94 浏览量更新于2024-09-15 收藏 140KB PDF 举报

卷积神经网络（Convolutional Neural Networks, CNNs）是一种深度学习架构，自20世纪80年代以来在图像处理、计算机视觉等领域取得了显著的成功。本文档由Jake Bouvrie撰写，他是麻省理工学院脑与认知科学系生物与计算学习中心的研究员。文章旨在介绍卷积神经网络的基本原理、设计和应用，特别关注其独特的结构和功能。首先，作者介绍了传统的全连接神经网络中的反向传播算法，这是一种用于训练多层神经网络的标准优化方法，通过梯度下降更新权重以最小化损失函数。然而，卷积神经网络引入了额外的层次，包括滤波器（filters）和下采样（subsampling），这使得网络能够实现特征提取和数据驱动的学习。滤波器相当于一组预先设定的模板，用于识别输入数据中的特定模式，如边缘、纹理或局部特征。这种结构不仅减少了参数数量，因为每个位置只用一个滤波器进行计算，而且提供了某种程度的平移不变性，即对输入图像中的物体位置变化具有稳健性。 CNN的设计包含以下几个关键步骤： 1. **滤波器和卷积操作**：网络通过将输入数据与多个滤波器进行卷积运算，得到特征图（feature maps）。这个过程利用了局部连接和权值共享，提高了计算效率，并减少了过拟合的风险。 2. **激活函数**：卷积层后通常会采用非线性激活函数，如ReLU（Rectified Linear Unit），以增加模型的表达能力。 3. **池化层**：下采样（pooling）层用于减小特征图的空间尺寸，同时保留重要的特征，进一步减少计算量并提高模型的尺度不变性。 4. **全连接层**：经过一系列卷积和池化层后，输出被展平并通过全连接层进行分类或回归任务，这与传统的神经网络结构相似。 5. **反向传播**：尽管网络结构不同于常规神经网络，但反向传播算法仍被用于计算梯度并更新网络参数，以最小化损失函数。 6. **扩展和创新**：文章提到，虽然本文的讨论主要针对二维数据和卷积，但其实现方法可以扩展到任意维度，这展示了CNN的灵活性。通过学习和应用卷积神经网络，我们可以处理高维图像数据，实现诸如图像分类、目标检测、图像分割等复杂任务，且在诸如自然语言处理和语音识别等领域也有潜在的应用。卷积神经网络是现代深度学习中不可或缺的一部分，其独特的优势和强大的性能使其在众多领域展现了显著的优势。

Notes on Convolutional Neural Networks

Jake Bouvrie

Center for Biological and Computational Learning

Department of Brain and Cognitive Sciences

Massachusetts Institute of Technology

Cambridge, MA 02139

jvb@mit.edu

November 22, 2006

1 Introduction

This document discusses the derivation and implementation of convolutional neural networks

(CNNs) [3, 4], followed by a few straightforward extensions. Convolutional neural networks in-

volve many more connections than weights; the architecture itself realizes a form of regularization.

In addition, a convolutional network automatically provides some degree of translation invariance.

This particular kind of neural network assumes that we wish to learn ﬁlters, in a data-driven fash-

ion, as a means to extract features describing the inputs. The derivation we present is speciﬁc to

two-dimensional data and convolutions, but can be extended without much additional effort to an

arbitrary number of dimensions.

We begin with a description of classical backpropagation in fully connected networks, followed by a

derivation of the backpropagation updates for the ﬁltering and subsampling layers in a 2D convolu-

tional neural network. Throughout the discussion, we emphasize efﬁciency of the implementation,

and give small snippets of MATLAB code to accompany the equations. The importance of writing

efﬁcient code when it comes to CNNs cannot be overstated. We then turn to the topic of learning

how to combine feature maps from previous layers automatically, and consider in particular, learning

sparse combinations of feature maps.

Disclaimer: This rough note could contain errors, exaggerations, and false claims.

2 Vanilla Back-propagation Through Fully Connected Networks

In typical convolutional neural networks you might ﬁnd in the literature, the early analysis consists of

alternating convolution and sub-sampling operations, while the last stage of the architecture consists

of a generic multi-layer network: the last few layers (closest to the outputs) will be fully connected

1-dimensional layers. When you’re ready to pass the ﬁnal 2D feature maps as inputs to the fully

connected 1-D network, it is often convenient to just concatenate all the features present in all the

output maps into one long input vector, and we’re back to vanilla backpropagation. The standard

backprop algorithm will be described before going onto specializing the algorithm to the case of

convolutional networks (see e.g. [1] for more details).

2.1 Feedforward Pass

In the derivation that follows, we will consider the squared-error loss function. For a multiclass

problem with c classes and N training examples, this error is given by

n=1

k=1

− y

)

Here t

is the k-th dimension of the n-th pattern’s corresponding target (label), and y

is similarly

the value of the k-th output layer unit in response to the n-th input pattern. For multiclass classiﬁ-

cation problems, the targets will typically be organized as a “one-of-c” code where the k-th element

下载后可阅读完整内容，剩余7页未读，立即下载

Bomber

粉丝: 15
资源: 12

卷积神经网络详解：特征提取与自动不变性

卷积神经网络实现手写数字识别（纯numpy实现）--python手撕卷积神经网络代码

卷积神经网络pdf讲义超详细

基于python实现的CNN卷积神经网络手写数字识别项目源码+详细注释+数据集（毕业设计&期末大作业）

cnn卷积神经网络cnn卷积神经网络cnn卷积神经网络cnn卷积神经网络.txt

卷积神经网络（CNN）车牌识别卷积神经网络（CNN）车牌识别卷积神经网络（CNN）车牌识别卷积神经网络（CNN）车牌识别卷积神经

matlab卷积神经网络.zip_tracesrs_winterh53_卷积神经网络_卷积神经网络代码_神经网络

卷积神经网络.rar_卷积_卷积神经_卷积神经网络_卷积网络_神经网络 图片

卷积神经网络详述.zip_卷积神经·_卷积神经网络_卷积网络_神经网络结构

卷积神经网络.rar_卷积神经_卷积神经网络_卷积网络_权值更新_神经网络 目标

卷积神经网络与深度卷积神经网络

最新资源

卷积神经网络.rar_卷积_卷积神经_卷积神经网络_卷积网络_神经网络图片

卷积神经网络.rar_卷积神经_卷积神经网络_卷积网络_权值更新_神经网络目标