Dropout in Deep Learning: Reducing Overfitting to Improve Performance


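In machine learning, and in deep learning in particular, dropout is an important regularization technique for reducing overfitting. It was proposed by G. E. Hinton and colleagues in 2012. They observed that a large feedforward neural network trained on a small training set typically performs poorly on held-out data, that is, it overfits. Dropout addresses this with a randomized training strategy.

The basic idea is to randomly omit ("drop") a subset of units on every training iteration. For each training case, not all hidden units take part in the computation; each unit is instead kept active with a fixed probability. This retention probability is typically set between 0.5 and 0.8. With a value of 0.5, roughly half of the hidden units are switched off on each forward pass.

The goal is to force the network to learn robust, independent feature representations rather than relying on particular combinations of features. Dropout prevents excessive co-adaptation between units, where one unit is only useful in the presence of specific other units. Each unit must instead detect features that are generally helpful for producing the correct answer across the many internal contexts in which it has to operate. This randomness pushes the network toward a broader set of features and improves generalization.

Dropout gives significant improvements on many standard benchmark tasks, especially complex ones such as speech recognition and object recognition, where it set records at the time. Because it is simple and effective, it has become a standard component of deep learning models and is widely used in deep networks (convolutional networks, recurrent networks, and others) to limit overfitting and improve stability and performance. Dropout is enabled only during training. At test time all units are used, with their outgoing weights scaled by the retention probability (halved when the probability is 0.5) so that the expected input to the next layer matches what it received during training.

In short, dropout regularizes a model by randomly deactivating units during training, which improves its ability to generalize to new data.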
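The train/test behaviour described above can be illustrated with a minimal sketch. It is not code from the paper: the function name `dropout`, the `keep_prob` parameter, and the use of plain Python lists are assumptions made for illustration. Scaling the activations at test time, as done here, has the same effect on the next layer as scaling the outgoing weights.

```python
import random


def dropout(activations, keep_prob, rng, train=True):
    """Apply dropout to a list of hidden-unit activations (illustrative sketch)."""
    if not train:
        # Test time: keep every unit, but scale by keep_prob so the expected
        # input to the next layer matches what it received during training.
        return [a * keep_prob for a in activations]
    # Training: keep each unit independently with probability keep_prob,
    # and zero it out otherwise.
    return [a if rng.random() < keep_prob else 0.0 for a in activations]


rng = random.Random(0)          # fixed seed so the run is repeatable
hidden = [1.0] * 10000          # hypothetical activations, all equal to 1

train_out = dropout(hidden, keep_prob=0.5, rng=rng, train=True)
test_out = dropout(hidden, keep_prob=0.5, rng=rng, train=False)

# Fraction of units that survived the random mask during training.
kept_fraction = sum(1 for a in train_out if a != 0.0) / len(train_out)
```

With `keep_prob=0.5`, `kept_fraction` comes out close to one half, while every entry of `test_out` is exactly 0.5. Many modern libraries use the equivalent "inverted" variant, which divides by the retention probability during training and leaves test-time activations unscaled.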


Fig. 2: The frame classification error rate on the core test set of the TIMIT benchmark. Comparison of standard and dropout finetuning for different network architectures. Dropout of 50% of the hidden units and 20% of the input units improves classification.

layer and 185 “softmax” output units that are subsequently merged into the 39 distinct classes used for the benchmark. Dropout of 50% of the hidden units significantly improves classification for a variety of different network architectures (see figure 2). To get the frame recognition rate, the class probabilities that the neural network outputs for each frame are given to a decoder which knows about transition probabilities between HMM states and runs the Viterbi algorithm to infer the single best sequence of HMM states. Without dropout, the recognition rate is 22.7% and with dropout this improves to 19.7%, which is a record for methods that do not use any information about speaker identity.
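The decoding step mentioned above can be sketched as follows. This is a generic Viterbi implementation and not the decoder used in the paper. The function name `viterbi`, the two-state toy model, and the choice to use the per-frame class probabilities directly as emission scores are all assumptions made for illustration.

```python
import math


def viterbi(frame_probs, transition, initial):
    """Return the single most likely state sequence (illustrative sketch).

    frame_probs[t][s]: score the classifier assigns to state s at frame t
    transition[r][s]:  probability of moving from state r to state s
    initial[s]:        probability of starting in state s
    """
    def log(p):
        return math.log(p) if p > 0 else float("-inf")

    n_states = len(initial)
    # Best log score of any path ending in each state at frame 0.
    score = [log(initial[s]) + log(frame_probs[0][s]) for s in range(n_states)]
    back = []  # back[t-1][s] = best predecessor of state s at frame t

    for t in range(1, len(frame_probs)):
        new_score, pointers = [], []
        for s in range(n_states):
            best_prev = max(range(n_states),
                            key=lambda r: score[r] + log(transition[r][s]))
            pointers.append(best_prev)
            new_score.append(score[best_prev]
                             + log(transition[best_prev][s])
                             + log(frame_probs[t][s]))
        score = new_score
        back.append(pointers)

    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]


# Hypothetical toy model: two "sticky" states and three frames.
transition = [[0.9, 0.1], [0.1, 0.9]]
initial = [0.5, 0.5]
frame_probs = [[0.9, 0.1], [0.4, 0.6], [0.9, 0.1]]

best_path = viterbi(frame_probs, transition, initial)
frame_argmax = [max(range(2), key=lambda s: p[s]) for p in frame_probs]
```

In this toy example the frame-by-frame argmax is `[0, 1, 0]`, but the decoded `best_path` is `[0, 0, 0]`: the transition probabilities make a one-frame switch to state 1 and back less likely than staying in state 0.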

CIFAR-10 is a benchmark task for object recognition. It uses 32x32 downsampled color images of 10 different object classes that were found by searching the web for the names of the class (e.g. dog) or its subclasses (e.g. Golden Retriever). These images were labeled by hand to produce 50,000 training images and 10,000 test images in which there is a single dominant object that could plausibly be given the class name (9) (see figure 3). The best published error rate on the test set, without using transformed data, is 18.5% (10). We achieved an error rate of 16.6% by using a neural network with three convolutional hidden layers interleaved with three “max-pooling” layers that report the maximum activity in local pools of convolutional units. These six layers were followed by one locally-connected layer (for details see Appendix D). Using dropout in the last hidden layer gives an error rate of 15.6%.
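As a small illustration of what a max-pooling layer computes, the sketch below reports the maximum value in each local window of a single feature map. The function name `max_pool_2d`, the 2x2 window, and the stride of 2 are assumptions chosen for the example; the pooling sizes actually used in the network are given in the paper's Appendix D, which is not part of this excerpt.

```python
def max_pool_2d(feature_map, pool_size, stride):
    """Max-pool one 2D feature map given as a list of rows (illustrative sketch)."""
    rows, cols = len(feature_map), len(feature_map[0])
    pooled = []
    for r in range(0, rows - pool_size + 1, stride):
        pooled_row = []
        for c in range(0, cols - pool_size + 1, stride):
            # Collect the activities inside the local pool and keep the largest.
            window = [feature_map[r + i][c + j]
                      for i in range(pool_size)
                      for j in range(pool_size)]
            pooled_row.append(max(window))
        pooled.append(pooled_row)
    return pooled


# Hypothetical 4x4 map of convolutional unit activities.
feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 5],
    [0, 1, 7, 2],
    [3, 2, 1, 6],
]

pooled = max_pool_2d(feature_map, pool_size=2, stride=2)
# pooled == [[4, 5], [3, 7]]
```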

ImageNet is an extremely challenging object recognition dataset consisting of thousands of high-resolution images of thousands of classes of object (11). In 2010, a subset of 1000 classes with roughly 1000 examples per class was the basis of an object recognition competition in
