Deep Learning: The Adversarial Example Challenge and Strategies to Address It
Deep Learning: Analysis of and Solutions to the Sample Misclassification Problem

This article summarizes an influential paper published at the International Conference on Learning Representations (ICLR) 2015 by Ian Goodfellow, Jonathon Shlens, and Christian Szegedy of Google. The paper, "Explaining and Harnessing Adversarial Examples," addresses a striking failure mode of deep learning models, neural networks in particular: their classification accuracy collapses on deliberately crafted inputs, so-called "adversarial examples," produced by adding small but destructive perturbations to ordinary samples.

Earlier work attributed this phenomenon to nonlinearity and overfitting. The authors argue instead that the susceptibility of neural networks to adversarial perturbations stems primarily from their essentially linear behavior. They support this view with new quantitative results, which both explain why adversarial examples transfer across different network architectures and training sets and resolve what had been a puzzling phenomenon.

Their key finding is that this linearity makes networks sensitive to perturbations too small for a human to notice, and it also makes adversarial examples simple and fast to generate. Building on this insight, the paper proposes adversarial training: injecting adversarial examples into the training process to improve the model's robustness and reduce its error rate on the test set.

The paper dissects a central problem in deep learning and supplies both a theoretical foundation and practical strategies for defending models against adversarial attacks, making it important for understanding and improving the robustness of deep learning systems. By understanding the linear weakness of neural networks, researchers can design more robust models to cope with the kinds of perturbations encountered in the real world.
$$\underbrace{\;x\;}_{\substack{\text{“panda”}\\ \text{57.7\% confidence}}} \;+\; .007 \times \underbrace{\operatorname{sign}\left(\nabla_x J(\theta, x, y)\right)}_{\substack{\text{“nematode”}\\ \text{8.2\% confidence}}} \;=\; \underbrace{\;x + \epsilon\,\operatorname{sign}\left(\nabla_x J(\theta, x, y)\right)\;}_{\substack{\text{“gibbon”}\\ \text{99.3\% confidence}}}$$
Figure 1: A demonstration of fast adversarial example generation applied to GoogLeNet (Szegedy
et al., 2014a) on ImageNet. By adding an imperceptibly small vector whose elements are equal to
the sign of the elements of the gradient of the cost function with respect to the input, we can change
GoogLeNet’s classification of the image. Here our ε of .007 corresponds to the magnitude of the
smallest bit of an 8 bit image encoding after GoogLeNet’s conversion to real numbers.
Let θ be the parameters of a model, x the input to the model, y the targets associated with x (for
machine learning tasks that have targets) and J(θ, x, y) be the cost used to train the neural network.
We can linearize the cost function around the current value of θ, obtaining an optimal max-norm constrained perturbation of

$$\eta = \epsilon\,\operatorname{sign}\left(\nabla_x J(\theta, x, y)\right).$$
We refer to this as the “fast gradient sign method” of generating adversarial examples. Note that the
required gradient can be computed efficiently using backpropagation.
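As a concrete illustration, here is a minimal sketch of the fast gradient sign method in PyTorch. This is our addition, not the paper's code: `model` and `loss_fn` are placeholders for any differentiable classifier and its training cost.

```python
import torch

def fgsm_example(model, loss_fn, x, y, epsilon):
    """Return x + epsilon * sign(grad_x J(theta, x, y)).

    A sketch, assuming `model` maps inputs to logits and `loss_fn`
    is the training cost J; both names are placeholders.
    """
    x = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x), y)       # J(theta, x, y)
    loss.backward()                   # one backprop pass yields grad_x J
    eta = epsilon * x.grad.sign()     # max-norm constrained perturbation
    return (x + eta).detach()
```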
We find that this method reliably causes a wide variety of models to misclassify their input. See Fig. 1 for a demonstration on ImageNet. We find that using ε = .25, we cause a shallow softmax classifier to have an error rate of 99.9% with an average confidence of 79.3% on the MNIST (LeCun et al., 1998) test set [1]. In the same setting, a maxout network misclassifies 89.4% of our adversarial examples with an average confidence of 97.6%. Similarly, using ε = .1, we obtain an error rate of 87.15% and an average probability of 96.6% assigned to the incorrect labels when using a convolutional maxout network on a preprocessed version of the CIFAR-10 (Krizhevsky & Hinton, 2009) test set [2]. Other simple methods of generating adversarial examples are possible. For example, we also found that rotating x by a small angle in the direction of the gradient reliably produces adversarial examples.

The fact that these simple, cheap algorithms are able to generate misclassified examples serves as evidence in favor of our interpretation of adversarial examples as a result of linearity. The algorithms are also useful as a way of speeding up adversarial training or even just analysis of trained networks.
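As one concrete pattern, adversarial training with fast-gradient-sign examples crafted on the fly costs roughly one extra forward/backward pass per batch. A minimal sketch, reusing the hypothetical `fgsm_example` helper above with assumed `loader` and `optimizer` objects; weighting the clean and adversarial losses equally is just one simple choice:

```python
# Sketch of adversarial training with on-the-fly fast-gradient-sign
# examples; all names are assumptions carried over from the sketch above.
for x, y in loader:
    x_adv = fgsm_example(model, loss_fn, x, y, epsilon=0.25)
    optimizer.zero_grad()
    loss = 0.5 * loss_fn(model(x), y) + 0.5 * loss_fn(model(x_adv), y)
    loss.backward()
    optimizer.step()
```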
5 ADVERSARIAL TRAINING OF LINEAR MODELS VERSUS WEIGHT DECAY
Perhaps the simplest possible model we can consider is logistic regression. In this case, the fast
gradient sign method is exact. We can use this case to gain some intuition for how adversarial
examples are generated in a simple setting. See Fig. 2 for instructive images.
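A quick numerical check of that exactness claim (our sketch with illustrative values, not the paper's code; notation as introduced just below): for logistic regression the gradient sign collapses to −y sign(w), so the fast method recovers the analytic worst-case direction regardless of x.

```python
import numpy as np

rng = np.random.default_rng(0)
w, b = rng.normal(size=5), 0.3
x, y = rng.normal(size=5), -1          # labels y in {-1, +1}

# J = softplus(-y (w.x + b)); hence grad_x J = -y w sigmoid(-y (w.x + b)).
z = -y * (w @ x + b)
grad_x = -y * w / (1.0 + np.exp(-z))   # sigmoid(z) = 1 / (1 + e^{-z})

# sigmoid(z) > 0, so sign(grad_x J) = -y sign(w): the fast gradient sign
# direction is exactly the worst case under a max-norm constraint.
assert np.array_equal(np.sign(grad_x), -y * np.sign(w))
```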
If we train a single model to recognize labels y ∈ {−1, 1} with

$$P(y = 1) = \sigma\left(w^\top x + b\right)$$

where σ(z) is the logistic sigmoid function, then training consists of gradient descent on

$$\mathbb{E}_{x, y \sim p_{\text{data}}}\, \zeta\left(-y\left(w^\top x + b\right)\right)$$

where ζ(z) = log(1 + exp(z)) is the softplus function. We can derive a simple analytical form for training on the worst-case adversarial perturbation of x rather than x itself, based on gradient sign
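Carrying that linearization through by hand (a derivation sketch from the setup above, not a quotation of the paper's remaining pages): since sign(∇ₓJ) here is −y sign(w), the worst-case max-norm perturbation and its effect on the activation are

$$\eta^{\ast} = -\epsilon\, y\, \operatorname{sign}(w), \qquad w^\top \eta^{\ast} = -\epsilon\, y\, \lVert w \rVert_1,$$

and substituting x + η* into the softplus objective (using y² = 1) gives

$$\mathbb{E}_{x, y \sim p_{\text{data}}}\, \zeta\left( \epsilon \lVert w \rVert_1 - y\left(w^\top x + b\right) \right),$$

a form reminiscent of an L¹ penalty, though it appears inside the activation rather than being added to the training cost.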
[1] This is using MNIST pixel values in the interval [0, 1]. MNIST data does contain values other than 0 or 1, but the images are essentially binary. Each pixel roughly encodes “ink” or “no ink”. This justifies expecting the classifier to be able to handle perturbations within a range of width 0.5, and indeed human observers can read such images without difficulty.

[2] See https://github.com/lisa-lab/pylearn2/tree/master/pylearn2/scripts/papers/maxout for the preprocessing code, which yields a standard deviation of roughly 0.5.