Fig. 4. (a): Malicious and usually imperceptible perturbations present in an input image can induce trained models to misclassification. Adapted from Klarreich [93]. (b): The objective of an adversarial attack is to generate a perturbation δx and insert it into a legitimate image x in order to make the resulting adversarial image x′ = x + δx cross the decision boundary. Adapted from Bakhti et al. [8].
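As a concrete illustration of the additive formulation x′ = x + δx in Fig. 4(b), the sketch below crafts a perturbation with the Fast Gradient Sign Method (FGSM) of Goodfellow et al. This is a minimal PyTorch example, not the procedure of any specific work surveyed here; the pretrained model and the budget ε are assumptions chosen for illustration.

```python
import torch
import torch.nn.functional as F
from torchvision import models

# Minimal FGSM sketch: delta_x = epsilon * sign(grad_x L(f(x), y)).
# The pretrained model and epsilon below are illustrative assumptions.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()

def fgsm(x, y, epsilon=0.03):
    """Return the adversarial image x' = x + delta_x."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    delta_x = epsilon * x.grad.sign()        # the perturbation
    return (x + delta_x).clamp(0.0, 1.0).detach()  # keep a valid image

# Usage: a random "image" stands in for a legitimate input x.
x = torch.rand(1, 3, 224, 224)
y = model(x).argmax(dim=1)                   # model's current label
x_adv = fgsm(x, y)
print("prediction changed:", (model(x_adv).argmax(dim=1) != y).item())
```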
3.2 Taxonomy of Attacks and Attackers
This section is also based on the concepts and definitions of the works of Akhtar and Mian, Barreno et al., Brendel et al., Kumar and Mehta, Xiao, and Yuan et al. [2, 10, 15, 97, 190, 198] to extend⁵ existing taxonomies which organize attacks and attackers. In the context of security, adversarial attacks and attackers are categorized under threat models. A threat model defines the conditions under which a defense is designed to provide security guarantees against certain types of attacks and attackers [19]. Basically, a threat model delimits (i) the knowledge an attacker has about the targeted classifier (such as its parameters and architecture), (ii) his goal with the adversarial attack and (iii) how he will perform the adversarial attack. A threat model can then be classified into six different axes: (i) attacker's influence, (ii) attacker's knowledge, (iii) security violation, (iv) attack specificity, (v) attack computation and (vi) attack approach.
3.2.1 Aacker’s Influence. This axis denes how the attacker will control the learning process
of deep learning models. According to Xiao [
190
], the attacker can perform two types of attack,
taking into account his inuence on the classication model: (i) causative or poisoning attacks and
(ii) evasive or exploratory attacks.
• Causative or poisoning attacks: in causative attacks, the attacker has influence on the deep learning model during its training stage. In this type of attack, the training samples are corrupted or the training set is polluted with adversarial examples in order to produce a classification model incompatible with the original data distribution (see the label-flipping sketch after this list);
• Evasive or exploratory attacks: in contrast to causative attacks, in evasive attacks the attacker has influence on the deep learning model during the inference or testing stage. Evasive attacks are the most common type of attack, where the attacker crafts adversarial examples that lead deep learning models to misclassification, usually with a high confidence on the prediction. Evasive attacks can also have an exploratory nature, where the attacker's objective is to gather information about the targeted model, such as its parameters, architecture, cost function, etc. The most common exploratory attack is the input/output attack, where the attacker provides the targeted model with adversarial images crafted by him. Afterwards, the attacker observes the outputs given by the model and tries to reproduce a substitute or surrogate model, so that it mimics the behavior of the targeted model (see the surrogate-training sketch after this list).
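The sketch below illustrates the causative setting with the simplest form of training-set pollution, label flipping. The dataset shape, number of classes, and flip ratio are illustrative assumptions, not a method advanced by the surveyed works.

```python
import torch

# Causative/poisoning sketch: flip a fraction of training labels so the
# model learned on them drifts from the original data distribution.
# Dataset size, number of classes, and flip ratio are assumptions.
def poison_labels(labels, num_classes=10, flip_ratio=0.2, seed=0):
    """Return a copy of `labels` with `flip_ratio` of them flipped."""
    g = torch.Generator().manual_seed(seed)
    labels = labels.clone()
    n_poison = int(flip_ratio * len(labels))
    idx = torch.randperm(len(labels), generator=g)[:n_poison]
    # Shift each chosen label by 1..num_classes-1, guaranteeing a wrong class.
    offset = torch.randint(1, num_classes, (n_poison,), generator=g)
    labels[idx] = (labels[idx] + offset) % num_classes
    return labels

# Usage: a toy training set; the victim would train on `y_poisoned`.
y_clean = torch.randint(0, 10, (1000,))
y_poisoned = poison_labels(y_clean)
print("labels flipped:", (y_clean != y_poisoned).sum().item())
```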
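To make the input/output attack concrete, the following sketch queries a black-box target with probe images and fits a substitute network on the observed labels, in the spirit of substitute-model attacks. Both architectures, the query budget, and the training schedule are assumptions for illustration only.

```python
import torch
import torch.nn as nn

# Exploratory input/output sketch: query a black-box target, then train a
# surrogate on the (input, observed label) pairs. The toy networks below
# stand in for the victim and the attacker's substitute.
target = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10)).eval()

def query_target(x):
    """Black-box access: only the predicted labels are observable."""
    with torch.no_grad():
        return target(x).argmax(dim=1)

surrogate = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 64),
                          nn.ReLU(), nn.Linear(64, 10))
opt = torch.optim.Adam(surrogate.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

queries = torch.rand(512, 1, 28, 28)   # attacker-chosen probe images
labels = query_target(queries)          # observed target outputs

for _ in range(100):                    # fit the surrogate to mimic the target
    opt.zero_grad()
    loss = loss_fn(surrogate(queries), labels)
    loss.backward()
    opt.step()

agreement = (surrogate(queries).argmax(dim=1) == labels).float().mean().item()
print(f"surrogate agrees with target on {agreement:.0%} of the queries")
```

Once the surrogate is trained, adversarial examples crafted against it (e.g., with the FGSM sketch above) can often be transferred to the original black-box target.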
⁵ Again, the novel topics proposed by this paper are highlighted by underlined font.