understand the contribution of different features to the model prediction, or by analytically determining the
contribution of different features to the model prediction. On the other hand, intrinsic models, also known as
in-model approaches or inherently interpretable models, are self-explainable since they are designed to produce
human-understandable representations from the internal model features.
3.2 Classical XAI Methods
The first attempts to explain deep learning models relied on post-hoc analysis of the models. In spite of the
criticism to which post-hoc approaches have recently been subjected [105], they are still used in many domains
of medical imaging, and understanding them is important to explain the advances on the topic of interpretable
deep learning. As such, the following sections briefly describe the most popular XAI algorithms according to the
two major categories of post-hoc analysis.
3.2.1 Perturbation-based methods. The rationale behind perturbation-based methods is to observe how a
perturbation of the input affects the model's prediction. Among the most representative perturbation-based
methods are LIME [101], SHAP [77], and an occlusion-based method [151].
LIME. LIME [101] stands for Local Interpretable Model-agnostic Explanations. As the name suggests, it can
explain any black-box model and, according to the XAI taxonomy, it is a post-hoc, model-agnostic method providing
local explanations. The intuition behind LIME is to approximate the complex (black-box) model locally
with an interpretable model, usually denoted as the local surrogate model. Thus, an individual instance is explained
locally by fitting a simple interpretable model, such as a linear model or a decision tree, around the prediction. Figure 2
a) provides an intuitive illustration of the overall functioning of LIME.
In order to approximate the model prediction locally, a new dataset consisting of perturbed samples, weighted
by their proximity to the instance being explained, is used to fit the interpretable model. The labels for those
perturbed samples are obtained from the complex model. In the case of tabular data, the perturbed instances
are sampled around the instance being explained by randomly changing the feature values, so as to obtain
samples both in the vicinity of and far away from the instance being explained. Analogously, when LIME is applied
to an image classification problem, the image being explained is first segmented into superpixels, which are
groups of pixels in the image sharing common characteristics, such as colour and intensity. Then, the perturbed
versions of the original data are obtained by randomly masking out a subset of superpixels, resulting in an
image with occluded patches. The new dataset used to fit the interpretable model consists of these perturbed versions
of the image being explained, and the superpixels with the highest positive coefficients in the interpretable
model are those that contributed most to the prediction. Thus, they are selected as part of the interpretable
representation, which is simply a binary vector indicating the presence or absence of those superpixels.
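The following sketch illustrates this procedure for an RGB image, written from scratch rather than with the official LIME implementation; the black-box predict_proba function, the input image, and the segmentation and kernel settings are illustrative assumptions, not prescribed by the original method.

import numpy as np
from skimage.segmentation import slic
from sklearn.linear_model import Ridge

def lime_image_explanation(image, predict_proba, target_class,
                           n_segments=50, n_samples=1000, kernel_width=0.25):
    """Sketch of LIME for one image of shape (H, W, 3)."""
    # 1. Segment the image into superpixels (the interpretable components).
    segments = slic(image, n_segments=n_segments)
    sp_labels = np.unique(segments)
    n_sp = len(sp_labels)

    # 2. Draw random binary vectors: 1 keeps a superpixel, 0 occludes it.
    z = np.random.randint(0, 2, size=(n_samples, n_sp))
    z[0, :] = 1  # the first sample is the unperturbed image

    # 3. Build the perturbed images and label them with the black-box model.
    y = np.empty(n_samples)
    for k, row in enumerate(z):
        img = image.copy()
        for idx in np.where(row == 0)[0]:
            img[segments == sp_labels[idx]] = 0  # occlude this superpixel
        y[k] = predict_proba(img)[target_class]

    # 4. Weight each sample by its proximity to the original instance
    #    (the all-ones binary vector), using an exponential kernel.
    distances = np.sqrt(((z - 1) ** 2).sum(axis=1)) / n_sp
    weights = np.exp(-(distances ** 2) / kernel_width ** 2)

    # 5. Fit the local surrogate; its coefficients score each superpixel.
    surrogate = Ridge(alpha=1.0)
    surrogate.fit(z, y, sample_weight=weights)
    return segments, surrogate.coef_  # largest coefficients = most influential

The superpixels with the largest positive coefficients then form the binary interpretable representation described above.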
SHAP. SHAP [77] was inspired by the Shapley values from cooperative game theory [113] and operates
by determining the average contribution of a feature value to the model prediction using all combinations in the
powerset of features. As an example, given the task of predicting the risk of stroke based on age, gender and Body
Mass Index (BMI), the SHAP explanations for a particular prediction are given in terms of the contribution of
each feature. This contribution is determined from the change observed in the model prediction across the 2ⁿ
combinations in the powerset of features, where the missing features are replaced by random values. Figure 2 b)
illustrates the above-described example. Similar to LIME, SHAP is a local model-agnostic interpretation method
that can be applied to both tabular and image data. In the case of tabular data, the explanation is given in the
form of an importance value for each feature. In the case of image data, it follows a procedure similar to that of LIME,
calculating the Shapley values for all possible combinations of superpixels. Several variants of the SHAP
method have been proposed to approximate the Shapley values more efficiently, namely KernelSHAP, DeepSHAP
and TreeSHAP [76].
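As a minimal sketch of the idea (not the SHAP library itself), the snippet below computes exact Shapley values for a single tabular prediction by enumerating all 2ⁿ coalitions; the linear risk model, the patient values and the background row used to fill in "missing" features are hypothetical placeholders for the stroke-risk example above.

from itertools import combinations
from math import factorial
import numpy as np

def shapley_values(predict, x, background):
    """Exact Shapley values for one prediction, enumerating every coalition."""
    n = len(x)
    phi = np.zeros(n)

    def value(coalition):
        # Features outside the coalition are replaced by background values,
        # standing in for the "missing feature" replacement described above.
        z = background.copy()
        idx = list(coalition)
        z[idx] = x[idx]
        return predict(z)

    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            for S in combinations(others, size):
                # Classical Shapley weight: |S|! (n - |S| - 1)! / n!
                w = factorial(size) * factorial(n - size - 1) / factorial(n)
                phi[i] += w * (value(S + (i,)) - value(S))
    return phi  # phi[i]: average marginal contribution of feature i

# Toy usage for the stroke-risk example (age, gender, BMI), with a
# hypothetical linear risk model standing in for the black-box model.
risk = lambda z: 0.01 * z[0] + 0.20 * z[1] + 0.02 * z[2]
x = np.array([70.0, 1.0, 31.0])           # patient being explained
background = np.array([50.0, 0.0, 25.0])  # reference ("average") patient
print(shapley_values(risk, x, background))

Because the exact computation scales exponentially with the number of features, the KernelSHAP, DeepSHAP and TreeSHAP variants mentioned above replace this exhaustive enumeration with more efficient approximations.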