2.6.2. Restricted Boltzmann Machines (RBMs) and Deep Belief Networks (DBNs)
RBMs (Hinton, 2010) are a type of Markov Random Field (MRF), consisting of an input or visible layer x = (x_1, x_2, . . . , x_N) and a hidden layer h = (h_1, h_2, . . . , h_M) that carries the latent feature represen-
tation. The connections between the nodes are bi-
directional, so given an input vector x one can obtain
the latent feature representation h and also vice versa.
As such, the RBM is a generative model, and we can
sample from it and generate new data points. In anal-
ogy to physical systems, an energy function is defined
for a particular state (x, h) of input and hidden units:
E(x, h) = −h^T W x − c^T x − b^T h,    (9)
where c and b are bias terms. The probability of the ‘state’ of
the system is defined by passing the energy to an expo-
nential and normalizing:
p(x, h) = (1/Z) exp{−E(x, h)}.    (10)
Computing the partition function Z is generally in-
tractable. However, conditional inference in the form of
computing h conditioned on x, or vice versa, is tractable
and results in a simple formula:
P(h_j | x) = 1 / (1 + exp{−b_j − W_j x}).    (11)
Since the network is symmetric, a similar expression holds for P(x_i | h).
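To make these conditionals concrete, the following NumPy sketch (using hypothetical dimensions and randomly initialised parameters, not taken from any cited implementation) evaluates Eq. (11) and its symmetric counterpart, and uses them for a single bidirectional Gibbs-sampling step:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical dimensions: N visible units, M hidden units.
rng = np.random.default_rng(0)
N, M = 6, 4
W = rng.normal(scale=0.1, size=(M, N))  # weights, one row W_j per hidden unit
b = np.zeros(M)                         # hidden biases
c = np.zeros(N)                         # visible biases

def p_h_given_x(x):
    # Eq. (11): P(h_j | x) = sigmoid(b_j + W_j x)
    return sigmoid(b + W @ x)

def p_x_given_h(h):
    # Symmetric expression for the visible units: P(x_i | h) = sigmoid(c_i + (W^T)_i h)
    return sigmoid(c + W.T @ h)

# One Gibbs step x -> h -> x', i.e. sampling a new data point from the model.
x = rng.integers(0, 2, size=N).astype(float)
h = (rng.random(M) < p_h_given_x(x)).astype(float)
x_new = (rng.random(N) < p_x_given_h(h)).astype(float)
print(x, h, x_new)
```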
DBNs (Bengio et al., 2007; Hinton et al., 2006) are
essentially SAEs where the AE layers are replaced by
RBMs. Training of the individual layers is, again, done
in an unsupervised manner. Final fine-tuning is per-
formed by adding a linear classifier to the top layer of
the DBN and performing a supervised optimization.
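As a rough sketch of this layer-wise scheme (and only a sketch: the scikit-learn pipeline below trains each RBM greedily and then the classifier, but does not fine-tune the RBM layers during the supervised step, unlike a full DBN), two BernoulliRBM layers are stacked with a logistic-regression classifier on top, using made-up toy data:

```python
import numpy as np
from sklearn.neural_network import BernoulliRBM
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Made-up toy data: 200 binary feature vectors with binary labels.
rng = np.random.default_rng(0)
X = (rng.random((200, 64)) > 0.5).astype(float)
y = rng.integers(0, 2, size=200)

dbn_like = Pipeline([
    # Each RBM is trained unsupervised on the output of the previous layer.
    ("rbm1", BernoulliRBM(n_components=32, learning_rate=0.05, n_iter=20, random_state=0)),
    ("rbm2", BernoulliRBM(n_components=16, learning_rate=0.05, n_iter=20, random_state=0)),
    # Linear classifier trained on the top-level latent representation.
    ("clf", LogisticRegression(max_iter=1000)),
])
dbn_like.fit(X, y)
print(dbn_like.score(X, y))
```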
2.6.3. Variational Auto-Encoders and Generative Adversarial Networks
Recently, two novel unsupervised architectures
were introduced: the variational auto-encoder (VAE)
(Kingma and Welling, 2013) and the generative adver-
sarial network (GAN) (Goodfellow et al., 2014). There
are no peer-reviewed papers applying these methods to
medical images yet, but applications in natural images
are promising. We will elaborate on their potential in
the discussion.
2.7. Hardware and Software
One of the main contributors to the steep rise of deep learning has been the widespread availability of GPUs and GPU-computing libraries (CUDA, OpenCL). GPUs
are highly parallel computing engines, which have an
order of magnitude more execution threads than central
processing units (CPUs). With current hardware, deep
learning on GPUs is typically 10 to 30 times faster than
on CPUs.
Next to hardware, the other driving force behind the
popularity of deep learning methods is the wide avail-
ability of open source software packages. These li-
braries provide efficient GPU implementations of im-
portant operations in neural networks, such as convo-
lutions, allowing the user to implement ideas at a high level rather than worrying about efficient low-level implementations. At the time of writing, the most popular
packages were (in alphabetical order):
• Caffe (Jia et al., 2014). Provides C++ and Python
interfaces, developed by graduate students at UC
Berkeley.
• Tensorflow (Abadi et al., 2016). Provides C++ and Python interfaces, developed by Google and used by Google Research.
• Theano (Bastien et al., 2012). Provides a Python
interface, developed by the MILA lab in Montreal.
• Torch (Collobert et al., 2011). Provides a Lua in-
terface and is used by, among others, Facebook AI
research.
There are third-party packages written on top of one or
more of these frameworks, such as Lasagne (https://github.com/Lasagne/Lasagne) or Keras (https://keras.io/). It goes beyond the scope of this paper to discuss all these packages in detail.
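As a simple illustration of the high-level style these packages enable, the following Keras sketch (a hypothetical toy model, not drawn from any of the cited works) defines a small convolutional classifier in a few lines, leaving the efficient GPU implementations of convolution and pooling to the backend:

```python
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense

# Toy convolutional classifier for 64x64 single-channel images
# with a two-class softmax output.
model = Sequential([
    Conv2D(32, (3, 3), activation="relu", input_shape=(64, 64, 1)),
    MaxPooling2D((2, 2)),
    Conv2D(64, (3, 3), activation="relu"),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(2, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```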
3. Deep Learning Uses in Medical Imaging
3.1. Classification
3.1.1. Image/exam classification
Image or exam classification was one of the first ar-
eas in which deep learning made a major contribution
to medical image analysis. In exam classification one
typically has one or multiple images (an exam) as in-
put with a single diagnostic variable as output (e.g.,
disease present or not). In such a setting, every diag-
nostic exam is a sample and dataset sizes are typically