generative adversarial network (WGAN) [24], and utilising a perceptual loss in addition to the
MSE loss. In the next section, we will describe the implementation of these methods.
2.2. Perceptual loss
The perceptual loss is based on high-level features extracted from a pre-trained network [25].
This encourages the network to reproduce image similarities more robustly than per-pixel losses alone. The perceptual loss is defined as the Euclidean distance between the feature representations of the enhanced image ($G_{\theta_G}(I_R)$) and the frame-averaged reference image ($I_{FA}$), given by a pre-trained VGG19 network [26]:
$$
L_{VGG/i,j} = \frac{1}{W_{i,j} H_{i,j}} \sum_{x=1}^{W_{i,j}} \sum_{y=1}^{H_{i,j}} \left( \phi_{i,j}(I_{FA})_{x,y} - \phi_{i,j}\big(G_{\theta_G}(I_R)\big)_{x,y} \right)^2 \tag{2}
$$
where $\phi_{i,j}$ indicates the feature map obtained by the $j$-th convolution, after ReLU activation, prior to the $i$-th pooling layer, and $W_{i,j}$ and $H_{i,j}$ describe the dimensions of the respective feature maps within the VGG network.
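To make Eq. (2) concrete, the following is a minimal PyTorch sketch of this loss using the torchvision VGG19 model. The specific layer slice (conv5_4 after ReLU, i.e. $\phi_{5,4}$) and the replication of single-channel B-scans to three channels are our assumptions; the paper leaves $i$ and $j$ generic at this point.

```python
import torch
import torch.nn.functional as F
from torchvision.models import vgg19

# A minimal sketch of the perceptual loss in Eq. (2), assuming PyTorch.
# The slice features[:36] (conv5_4 after ReLU, before the 5th pooling layer)
# is an assumed choice of phi_{i,j}; the paper does not fix i and j here.
class VGGPerceptualLoss(torch.nn.Module):
    def __init__(self, layer_index: int = 36):
        super().__init__()
        self.phi = vgg19(weights="IMAGENET1K_V1").features[:layer_index].eval()
        for p in self.phi.parameters():
            p.requires_grad = False  # the VGG network stays frozen

    def forward(self, enhanced: torch.Tensor, reference: torch.Tensor) -> torch.Tensor:
        # enhanced = G_{theta_G}(I_R), reference = I_FA, shape (N, 1, H, W);
        # grayscale B-scans are replicated to the 3 channels VGG expects.
        e3 = enhanced.repeat(1, 3, 1, 1)
        r3 = reference.repeat(1, 3, 1, 1)
        # Mean squared feature difference, i.e. the sum of squares in Eq. (2)
        # normalised by the feature-map size W_{i,j} * H_{i,j}.
        return F.mse_loss(self.phi(e3), self.phi(r3))
```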
2.3. Adversarial loss
Along with the generator network, a generative adversarial network (GAN) involves a discriminator
network, $D_{\theta_D}$, parametrised by $\theta_D$ (shown in Fig. 1). The generator network is trained to
produce realistic images, while the discriminator network is trained to identify which images are
real versus those that are generated. Here, we implement a WGAN, an improved version of the
original GAN, which uses the Earth Mover’s distance [27] to compare two data distributions (that
of $I_{FA}$ and $I_E$). We optimise both networks in an alternating manner (fixing one and updating
the other) to solve the following min-max problem:
$$
\min_{\theta_G} \max_{\theta_D} L_{WGAN}(D, G) = -\mathbb{E}_{I_{FA}}\big[D(I_{FA})\big] + \mathbb{E}_{I_R}\big[D(G(I_R))\big] + \lambda \, \mathbb{E}_{\hat{I}_{FA}}\Big[\big(\lVert \nabla_{\hat{I}_{FA}} D(\hat{I}_{FA}) \rVert_2 - 1\big)^2\Big], \tag{3}
$$
where the first two terms represent the estimation of the Wasserstein distance, and the final term is a gradient penalty [28] that enforces the Lipschitz constraint, with penalty coefficient $\lambda$; the gradient penalty has been shown to improve convergence of the WGAN compared to gradient clipping. $\hat{I}_{FA}$ is sampled uniformly along straight lines between pairs of $I_E$ and $I_{FA}$ samples. This results in improved stability during training. With this approach, our generator can learn to create solutions that are highly similar to real images and thus difficult for $D$ to classify.
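As an illustration, below is a minimal PyTorch sketch of the critic-side loss of Eq. (3), following the standard WGAN-GP formulation [28]; the function and variable names are placeholders rather than the paper's implementation.

```python
import torch

# A minimal sketch of the WGAN-GP critic loss from Eq. (3), assuming PyTorch;
# D, G, and tensor shapes (N, C, H, W) are placeholders.
def critic_loss(D, G, i_fa: torch.Tensor, i_r: torch.Tensor, lam: float = 10.0):
    """Loss minimised by the critic D (the max over theta_D in Eq. (3))."""
    i_e = G(i_r).detach()  # enhanced image I_E = G(I_R); G is fixed here

    # Wasserstein-distance estimate: -E[D(I_FA)] + E[D(G(I_R))]
    wasserstein = -D(i_fa).mean() + D(i_e).mean()

    # I_hat sampled uniformly along straight lines between I_FA / I_E pairs
    eps = torch.rand(i_fa.size(0), 1, 1, 1, device=i_fa.device)
    i_hat = (eps * i_fa + (1.0 - eps) * i_e).requires_grad_(True)

    # Gradient penalty (||grad_{I_hat} D(I_hat)||_2 - 1)^2, enforcing the
    # 1-Lipschitz constraint on D
    d_hat = D(i_hat)
    grads = torch.autograd.grad(d_hat, i_hat,
                                grad_outputs=torch.ones_like(d_hat),
                                create_graph=True)[0]
    penalty = ((grads.flatten(1).norm(2, dim=1) - 1.0) ** 2).mean()
    return wasserstein + lam * penalty
```

In a training loop, this critic loss and the generator loss are minimised in alternation, with one network held fixed while the other is updated, as described above.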
Thus, the overall loss of the CNN-WGAN architecture is given by:
$$
\min_{\theta_G} \max_{\theta_D} \; \lambda_1 L_{WGAN}(D, G) + \lambda_2 L_{VGG}(G) + L_{MSE}(G), \tag{4}
$$
where $\lambda_1$ and $\lambda_2$ are weighting parameters to control the trade-off between the three components of the loss.
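Read from the generator's side, Eq. (4) reduces to a weighted sum of the adversarial, perceptual, and per-pixel terms. A minimal sketch, assuming PyTorch, the two helpers above, and placeholder values for $\lambda_1$ and $\lambda_2$ (the paper's tuned weights are not reproduced here):

```python
import torch
import torch.nn.functional as F

# Sketch of the generator's side of Eq. (4); lam1 and lam2 are placeholder
# values, not the weights used in the paper.
def generator_loss(D, G, i_r, i_fa, perceptual_loss,
                   lam1: float = 1e-3, lam2: float = 1.0):
    i_e = G(i_r)                       # enhanced image I_E
    adv = -D(i_e).mean()               # generator's WGAN term: make I_E score as real
    vgg = perceptual_loss(i_e, i_fa)   # L_VGG from Eq. (2)
    mse = F.mse_loss(i_e, i_fa)        # per-pixel L_MSE
    return lam1 * adv + lam2 * vgg + mse
```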
3. Experiments
3.1. Data acquisition and pre-processing
Six OCT volumes were acquired from both eyes of 38 healthy patients on a Cirrus HD-OCT
Scanner (Zeiss, Dublin, CA) at a single visit. The scans were centred on the optic nerve head
(ONH) and were 200 × 200 × 1024 voxels per cube, acquired from a 6 mm × 6 mm × 2 mm region. These scans were then registered and averaged to create the “ground truth” denoised image. The
scan with the highest signal strength (as provided by the scanner software) was chosen as the