
To train and evaluate the performance of AnatomyNet,
we curated a dataset of 261 head and neck CT images
from a number of publicly available sources. We carried
out systematic experimental analyses on various compo-
nents of the network, and demonstrated their effective-
ness by comparing with other published methods. When
benchmarked on the test dataset from the MICCAI 2015
competition on HaN segmentation, the AnatomyNet out-
performed the state-of-the-art method by 3.3% in terms of the Dice similarity coefficient (DSC), averaged over nine anatomical structures.
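For reference, the DSC used in this comparison follows the standard definition, DSC = 2|A ∩ B| / (|A| + |B|). The sketch below is a generic illustration of this metric in Python, not the authors' evaluation script; the mask arrays in the usage comment are placeholders.

import numpy as np

def dice_coefficient(pred: np.ndarray, truth: np.ndarray) -> float:
    # Standard DSC between two binary masks: 2|A ∩ B| / (|A| + |B|).
    pred, truth = pred.astype(bool), truth.astype(bool)
    denom = pred.sum() + truth.sum()
    return 2.0 * np.logical_and(pred, truth).sum() / denom if denom > 0 else 1.0

# The benchmark score averages the per-structure DSC over the nine OARs, e.g.:
# mean_dsc = np.mean([dice_coefficient(p, t) for p, t in zip(pred_masks, true_masks)])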
The rest of the paper is organized as follows. Section II B describes the network structure and SE residual block of AnatomyNet. The design of the loss function for AnatomyNet is presented in Section II C. The handling of missing annotations is addressed in Section II D. Section III validates the effectiveness of the proposed network and its components. Discussion and limitations are presented in Section IV. We conclude the work in Section V.
II. MATERIALS AND METHODS
Next we describe our deep learning model to delin-
eate OARs from head and neck CT images. Our model
receives whole-volume HaN CT images of a patient as
input and outputs the 3D binary masks of all OARs at
once. The dimension of a typical HaN CT is around
178 × 512 × 512, but the sizes can vary across differ-
ent patients because of image cropping and different set-
tings. In this work, we focus on segmenting the nine OARs most relevant to head and neck cancer radiation therapy: brain stem, chiasm, mandible, optic nerve left, optic nerve right, parotid gland left, parotid gland right, submandibular gland left, and submandibular gland right. Therefore, our model produces nine 3D binary masks for each whole-volume CT.
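As a concrete illustration of this interface, the sketch below shows the expected tensor shapes; the class name AnatomyNet and the channel layout are assumptions for illustration, not the released implementation.

import torch

# Whole-volume HaN CT as a single-channel 3D tensor:
# (batch, channel, depth, height, width), e.g. a typical 178-slice scan.
ct_volume = torch.zeros(1, 1, 178, 512, 512)

# Hypothetical forward pass (the network itself is described in Section II B):
# model = AnatomyNet(num_organs=9)
# logits = model(ct_volume)              # (1, 9, 178, 512, 512), one channel per OAR
# masks = torch.sigmoid(logits) > 0.5    # nine 3D binary masks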
A. Data
Before we introduce our model, we first describe the cu-
ration of training and testing data. Our data consists of
whole-volume CT images together with manually gener-
ated binary masks of the nine anatomies described above.
They were collected from four publicly available sources:
1) DATASET 1 (38 samples) consists of the training set
from the MICCAI Head and Neck Auto Segmentation
Challenge 2015 [4]. 2) DATASET 2 (46 samples) consists of CT images from the Head-Neck Cetuximab collection, downloaded from The Cancer Imaging Archive (TCIA, https://wiki.cancerimagingarchive.net/) [36]. 3) DATASET 3 (177 samples) consists of CT images from four different institutions in Québec, Canada [37], also downloaded from TCIA [36]. 4) DATASET 4 (10 samples) consists of the test set from the MICCAI HaN Segmentation Challenge 2015. We combined the first three datasets and used the aggregated data as our training data, altogether yielding 261 training samples.
DATASET 4 was used as our final evaluation/test dataset
so that we can benchmark our performance against pub-
lished results evaluated on the same dataset. Each of
the training and test samples contains both head and
neck images and the corresponding manually delineated
OARs.
In generating these datasets, we carried out several data cleaning steps, including 1) mapping the structure names assigned by different doctors at different hospitals to a unified set of annotation names (a minimal sketch of this step is given at the end of this subsection), 2) finding correspondences between the annotations and the CT images, 3) converting annotations in the radiation therapy format into usable ground truth label masks, and 4) removing the chest region from CT images to focus on the head and neck anatomies. We have taken care to make sure that the four
datasets described above are non-overlapping to avoid
any potential pitfall of inflating testing or validation per-
formance.
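As an illustration of cleaning step 1 above, the following is a minimal sketch of annotation-name unification; the raw names and the lookup table are hypothetical examples, not the actual mapping used in curating these datasets.

from typing import Optional

# Hypothetical lookup table from hospital-specific structure names to unified names.
RAW_TO_UNIFIED = {
    "BrainStem": "brain_stem",
    "Brain Stem": "brain_stem",
    "OpticChiasm": "chiasm",
    "Parotid_L": "parotid_gland_left",
    "LT Parotid": "parotid_gland_left",
    "Submandibular_R": "submandibular_gland_right",
}

def unify_name(raw_name: str) -> Optional[str]:
    # Strip surrounding whitespace before lookup; return None for structures
    # outside the nine OARs, so they can be dropped from the label masks.
    return RAW_TO_UNIFIED.get(raw_name.strip())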
B. Network architecture
We take advantage of the robust feature learning mech-
anisms obtained from squeeze-and-excitation (SE) resid-
ual blocks [30], and incorporate them into a modified
U-Net architecture for medical image segmentation. We
propose a novel three-dimensional U-Net with SE residual blocks and a hybrid focal and Dice loss for anatomical segmentation, as illustrated in Fig. 1.
The AnatomyNet is a variant of 3D U-Net [25, 38, 39],
one of the most commonly used neural net architectures
in biomedical image segmentation. The standard U-Net
contains multiple down-sampling layers, implemented via max-pooling or convolutions with a stride of two. Although these layers are beneficial for learning high-level features needed to segment complex, large anatomies, they can hurt the segmentation of small anatomies, such as the optic chiasm, which occupy only a few slices in HaN CT images. We design the AnatomyNet with only one down-sampling layer to account for the trade-off between GPU memory usage and network learning capacity. The down-sampling layer is used in the first encoding block so that the feature maps and gradients in the following layers occupy less GPU memory than in other network structures. Inspired by the effectiveness of squeeze-and-excitation residual features in image object classification, we design 3D squeeze-and-excitation (SE) residual blocks in the AnatomyNet for OAR segmentation. The SE residual block adaptively calibrates residual feature maps within each feature channel. The 3D SE residual learning extracts 3D features directly from CT images by extending the two-dimensional squeeze, excitation, scale, and convolution operations to their three-dimensional counterparts.
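To make this concrete, the following is a minimal PyTorch sketch of one possible 3D SE residual block; the channel count, reduction ratio, and layer ordering are our assumptions for illustration, not the exact published configuration. The block computes a residual 3D convolutional feature, squeezes it with global average pooling, excites it through two fully connected layers, scales each channel by the resulting weight, and adds the identity shortcut.

import torch
import torch.nn as nn

class SEResidualBlock3D(nn.Module):
    # Sketch of a 3D squeeze-and-excitation residual block (illustrative sizes).
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm3d(channels),
            nn.ReLU(inplace=True),
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm3d(channels),
        )
        # Squeeze: global average pooling collapses each 3D feature map to one value.
        self.squeeze = nn.AdaptiveAvgPool3d(1)
        # Excitation: two fully connected layers produce one weight per channel.
        self.excite = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        residual = self.body(x)
        b, c = residual.shape[:2]
        # Scale: reweight each channel of the residual, then add the identity shortcut.
        w = self.excite(self.squeeze(residual).view(b, c)).view(b, c, 1, 1, 1)
        return torch.relu(x + residual * w)

Because the network has only one down-sampling layer, located in the first encoding block, such blocks operate on feature maps no coarser than half the input resolution.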