A Reinforcement Learning Framework
for Medical Image Segmentation
Farhang Sahba, Member, IEEE, and Hamid R. Tizhoosh, and Magdy M.A. Salama, Fellow, IEEE
Abstract— This paper introduces a new method for medical
image segmentation using a reinforcement learning scheme.
We use this novel idea as an effective way to find optimal
local thresholding and structuring element values and segment
the prostate in ultrasound images. The reinforcement learning
agent uses an ultrasound image and its manually segmented
version and takes actions (i.e., different thresholding and
structuring element values) to change its environment (the
quality of the segmented image). The agent is provided with
a scalar reinforcement signal determined objectively. The agent
uses these objective rewards/punishments to explore/exploit the
solution space. The values obtained in this way can be used as
valuable knowledge to fill a Q-matrix. The reinforcement learning
agent can then apply this knowledge to similar ultrasound
images as well. The results demonstrate high potential for
applying reinforcement learning to medical image segmentation.
I. INTRODUCTION
Many applications in medical imaging require segmenting
an object in the image [1]. Ultrasound imaging is an important
modality for clinical applications. The accurate
detection of the prostate boundary in ultrasound images is
crucial for diagnostic tasks [2]. However, in these images
the contrast is usually low and the boundaries between the
prostate and the background are fuzzy. Speckle noise and weak
edges also make ultrasound images inherently difficult to
segment. The prostate boundaries are generally extracted
from transrectal ultrasound (TRUS) images [2]. Prostate segmentation
methods generally have limitations when shadows
with gray level and texture similar to the prostate are attached
to it, and/or when boundary segments are missing. In these cases
the segmentation error may increase considerably. Another
obstacle may be the lack of a sufficient number of training
(gold) samples when a learning technique is employed and the
samples must be prepared by an expert, as in supervised
methods. Algorithms based on active contours have been
implemented quite successfully, but with the major
drawback that they depend on user interaction to determine
the initial snake. Therefore, a more universal approach should
require a minimum of user interaction and training data.
Farhang Sahba is with the Pattern Analysis and Machine Intelligence Laboratory,
Department of System Design Engineering, University of Waterloo,
Waterloo, Ontario, Canada (email: fsahba@uwaterloo.ca).
Hamid R. Tizhoosh is with the Pattern Analysis and Machine Intelligence
Laboratory, Department of System Design Engineering, University of
Waterloo, Waterloo, Ontario, Canada (email: tizhoosh@uwaterloo.ca).
Magdy M.A. Salama is with the Department of Electrical and Computer
Engineering, University of Waterloo, Waterloo, Ontario, Canada (email:
msalama@hivolt.uwaterloo.ca).
Considering the above factors, our new algorithm based on
reinforcement learning (RL) is introduced to locally segment
the prostate in ultrasound images. The most important concept
of RL is learning by trial and error based on interaction
with the environment [3], [4]. This makes an RL agent suitable
for dynamic environments. Its goal is to find an action
policy that controls the behavior of the dynamic process,
guided by signals (reinforcements) that indicate how well it
has been performing the required task.
When applying this method to medical image
segmentation, the agent takes actions (i.e., different
values for the threshold and for the structuring element of a morphological
operator) to change its environment (the quality
of the segmented object). States are also defined based on
the quality of this segmented object. First, the agent takes the
image and applies some values. Then it receives an objective
reward or punishment based on a comparison of
its result with the goal image. The agent tries to learn
which actions gain the highest reward. After this stage,
based on the accumulated rewards, the agent has appropriate
knowledge for similar images as well.
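This trial-and-error loop can be sketched in a few lines of code. The following is an illustrative toy example, not the authors' implementation: a single state, threshold values as the only actions, a Dice-overlap reward against a gold-standard mask, and a running Q-value per action; all variable names and the toy data are our own assumptions.

```python
import numpy as np

# Toy "ultrasound" sub-image and its gold-standard mask (illustrative only).
rng = np.random.default_rng(0)
image = rng.random((8, 8))
gold = image > 0.5

# Candidate threshold actions the agent may try.
actions = [0.3, 0.4, 0.5, 0.6, 0.7]

def reward(threshold):
    """Dice overlap between the thresholded result and the gold mask."""
    seg = image > threshold
    inter = np.logical_and(seg, gold).sum()
    return 2.0 * inter / (seg.sum() + gold.sum())

q = np.zeros(len(actions))   # one Q-value per action (single state)
alpha = 0.5                  # learning rate
for _ in range(200):         # explore actions by trial and error
    a = rng.integers(len(actions))
    q[a] += alpha * (reward(actions[a]) - q[a])

best = actions[int(np.argmax(q))]  # action with highest accumulated reward
```

In the paper's setting there would be many states (one per sub-image quality level) and the actions would also include structuring element sizes, but the reward-driven filling of the Q-matrix follows the same pattern.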
In our algorithm we use this reinforced local parameter
adjustment to segment the prostate. The proposed method
controls the local threshold and the post-processing
parameter using a reinforcement learning agent. The main
purpose of this work is to demonstrate that, as an
intelligent technique, reinforcement learning can be trained
using a very limited number of samples and can also gain
extra knowledge during online training. This is a major
advantage over other approaches (such as supervised
methods), which need either a large training set or a significant
amount of expert or a priori knowledge.
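The local parameter adjustment above amounts to partitioning the image and thresholding each sub-image with its own value. A minimal sketch, assuming a fixed grid partition and externally supplied per-block thresholds (in the proposed method these would be chosen by the trained agent, not hard-coded as here):

```python
import numpy as np

def local_threshold(image, thresholds, grid=(2, 2)):
    """Apply a separate threshold to each sub-image of a grid partition."""
    h, w = image.shape
    rows, cols = grid
    out = np.zeros_like(image, dtype=bool)
    for i in range(rows):
        for j in range(cols):
            # Boundaries of sub-image (i, j) in the grid.
            r0, r1 = i * h // rows, (i + 1) * h // rows
            c0, c1 = j * w // cols, (j + 1) * w // cols
            out[r0:r1, c0:c1] = image[r0:r1, c0:c1] > thresholds[i][j]
    return out

# Toy 4x4 image with intensities increasing from 0 to 1.
img = np.arange(16, dtype=float).reshape(4, 4) / 15.0
mask = local_threshold(img, [[0.25, 0.45], [0.65, 0.85]])
```

A local scheme like this can adapt to the uneven contrast of ultrasound images, where a single global threshold would fail; the agent's remaining job is to pick good per-block parameters.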
This paper is organized as follows: Section II is a short
introduction to reinforcement learning. Section III describes
the proposed method. Section IV presents the results, and
Section V concludes the work.
II. REINFORCEMENT LEARNING
Reinforcement learning (RL) is based on the idea that an
artificial agent learns by interacting with its environment
[3], [4]. It allows agents to automatically determine the
ideal behavior within a specific context, maximizing
performance with respect to predefined measures. Several
components constitute the general idea behind reinforcement
learning. The RL agent is the decision-maker of the process
and takes actions that are recognized by the environment.
It receives a reward or punishment from its environment
depending on the action taken. The RL agent discovers which
0-7803-9490-9/06/$20.00/©2006 IEEE
2006 International Joint Conference on Neural Networks
Sheraton Vancouver Wall Centre Hotel, Vancouver, BC, Canada
July 16-21, 2006