多示例学习：CAD诊断中的协同优化框架

需积分: 50 143 浏览量更新于2024-09-09 收藏 518KB PDF 举报

多示例学习是一种在计算机辅助诊断（CAD）等领域的关键技术，它旨在解决医学图像分析中的几个关键挑战。首先，医学图像中的疾病结构通常与正常结构存在极大的数据不平衡，这使得检测任务极具困难。其次，实时性是在线执行的重要需求，系统需要快速准确地识别可能的病变。再者，对于恶性结构，往往会产生多个相关的候选区域，这些区域彼此紧密相邻，进一步增加了识别的复杂性。传统的方法可能难以处理这种复杂性，因此，本文提出了一种创新的学习框架，即结合级联分类器和多实例学习（Multiple Instance Learning, MIL）。MIL的基本思想是将一组样本视为一个"实例集合"，其中至少有一个样本包含所需的信息，而其他可能为噪声。在这个框架下，级联分类器可以逐步筛选出最有可能的正例，同时考虑到多个候选之间的关联性。作者构建了一个统一的最小-最大优化框架，将级联回归和MIL问题联合起来，形成一个可转换为四次多项式约束二次规划的问题。这种形式的优势在于它能够有效地处理复杂的决策过程，并通过块坐标优化算法得到高效的解决方案。这种方法允许系统在保持高效的同时，兼顾对恶性结构的精确检测和对冗余候选区的有效排除。在实际应用中，研究者将这项技术应用于计算机辅助诊断系统，用于检测医疗图像中的潜在病变结构。通过这种多示例学习的级联分类器，系统能够在满足实时性能的同时，显著提高异常结构的识别准确性和区分度，从而有助于医生做出更精准的诊断决策。这种创新方法为医学图像分析领域提供了一种强大的工具，有望在未来改善疾病的早期发现和治疗效果。

A Min-Max Framework of Cascaded Classiﬁer with Multiple Instance Learning

for Computer Aided Diagnosis

Dijia Wu

1∗

, Jinbo Bi

, Kim Boyer

Rensselaer Polytechnic Institute, Troy, NY 12180 USA, wud5@rpi.edu

Siemens Medical Solutions, Malvern, PA 19355 USA, jinbo.bi@siemens.com

Abstract

The computer aided diagnosis (CAD) problems of detect-

ing potentially diseased structures from medical images are

typically distinguished by the following challenging char-

acteristics: extremely unbalanced data between negative

and positive classes; stringent real-time requirement of on-

line execution; multiple positive candidates generated for

the same malignant structure that are highly correlated and

spatially close to each other. To address all these problems,

we propose a novel learning formulation to combine cas-

cade classiﬁcation and multiple instance learning (MIL) in

a uniﬁed min-max framework, leading to a joint optimiza-

tion problem which can be converted to a tractable quadrat-

ically constrained quadratic program and efﬁciently solved

by block-coordinate optimization algorithms.

We apply the proposed approach to the CAD problems of

detecting pulmonary embolism and colon cancer from com-

puted tomography images. Experimental results show that

our approach signiﬁcantly reduces the computational cost

while yielding comparable detection accuracy to the current

state-of-the-art MIL or cascaded classiﬁers. Although not

speciﬁcally designed for balanced MIL problems, the pro-

posed method achieves superior performance on balanced

MIL benchmark data such as MUSK and image data sets.

1. Introduction

Over the years, computer aided diagnosis (CAD) sys-

tems have been widely used to assist physicians in inter-

preting medical images from different modalities such as

magnetic resonance imaging (MRI), X-ray, and computed

tomography (CT) and to identify potentially diseased re-

gions like lesions or tumors. Most CAD systems comprise

of three stages: identify candidate structures, i.e., poten-

∗

This work was conducted when Dijia Wu was with Siemens Medical

Solutions at Malvern PA, USA.

tially unhealthy regions, in the image; generate features for

each candidate; classify each candidate as normal (nega-

tive) or diseased (positive). To maintain high sensitivity,

a very large number of candidates are generated in the ﬁrst

stage because any malignant regions missed at this stage can

never be recovered later in the CAD system. Consequently,

majority of the candidates generated, typically more than

99%, are false positives, which makes the data extremely

unbalanced. In this situation, cascaded classiﬁers can be

used to speed up candidate classiﬁcation by quickly dis-

carding numerous negative samples with low-cost features

at early stages and spending more computation on promis-

ing disease-like candidates [15].

Moreover, for CAD data, a candidate is labeled as posi-

tive if it is sufﬁciently close to a radiologist’s mark (ground

truth) and labeled as negative otherwise. Multiple candi-

dates are usually generated corresponding to the same ab-

normal structure so that if any such candidate is detected,

the underlying structure is found. Therefore, CAD prob-

lems are better modeled as multiple instance learning (MIL)

by enclosing all the candidates within a certain distance to

a radiologist’s mark into a positive bag [6].

In this paper, we propose a novel approach to combine

MIL classiﬁers in a cascade. In particular, we start out

with formulating MIL as an optimization problem in a min-

max framework in Section 2. Section 3 reviews the joint

optimization principle [5] used to construct all hyperplane

classiﬁers of a cascade in one shot, and describes a new

min-max formulation for optimization of the cascade. The

two min-max frameworks are fused as discussed in Sec-

tion 4 to form a uniﬁed approach that optimizes a cascade

of MIL classiﬁers simultaneously. Experimental results on

two CAD applications and MIL benchmark datasets are

given in Section 5 together with some discussion. We con-

clude with a review of our contributions and potential ex-

tensions in Section 6.

下载后可阅读完整内容，剩余7页未读，立即下载

qq_25839767

粉丝: 0
资源: 2

多示例学习：CAD诊断中的协同优化框架

多用户检测MATLAB代码

多示例学习与多标记学习的研究

多示例学习目标跟踪算法

多示例学习内容

多示例学习问题研究进展综述

局部特征与多示例学习结合的超声图像分类方法

一种新的基于多示例学习的场景分类方法 (2010年)

近邻加权多示例学习：一种多标记学习的改进算法

多示例学习框架解析与现状探讨

基于多示例学习的图像检索算法研究

最新资源