
larger values of η(X) are considered ‘‘positive’’ cases and smaller values are considered ‘‘negative’’ ones. Hence, β is called the True Positive Fraction (TPF), 1 − β is called the False Negative Fraction (FNF), α is called the False Positive Fraction (FPF) and 1 − α is called the True Negative Fraction (TNF). Classification rules (or diagnostic tests) are assessed in terms of the Area Under the ROC Curve (AUC). It is easy to see that the area under the ROC curve (∫ TPF dFPF) is equal to the area under the ODC curve (∫ s dp). The larger the AUC, the better the classification rule. For more details on ROC, AUC, their meaning and terminology, the reader is referred to Hanley (1998) and Hanley and McNeil (1982), among others.
The costs of the two types of errors are typically not equal. For example, the cost of missing a cancer (a false negative) greatly outweighs the cost of sending a nondiseased patient for a biopsy (a false positive). The prior probabilities π₀ and π₁ of the two classes ω₀ and ω₁ also enter into the choice of the optimum threshold setting. If the decision function η is the likelihood ratio p₁/p₀ and cᵢⱼ is the cost of deciding ωᵢ while the truth is ωⱼ, then selecting ξ equal to c₁₀π₀/c₀₁π₁ gives rise to the Bayes decision rule, which minimizes the risk (see Anderson, 2003, Ch. 6). In general, η is not the likelihood ratio, and we need to know which threshold value ξ would give a particular operating point on the ODC, i.e., 1 − α and 1 − β, such that the risk is minimized. If we do not know the costs and priors, the designers of the classification rule would still like to operate at a particular operating point that they determine to satisfy some subjective criteria, or at least to control the most serious type of error. In either case, the decision maker must address the same question: what is the value of ξ that achieves the operating point (p, s) = (G(ξ), F(ξ)) on the ODC?
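To make the role of the costs and priors tangible, here is a minimal numeric sketch (our illustration, not taken from the paper) that thresholds the likelihood ratio at the Bayes point; the univariate Gaussian class-conditional densities and the values of the priors and costs are hypothetical, chosen only for demonstration.

```python
import math

def gauss_pdf(x, mu, sigma):
    """Univariate Gaussian density (illustrative class-conditional model)."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

# Hypothetical priors and costs, for illustration only.
pi0, pi1 = 0.9, 0.1      # priors of omega_0 (negative) and omega_1 (positive)
c01, c10 = 25.0, 1.0     # c_ij: cost of deciding omega_i while the truth is omega_j

xi = (c10 * pi0) / (c01 * pi1)   # Bayes threshold on eta = p1/p0

def bayes_decide(x, mu0=0.0, mu1=2.0, sigma=1.0):
    """Decide omega_1 iff the likelihood ratio exceeds the Bayes threshold."""
    eta = gauss_pdf(x, mu1, sigma) / gauss_pdf(x, mu0, sigma)
    return 1 if eta > xi else 0

print(f"xi = {xi:.3f}")
print([bayes_decide(x) for x in (-1.0, 0.5, 1.5, 3.0)])
```

Here the large false-negative cost c₀₁ pulls the threshold below one, so a relatively weak likelihood ratio already triggers a ‘‘positive’’ decision.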
The answer is straightforward if we know the distributions F and G. However, when we do not know these distributions
we construct the classification rule with the aid of a ‘‘training set’’ tr = {xᵢ ∈ ω₀, i = 1, . . . , n} ∪ {xⱼ ∈ ω₁, j = 1, . . . , m}. The resulting decision function is ηₜᵣ. Several parametric and nonparametric techniques are available in the literature for constructing a classification rule; constructing the rule itself is not the target of the present work. Estimating the ODC (or ROC) of ηₜᵣ is of great interest in ROC analysis, and several approaches are available in the literature for that task. Interested readers may refer to Dorfman and Alf (1969), Hsieh and Turnbull (1996), Metz and Pan (1999), Pepe (2000) and Qin and Zhang (2003).
A naive estimator of the ODC is the unsmooth empirical ODC, which we denote by ÔDC; see Fig. 2. It is a plot of the empirical distribution function Fₘ(ξ) vs. Gₙ(ξ) at every threshold value ξ (see Section 2). The empirical ODC has several attractive distribution-free features. It converges to the true ODC, i.e., FG⁻¹(p), 0 ≤ p ≤ 1; moreover, it can be represented as a summation of two independent versions of Brownian bridges (up to a term of small order of magnitude); see Hsieh and Turnbull (1996). For finite samples, ÔDC is an unsmooth staircase curve, as opposed to the smooth fits produced by the approaches cited above.
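As a concrete illustration of this construction (ours, not the authors' code), the following sketch builds the empirical ODC staircase from two samples of decision-function scores; the Gaussian score distributions and sample sizes are assumed purely for demonstration, since the construction itself is distribution-free.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical decision-function scores; any continuous scores would do.
scores_w0 = rng.normal(0.0, 1.0, size=100)   # n scores from omega_0, ECDF G_n
scores_w1 = rng.normal(1.5, 1.0, size=80)    # m scores from omega_1, ECDF F_m

def ecdf(sample, xi):
    """Empirical distribution function of `sample` evaluated at threshold xi."""
    return np.mean(sample <= xi)

# Sweeping the threshold over all observed scores traces the staircase of
# points (G_n(xi), F_m(xi)); this is the empirical ODC.
thresholds = np.sort(np.concatenate([scores_w0, scores_w1]))
odc_hat = np.array([[ecdf(scores_w0, t), ecdf(scores_w1, t)] for t in thresholds])

# A convention-free summary: the Mann-Whitney estimate of
# AUC = P(score from omega_1 > score from omega_0).
auc = np.mean(scores_w1[:, None] > scores_w0[None, :])
print(f"Mann-Whitney AUC estimate: {auc:.3f}")
```

Each jump of the staircase corresponds to one observed score, which is exactly what produces the unsmooth shape for finite samples.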
This article is organized as follows. In Section 2, we give motivation and propose an empirical procedure (algorithm) for estimating the threshold at a particular operating point with the aid of ÔDC. No knowledge of F and G is assumed, but the ODC is assumed to be known. Although this assumption does not seem valid in practical settings, it allows us to analyze the proposed algorithm mathematically and discover any optimality properties before considering the practical case where an estimated ODC is used. Section 3 provides the analysis of the proposed algorithm, deferring the proofs to the Appendix; the analysis reveals that a simpler version of the proposed algorithm has the same asymptotic efficiency. We then consider practical problems in which we have a finite sample size and do not know the ODC, so that we have to approximate (estimate) it, e.g., using a smoothing method. Parameters used and derived in the analysis of the algorithm now have to be estimated nonparametrically from the finite sample. These practical considerations are incorporated into the third and final version of our algorithm at the end of that section. In Section 4 we apply the final version of our proposed algorithm to a real-world data set as well as to data sets simulated from a wide variety of distributions.
2. Algorithm for threshold estimation
This section proposes a procedure (algorithm) that estimates the threshold at a particular operating point M, whose
coordinates on the ODC curve are (p, s). Basic statistical definitions are needed for this purpose; they will be introduced in
Section 2.1 for a wider readership. In Section 2.2 the algorithm is motivated and proposed.
2.1. Preliminaries
The empirical (sample) distribution function Gₙ, an estimator for G, is defined as

Gₙ(ξ) = (1/n) Σᵢ I(xᵢ ≤ ξ),
where I is the indicator function. For 0 < G(ξ) < 1, it is known that
Gₙ(·) →_a.s. G(·),  (1a)

n^{1/2}(Gₙ(ξ) − G(ξ)) →_D N(0, G(ξ)(1 − G(ξ))).  (1b)
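A quick Monte Carlo check of (1b) can be run as follows; this sketch is our illustration, and the choice of the standard normal for G, the threshold ξ, the sample size n and the number of replications are all arbitrary assumptions made for the demonstration.

```python
import math
import numpy as np

rng = np.random.default_rng(1)
xi, n, reps = 0.5, 400, 2000

# Take G to be the standard normal CDF, purely for this illustration.
G_xi = 0.5 * (1.0 + math.erf(xi / math.sqrt(2.0)))

# The scaled ECDF error at xi should be approximately
# N(0, G(xi)(1 - G(xi))) for large n.
errors = [
    math.sqrt(n) * (np.mean(rng.standard_normal(n) <= xi) - G_xi)
    for _ in range(reps)
]
print(np.var(errors), G_xi * (1.0 - G_xi))  # the two values should be close
```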
A similar result to (1b) can be obtained when ξ is a r.v. (see Lemma 4). The pth-quantile G⁻¹(p) is defined as

G⁻¹(p) = inf{x, G(x) ≥ p}.
The sample pth-quantile Gₙ⁻¹(p), an estimator of G⁻¹(p), is the pth-quantile of Gₙ, i.e.,

Gₙ⁻¹(p) = inf{x, Gₙ(x) ≥ p}.
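As an illustration (ours, with a hypothetical sample and p), the infimum definition translates directly into picking an order statistic: Gₙ first reaches level p at the k-th order statistic with k = ⌈np⌉.

```python
import numpy as np

def sample_quantile(sample, p):
    """pth sample quantile, i.e. inf{x : G_n(x) >= p}.

    For a sorted sample x_(1) <= ... <= x_(n), G_n first reaches level p at
    the k-th order statistic with k = ceil(n * p), so the infimum is x_(k).
    """
    x = np.sort(np.asarray(sample))
    k = max(int(np.ceil(len(x) * p)), 1)   # guard k >= 1 for p = 0
    return x[k - 1]

rng = np.random.default_rng(2)
print(sample_quantile(rng.standard_normal(1000), 0.5))  # close to the true median 0
```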