ROC图：研究者实用指南

需积分: 9 12 浏览量更新于2024-08-01 收藏 402KB PDF 举报

"ROC曲线图是研究人员在分类器组织和性能可视化方面的一种有用技术，常用于医学决策，并逐渐被机器学习和数据挖掘领域采纳。尽管ROC曲线看似简单，但在实际应用中存在一些误解和陷阱。本文既作为ROC曲线的基础教程，也作为在研究中使用它们的实践指南，介绍如何基于性能可视化、组织和选择分类器，以及分析其在信号检测理论中的应用，如在命中率和误报率之间的权衡。" ROC曲线图（Receiver Operating Characteristics，接收者操作特性）是评估分类器性能的重要工具。它通过绘制真阳性率（True Positive Rate, TPR）与假阳性率（False Positive Rate, FPR）之间的关系来展示分类器在不同阈值下的表现。真阳性率表示分类器正确识别正类的能力，假阳性率则表示将负类错误识别为正类的概率。在医学领域，ROC曲线常用于诊断测试效果的评估，例如判断某种疾病的检测准确性。在机器学习和数据挖掘中，ROC曲线可以帮助我们比较不同模型的性能，无论数据集的不平衡程度如何。通过改变分类阈值，我们可以找到在特定应用场景下最合适的分类器。 ROC曲线的基本构建步骤包括： 1. 计算每个样本的得分或概率，这可以是分类器输出的任何度量。 2. 设置一系列阈值，对所有样本进行分类。 3. 对于每个阈值，计算真阳性率和假阳性率。 4. 将所有真阳性率与假阳性率的点连成曲线，形成ROC曲线。曲线下的面积（Area Under the Curve, AUC）是衡量分类器性能的一个综合指标。AUC接近1表示分类器性能优秀，而接近0.5则表示性能不佳，与随机猜测接近。然而，仅依赖AUC可能不足以全面评估分类器，因为某些应用可能更关心误报率较低或真阳性率较高的情况。在实际应用中，应注意以下几点： - ROC曲线并不考虑类别的不平衡，因此对于严重不平衡的数据集，可能需要结合其他评估指标，如精确度、召回率和F1分数。 - 不同曲线的形状可以反映分类器的辨别能力。U形曲线表示分类器性能较差，而远离对角线的曲线表示性能较好。 - ROC曲线可以帮助识别过度拟合或欠拟合。如果在训练集上得到的曲线优于测试集，可能表明模型在训练数据上过拟合。总结来说，ROC曲线图是一种强大的评估和比较分类器性能的工具，适用于各种领域的决策支持。理解其原理和正确使用方法对于优化模型选择和提高预测质量至关重要。

ROC graphs 7

Infinity

.55

.54 .53 .52

.51 .505

.39

.38 .37 .36 .35

.34 .33

.30

0 0.1 0.2 0.3 0.4

0.5 0.6

0.7 0.8 0.9 1

False positive rate

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

True positive rate

Inst# Class Score Inst# Class Score

1 p .9 11 p .4

2 p .8 12 n .39

3 n .7 13 p .38

4 p .6 14 n .37

5 p .55 15 n .36

6 p .54 16 n .35

7 n .53 17 p .34

8 n .52 18 n .33

9 p .51 19 p .30

10 n .505 20 n .1

Figure 3. The ROC “curve” created by thresholding a test set. The table at right

shows twenty data and the score assigned to each by a scoring classiﬁer. The graph

at left shows the corresponding ROC curve with each point labeled by the threshold

that produces it.

ﬁgure 3 is taken from a very small instance set so that each point’s

derivation can be understood. In the table of ﬁgure 3, the instances are

sorted by their scores, and each point in the ROC graph is labeled by

the score threshold that produces it. A threshold of +∞ produces the

point (0, 0). As we lower the threshold to 0.9 the ﬁrst positive instance is

classiﬁed positive, yielding (0, 0.1). As the threshold is further reduced,

the curve climbs up and to the right, ending up at (1, 1) with a threshold

ROC101.tex; 16/03/2004; 12:56; p.7

8 Tom Fawcett

Algorithm 1 Conceptual method for calculating an ROC curve. See

algorithm 2 for a practical method.

Inputs: L, the set of test instances; f (i), the probabilistic classiﬁer’s es-

timate that instance i is positive; min and max, the smallest and largest

values returned by f; increment, the smallest diﬀerence between any two f

values.

1: for t = min to max by increment do

2: F P ← 0

3: T P ← 0

4: for i ∈ L do

5: if f(i) ≥ t then /* This example is over threshold */

6: if i is a positive example then

7: T P ← T P + 1

8: else /* i is a negative example, so this is a false positive */

9: F P ← F P + 1

10: end if

11: end if

12: end for

13: Add point (

F P

T P

) to ROC curve

14: end for

15: end

of 0.1. Note that lowering this threshold corresponds to moving from

the “conservative” to the “liberal” areas of the graph.

Although the test set is very small, we can make some tentative

observations about the classiﬁer. It appears to perform better in the

more conservative region of the graph; the ROC point at (0.1, 0.5)

produces its highest accuracy (70%). This is equivalent to saying that

the classiﬁer is better at identifying likely positives than at identifying

likely negatives. Note also that the classiﬁer’s best accuracy occurs at

a threshold of ≥ .54, rather than at ≥ .5 as we might expect with a

balanced distribution. The next section discusses this phenomenon.

3.1. Relative versus absolute scores

An important point about ROC graphs is that they measure the ability

of a classiﬁer to produce good relative instance scores. A classiﬁer need

not produce accurate, calibrated probability estimates; it need only

produce relative accurate scores that serve to discriminate positive and

negative instances.

Consider the simple instance scores shown in ﬁgure 4, which came

from a Naive Bayes classiﬁer. Comparing the hypothesized class (which

is Y if score> 0.5, else N) against the true classes, we can see that

the classiﬁer gets instances 7 and 8 wrong, yielding 80% accuracy.

ROC101.tex; 16/03/2004; 12:56; p.8

剩余37页未读，继续阅读

sdfex

粉丝: 1
资源: 6

ROC图：研究者实用指南

ROC Graphs: Notes and Practical Considerations for Data Mining Researchers

伯克利大学机器学习-11Bootstrap&cross-validation&ROC plots Michael Jordan

ROC.zip_ROC Biometric_ROC ROI_ROC matlab_ROC medical _ROC medi

plotroc.rar_AUC_ROC AUC_plotroc_roc_roc and auc

roc.zip_Matlab ROC_roc_roc curve_信号ROC曲线_检测ROC

matlab-roc.rar_Matlab ROC_ROC matlab_ROC plot_plotroc_roc

ROCcurve.zip_ROC 信号检测_ROC曲线_roc curves_roccurve_检测ROC

draw_roc.rar_Matlab ROC_ROC曲线_ROC曲线MATLAB_ROC曲线、_roc

SVM.rar_ROC曲线_ROC曲线SVM_ROC曲线绘制svm_SVM ROC曲线_svm roc

ROC.zip_crowdv82_python_python roc曲线_roc 数据_roc曲线python

最新资源