场景文本检测的紧致度感知评估协议

需积分: 9 53 浏览量更新于2024-09-07 收藏 8.34MB PDF 举报

“Tightness-aware Evaluation Protocol for Scene Text Detection” 是一篇关于场景文字检测评价标准的研究论文，由Yuliang Liu、Lianwen Jin等人撰写，来自华南理工大学电子与信息工程学院。在文本检测领域，评估协议对于方法的发展至关重要。公平、客观和合理的评估方法是推动技术进步的基础。然而，现有的评估指标存在一些明显的不足：首先，它们缺乏目标导向性；其次，它们无法识别检测方法的紧凑性；最后，现有的一对一或多对一解决方案存在内在的漏洞和缺陷。因此，该论文提出了一种新颖的评估协议，称为Tightness-aware Intersect-over-Union (TIoU)度量，它可以量化真实值的完整性、检测结果的紧凑度以及匹配度的紧密性。具体来说，TIoU并不只是简单地使用IoU（交并比）值，而是考虑了两种常见的检测行为，并直接利用TIoU的分数来识别和评价检测效果的优劣。传统IoU仅比较边界框的重叠部分，但忽视了检测结果与实际文字区域的贴合程度。TIoU引入了新的考量因素，它强调了检测框与实际文字区域的紧密度，这有助于更好地评估检测算法是否能够准确且紧凑地定位文字。此外，针对一对一和多对一匹配的问题，TIoU提供了更严密的解决方案，减少了因匹配方式导致的误差。论文中，作者可能通过实验对比分析了TIoU与现有评价标准（如PASCAL VOC的IoU）之间的差异，展示了TIoU在评估场景文字检测任务时的优越性。这样的新评估协议将促进场景文字检测领域的进步，使得研究人员可以更加准确地评估和优化他们的检测算法，进而推动技术的发展。 "Tightness-aware Evaluation Protocol for Scene Text Detection"这篇论文关注于改进现有的文字检测评估体系，提出了TIoU这一新的度量标准，旨在解决当前评估中存在的问题，提高评估的公正性和准确性，对于推动场景文字检测技术的进步具有重要意义。

Tightness-aware Evaluation Protocol for Scene Text Detection

Yuliang Liu, Lianwen Jin

∗

, Zecheng Xie, Canjie Luo, Shuaitao Zhang, Lele Xie

College of Electronic Information Engineering, South China University of Technology

liu.yuliang@mail.scut.edu.cn; lianwen.jin@gmail.com

Abstract

Evaluation protocols play key role in the developmental

progress of text detection methods. There are strict require-

ments to ensure that the evaluation methods are fair, ob-

jective and reasonable. However, existing metrics exhibit

some obvious drawbacks: 1) They are not goal-oriented; 2)

they cannot recognize the tightness of detection methods;

3) existing one-to-many and many-to-one solutions involve

inherent loopholes and deﬁciencies. Therefore, this pa-

per proposes a novel evaluation protocol called Tightness-

aware Intersect-over-Union (TIoU) metric that could quan-

tify completeness of ground truth, compactness of detection,

and tightness of matching degree. Speciﬁcally, instead of

merely using the IoU value, two common detection behav-

iors are properly considered; meanwhile, directly using the

score of TIoU to recognize the tightness. In addition, we

further propose a straightforward method to address the an-

notation granularity issue, which can fairly evaluate word

and text-line detections simultaneously. By adopting the

detection results from published methods and general ob-

ject detection frameworks, comprehensive experiments on

ICDAR 2013 and ICDAR 2015 datasets are conducted to

compare recent metrics and the proposed TIoU metric. The

comparison demonstrated some promising new prospects,

e.g., determining the methods and frameworks for which the

detection is tighter and more beneﬁcial to recognize. Our

method is extremely simple; however, the novelty is none

other than the proposed metric can utilize simplest but rea-

sonable improvements to lead to many interesting and in-

sightful prospects and solving most the issues of the pre-

vious metrics. The code is publicly available at https:

//github.com/Yuliang-Liu/TIoU-metric.

1. Introduction

Recent metrics for evaluating text detection have been

adopted from the object detection Pascal VOC metric [4].

However, unlike object detection, text detection tasks re-

quire the bounding box to be tighter because the primary

goal of detection is to recognize the text. Simply adopting

(a) Cutting.

(b) Pure.

(d) Cutting & Outlier-GTs.

Figure 1. Unreasonable cases obtained using recent evaluation

metrics. (a), (b), (c), and (d) all have the same IoU of 0.66 against

the GT. Red: GT. Blue: detection.

the same IoU metric for text detection leads to the following

issues:

• As shown in Fig. 1 (a), detection over a ﬁxed IoU

threshold with the ground truth (GT) may not com-

pletely recall the text (some characters are missed);

however, previous metrics consider that the GT has

been entirely recalled.

• As shown in Figs. 1 (b), (c), and (d), detection over

a ﬁxed IoU threshold with the GT may still contain

background noise; however, previous metrics consider

such detection to have 100% precision.

• As shown in Fig. 1, previous metrics consider detec-

tions (a), (b), (c), and (d) to be equivalent perfect de-

tections because they all have the same IoU value that

is higher than a threshold. However, considering that

the primary goal of detection is to recognize the text,

these detections are not equivalent: 1) In (a), there is

no way to recognize the characters outside the detec-

tion bounding box; 2) in (c), it is very difﬁcult for a

recognizer to distinguish which is the target GT; 3) the

issues pertaining to both (a) and (c) can simultaneously

occur for (d); 4) as for (b), it is easy for a normal text

recognizer to recognize the content correctly.

• Previous metrics severely rely on an IoU threshold.

4321

arXiv:1904.00813v1 [cs.CV] 27 Mar 2019

下载后可阅读完整内容，剩余9页未读，立即下载

发疯疯

粉丝: 1
资源: 1

场景文本检测的紧致度感知评估协议

【计算机视觉】TIoU文本检测评价指标 计算机视觉.pdf

请患“植物神经紊乱”和“焦虑症”的朋友进来.pdf

一个更加摇摆不定的div的库-JavaScript开发

怎么用交叉点的（经度、纬度）坐标绘制网络并根据节点的紧密度中心值为节点着色？

Novel optical lithography using silver superlens

boxes_tightness_prior:在MIDL 2020上的口头报告-弱监督分割的边界框

On the tightness of a cut-set bound on network function computation

呼吸科常用英文.doc

大维随机矩阵的谱分析理论

基于STM32单片机的激光雕刻机控制系统设计-含详细步骤和代码

最新资源

【计算机视觉】TIoU文本检测评价指标计算机视觉.pdf