Encoding World Knowledge in the Evaluation of Local Coherence
Muyu Zhang¹∗, Vanessa Wei Feng², Bing Qin¹, Graeme Hirst², Ting Liu¹ and Jingwen Huang¹
¹Research Center for Social Computing and Information Retrieval, Harbin Institute of Technology, Harbin, China
²Department of Computer Science, University of Toronto, Toronto, ON, Canada
{myzhang,qinb,tliu,jwhuang}@ir.hit.edu.cn
{weifeng,gh}@cs.toronto.edu
Abstract
Previous work on text coherence was primarily based on matching multiple mentions of the same entity in different parts of the text; therefore, it misses the contribution from semantically related but not necessarily coreferential entities (e.g., Gates and Microsoft). In this paper, we capture such semantic relatedness by leveraging world knowledge (e.g., Gates is the person who created Microsoft), and evaluate it within two existing frameworks. First, in the unsupervised framework, we introduce semantic relatedness as an enrichment to the original graph-based model of Guinaudeau and Strube (2013). In addition, we incorporate semantic relatedness as additional features into the popular entity-based model of Barzilay and Lapata (2008). Across both frameworks, our enriched model with semantic relatedness outperforms the original methods, especially on short documents.
1 Introduction
In a well-written document, sentences are organized and presented in a logical and coherent form, which makes the text fluent and easily understood. Therefore, coherence is a fundamental aspect of high text quality, and the evaluation of coherence is a crucial component of many NLP applications, such as essay scoring (Miltsakaki and Kukich, 2004), story generation (McIntyre and Lapata, 2010), and document summarization (Barzilay et al., 2002).
∗This work was partly done while the first author was visiting the University of Toronto.
A particularly popular model for evaluating text coherence is the entity-based local coherence model of Barzilay and Lapata (2008) (B&L), which extracts mentions of entities in adjacent sentences, and captures local coherence in terms of the transitions in the grammatical role of each mention. Following this direction, a number of extensions have been proposed (Elsner and Charniak, 2008; Elsner and Charniak, 2011; Lin et al., 2011; Feng et al., 2014), the majority of which focus on enriching the original entity features. An exception is the unsupervised model of Guinaudeau and Strube (2013) (G&S), which converts the document into a graph of sentences, and evaluates the text coherence by computing the average out-degree over the entire graph.
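To make the graph-based intuition concrete, the following Python sketch scores a document by the average out-degree of a sentence graph in which a forward edge connects two sentences that mention a common entity. It is a simplified illustration under our own assumptions (each sentence represented as a set of entity strings, unweighted edges), not the full projection-based model of G&S.

    def coherence_score(sentences):
        """sentences: list of sets of entity strings, in document order."""
        n = len(sentences)
        if n < 2:
            return 0.0
        out_degree = [0] * n
        for i in range(n):
            for j in range(i + 1, n):
                # A forward edge exists when two sentences mention a common entity.
                if sentences[i] & sentences[j]:
                    out_degree[i] += 1
        # Coherence is the average out-degree over all sentences.
        return sum(out_degree) / n

    # Example: three sentences, each represented by the entities it mentions.
    print(coherence_score([{"gates", "microsoft"}, {"gates"}, {"microsoft"}]))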
However, despite the apparent success of these methods, they rely merely on matching mentions of the same entity, but neglect the contribution from semantically related but not necessarily coreferential entities. For example, the text in Figure 1a¹ has no common entity in s₂ and s₃. However, the transition between them is perfectly coherent, because there exists close semantic relatedness between two distinct entities, Gates in s₂ and Microsoft in s₃, which can be captured by the world knowledge that Gates is the person who created Microsoft (represented by Gates-create-Microsoft). In fact, the absence of common entities between adjacent sentences is quite prevalent. Analyzing the CoNLL 2012 dataset (Pradhan et al., 2012), we found that 42.34% of the time, adjacent sentences do not share common entities. As a result, methods which rely on strict entity matching would fail in such cases.
¹Based on a news item: http://www.cnbc.com/id/101576926
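The following sketch illustrates, again under our own simplifying assumptions, how such world knowledge could bridge this gap: a small store of relation triples (here a single hypothetical Gates-create-Microsoft entry) lets two sentences be connected even when they share no entity. It is meant only to clarify the idea; the models we actually use are the enriched graph-based and entity-based frameworks described later in the paper.

    # Hypothetical store of world-knowledge relations as (arg1, relation, arg2) triples.
    TRIPLES = {("gates", "create", "microsoft")}

    def related(e1, e2, triples=TRIPLES):
        """True if the two entities are linked by any world-knowledge relation."""
        return any({e1, e2} == {a1, a2} for a1, _, a2 in triples)

    def sentences_connected(ents_i, ents_j):
        """Connect two sentences if they share an entity or contain related entities."""
        if ents_i & ents_j:
            return True
        return any(related(e1, e2) for e1 in ents_i for e2 in ents_j)

    # s2 mentions Gates and s3 mentions Microsoft: no shared entity, but the
    # Gates-create-Microsoft triple still links the two sentences.
    print(sentences_connected({"gates"}, {"microsoft"}))  # True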