消除歧义：散布语义扩散内核在词义消歧中的应用

需积分: 5 172 浏览量更新于2024-08-12 收藏 839KB PDF 举报

"这篇研究论文探讨了在自然语言处理（NLP）中解决词义歧义问题的一种新方法——散布的语义扩散内核（Sprinkled Semantic Diffusion Kernel），用于词义消歧（Word Sense Disambiguation, WSD）。论文的作者包括Tinghua Wang、Wei Li、Fulai Liu、Jialin Hu，发表在《Engineering Applications of Artificial Intelligence》期刊上，卷64，页码43–51，日期为2017年。关键词涉及词义消歧、语义扩散内核、类别信息、支持向量机（SVM）以及核方法。" 正文：在自然语言处理领域，词义消歧是关键问题之一，因为词汇经常在不同的语境中具有多个含义。例如，单词“银行”可以指金融机构，也可以是指河岸。为了准确理解和解析文本，需要识别出单词在特定上下文中的确切含义，即词义消歧。该研究提出了一种新的算法——散布的语义扩散内核，它旨在通过在语义空间中传播和融合信息来消除词义歧义。语义扩散内核是一种利用词汇和上下文信息的数学模型，它能够捕捉到词与词之间的语义关系，并将这些关系纳入决策过程，以确定单词在给定语境中的最可能含义。在传统的词义消歧方法中，如基于实例的方法和基于知识的方法，常常需要大量的标注数据或复杂的知识库。而散布的语义扩散内核则可能通过更高效的方式处理这个问题，它可能利用到的支持向量机（SVM）是一种监督学习模型，常用于分类任务，尤其是小样本量的情况。结合核方法，如高斯核或多项式核，SVM可以在高维空间中进行非线性分类，这有助于识别复杂的语义模式。研究中提到的“散布”可能指的是在扩散过程中随机或有选择地引入其他相关或不相关的语义元素，以增加模型的泛化能力和适应性。这种方法可能能够更好地模拟人类理解语言时的思维过程，即从多种可能的解释中选择最合适的。此外，论文可能详细讨论了实验设置、性能评估指标，比如精确率、召回率和F1分数，以及与其他词义消歧方法的比较。通过这些实验，作者可能证明了散布的语义扩散内核在处理歧义词时的优越性，尤其是在没有大量预训练数据或特定领域知识的情况下。这篇研究论文为解决自然语言处理中的词义歧义问题提供了一个创新的解决方案，即散布的语义扩散内核，它结合了语义扩散、支持向量机和核方法，有望提高词义消歧的准确性和效率，从而推动NLP技术的进步。

Engineering Applications of Artificial Intelligence 64 (2017) 43–51

Contents lists available at ScienceDirect

Engineering Applications of Artificial Intelligence

journal homepage: www.elsevier.com/locate/engappai

Sprinkled semantic diffusion kernel for word sense disambiguation

Tinghua Wang

a,b,

*, Wei Li

, Fulai Liu

, Jialin Hua

School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, PR China

Decision Systems and e-Service Intelligence Laboratory, Centre for Artificial Intelligence, Faculty of Engineering and Information Technology, University of Technology

Sydney, Broadway NSW 2007, Australia

School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, PR China

a r t i c l e i n f o

Keywords:

Word sense disambiguation (WSD)

Semantic diffusion kernel

Class information

Support vector machine (SVM)

Kernel method

a b s t r a c t

Word sense disambiguation (WSD), the task of identifying the intended meanings (senses) of words in context, has

been a long-standing research objective for natural language processing (NLP). In this paper, we are concerned

with kernel methods for automatic WSD. Under this framework, the main difficulty is to design an appropriate

kernel function to represent the sense distinction knowledge. Semantic diffusion kernel, which models semantic

similarity by means of a diffusion process on a graph defined by lexicon and co-occurrence information to smooth

the typical ‘‘Bag of Words’’ (BOW) representation, has been successfully applied to WSD. However, the diffusion

is an unsupervised process, which fails to exploit the class information in a supervised classification scenario. To

address the limitation, we present a sprinkled semantic diffusion kernel to make use of the class knowledge of

training documents in addition to the co-occurrence knowledge. The basic idea is to construct an augmented term-

document matrix by encoding class information as additional terms and appending them to training documents.

Diffusion is then performed on the augmented term-document matrix. In this way, the words belonging to the

same class are indirectly drawn closer to each other, hence the class-specific word correlations are strengthened.

We evaluate our method on several Senseval/Semeval benchmark examples with support vector machine (SVM),

and show that the proposed kernel can significantly improve the disambiguation performance over semantic

diffusion kernel in terms of different measures and yield a competitive result with the state-of-the-art kernel

methods for WSD.

1. Introduction

Ambiguity is inherent to human language. Particularly, word sense

ambiguity is prevalent in all natural languages, with a large number of

words having more than one meaning. For instance, the English noun

bank can mean ‘‘sloping raised land, especially along the sides of a river’’ or

‘‘an organization where people and businesses can invest or borrow money,

convert to foreign money, etc. or a building where these services are offered’’.

The correct sense of an ambiguous word can be determined based on

the context where it occurs, and correspondingly the problem of word

sense disambiguation (WSD) is defined as the task of automatically

assigning the most appropriate meaning to a polysemous word in a given

context (Navigli, 2009). As a fundamental semantic understanding

task at the lexical level in natural language processing (NLP), WSD

can benefit many applications such as machine translation, information

retrieval, parsing, and question answering. WSD is considered to be a

key step in order to approach language understanding beyond keyword

Corresponding author at: School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, PR China.

E-mail address: wthgnnu@163.com (T. Wang).

matching (Agirre et al., 2014). Although WSD for human is essentially

a subconscious process and presents no difficulties, it is very difficult

to formalize the computational process of disambiguation since it is

classified among ‘‘AI-complete’’ problems (Turdakov, 2010), that is, it

is a task whose solution is at least as hard as the most difficult problems

in artificial intelligence.

Generally, WSD methods can be classified into two types: knowledge-

based and machine learning (Navigli, 2009; Raviv and Markovitch,

2012). Knowledge-based WSD systems exploit the information in a

lexical knowledge base, such as WordNet and Wikipedia, to perform

WSD. These approaches usually pick the sense whose definition is most

similar to the context of the ambiguous word, by means of textual

overlap or using graph-based measures (Abualhaija and Zimmermann,

2016; Agirre et al., 2009; Navigli and Lapata, 2010). Machine learning

approaches, also called corpus-based approaches, do not make use of

any knowledge resources for disambiguation. These approaches range

http://dx.doi.org/10.1016/j.engappai.2017.05.010

Received 30 December 2015; Received in revised form 9 December 2016; Accepted 15 May 2017

下载后可阅读完整内容，剩余8页未读，立即下载

weixin_38584043

粉丝: 4
资源: 947

消除歧义：散布语义扩散内核在词义消歧中的应用

散布熵matlab代码

多尺度散布熵(matlab).rar

散布

一种步进的单元散布拥塞消除算法

real-cheating.win-spoofer-source:包括内核模式驱动程序源和将请求发送到被劫持设备控件的用户模式控制应用程序，请尽情享受。 发布此消息的原因是散布了许多错误的信息。 笔记-Source code learning

变量X和Y的热散布图：具有类似热图的密度和可选的最小二乘拟合的散布图-matlab开发

深入理解PHP的内核

AI插件_心形散布

散布光斑笔刷.zip

步进单元散布算法在拥塞消除中的应用

最新资源

real-cheating.win-spoofer-source:包括内核模式驱动程序源和将请求发送到被劫持设备控件的用户模式控制应用程序，请尽情享受。发布此消息的原因是散布了许多错误的信息。笔记-Source code learning