Supervised word sense disambiguation using semantic diffusion kernel
Tinghua Wang a,b,*, Junyang Rao b, Qi Hu c

a School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China
b Institute of Computer Science and Technology, Peking University, Beijing 100871, China
c School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
Article info
Article history:
Received 10 March 2013
Received in revised form
20 July 2013
Accepted 19 August 2013
Available online 17 September 2013
Keywords:
Word sense disambiguation (WSD)
Semantic diffusion kernel
Support vector machine (SVM)
Kernel method
Natural language processing
Abstract
The success of machine learning approaches to word sense disambiguation (WSD) depends largely on the representation of the context in which an ambiguous word occurs. Typically, contexts are represented as vectors in a vector space using the "Bag of Words (BoW)" technique. Despite its ease of use, the BoW representation suffers from well-known limitations, mostly due to its inability to exploit semantic similarity between terms. In this paper, we apply the semantic diffusion kernel, which models semantic similarity by means of a diffusion process on a graph defined by lexicon and co-occurrence information, to smooth the BoW representation for WSD systems. The semantic diffusion kernel can be obtained through a matrix exponentiation transformation on the given kernel matrix, and it implicitly exploits higher-order co-occurrences to infer semantic similarity between terms. The superiority of the proposed method is demonstrated experimentally on several SensEval disambiguation tasks.
© 2013 Elsevier Ltd. All rights reserved.
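The construction described in the abstract — exponentiating a co-occurrence matrix to smooth the BoW kernel — can be sketched in a few lines. This is only an illustrative sketch, not the authors' exact formulation: the function name, the decay parameter `lam`, and the toy data are assumptions, and the diffusion is applied here to the term-term co-occurrence graph G = XᵀX with the matrix exponential computed via an eigendecomposition (valid because G is symmetric).

```python
import numpy as np

def semantic_diffusion_kernel(X, lam=0.05):
    """Smooth the linear bag-of-words kernel by diffusion on the term graph.

    X   : (n_contexts, n_terms) bag-of-words count matrix.
    lam : diffusion decay parameter (lam = 0 recovers the plain BoW kernel).
    """
    G = X.T @ X                       # term-term co-occurrence graph
    # Matrix exponential exp(lam * G) via eigendecomposition (G is symmetric).
    w, V = np.linalg.eigh(G)
    S = (V * np.exp(lam * w)) @ V.T   # semantic smoothing matrix
    return X @ S @ X.T                # smoothed context-context kernel

# Toy example: three contexts over a three-term vocabulary.
X = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, 1.0],
              [1.0, 0.0, 0.0]])
K_bow = X @ X.T                        # plain BoW kernel
K_dif = semantic_diffusion_kernel(X)   # diffusion-smoothed kernel
```

Note that contexts 1 and 2 in the toy data share no terms, so their plain BoW similarity is zero, yet their diffusion-smoothed similarity is positive: the expansion of exp(λG) accumulates powers of G, which is exactly the "higher-order co-occurrence" effect the abstract refers to.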
1. Introduction
Word sense disambiguation (WSD) refers to the task of identi-
fying the correct sense of an ambiguous word in a given context
(Navigli, 2009). The ambiguity results from homonymy, i.e., words
having the same spelling and pronunciation but different senses,
and polysemy, i.e., words having multiple senses, usually with
subtle differences (Nguyen and Ock, 2011). Homonymy is relatively
easy to disambiguate because the domains of different senses are
distinct, e.g., the noun bank could be defined as “sloping raised
land, especially along the sides of a river” or alternately as “an
organization where people and businesses can invest or borrow
money, convert to foreign money, etc. or a building where these
services are offered” (Cambridge Advanced Learner's Dictionary).
Polysemy is far more difficult because of the subtle differences and
the common origin of the senses, e.g., the noun cold could refer to
“a mild viral infection involving the nose and respiratory passages” or
“the absence of heat, or the sensation produced by low temperatures”
(WordNet 3.1). As a fundamental semantic understanding task at
the lexical level in natural language processing, WSD can benefit
many applications such as information retrieval (Stokoe et al.,
2003; Zhong and Ng, 2012) and machine translation (Carpuat and
Wu, 2007; Chan et al., 2007). In practical applications, WSD is often fully integrated into the system and cannot be separated out (for instance, in information retrieval, WSD is often not performed explicitly but is merely a by-product of query-to-document matching). However, it has proved very difficult to formalize the process of disambiguation, which humans perform so effortlessly.
There are two main kinds of methods to perform the task of WSD: knowledge-based approaches and corpus-based approaches. The former disambiguate words by comparing their context against information from predefined lexical resources such as WordNet, whereas the latter make no use of such resources for disambiguation (Navigli, 2009). Most corpus-based approaches stem from the machine learning community, ranging from supervised learning, in which a classifier is trained for each distinct word on a corpus of manually sense-annotated examples, to completely unsupervised methods that cluster occurrences of words, thereby inducing senses. Among these, supervised learning approaches have been the most successful algorithms to date. Moreover, in recent years it has proved very promising to apply kernel methods (Shawe-Taylor and Cristianini, 2004; Simek et al., 2004) such as the support vector machine (SVM) (Giuliano et al., 2009; Jin et al., 2008; Joshi et al., 2006; Lee et al., 2004; Pahikkala et al., 2009), kernel principal component analysis (KPCA) (Su et al., 2004; Wu et al., 2004) and the regularized least-squares classifier (RLSC) (Popescu, 2004) to the WSD task. Kernel methods in general, and SVM in particular, have delivered extremely high performance in a wide variety of learning tasks. The advantage of using kernel methods for WSD is that they offer a flexible and efficient way of defining application-specific kernels for introducing background knowledge and explicitly modeling linguistic insights.
For the machine learning-based WSD, one of the key steps is
the representation of the context in which an ambiguous word
http://dx.doi.org/10.1016/j.engappai.2013.08.007
* Corresponding author at: School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China. Tel.: +86 18810358076.
E-mail addresses: wthpku@163.com, wthgnnu@163.com (T. Wang).
Engineering Applications of Artificial Intelligence 27 (2014) 167–174