Retrofitting Word Vectors to Semantic Lexicons
Manaal Faruqui Jesse Dodge Sujay K. Jauhar
Chris Dyer Eduard Hovy Noah A. Smith
Language Technologies Institute
Carnegie Mellon University
Pittsburgh, PA, 15213, USA
{mfaruqui,jessed,sjauhar,cdyer,ehovy,nasmith}@cs.cmu.edu
Abstract
Vector space word representations are learned from distributional information of words in large corpora. Although such statistics are semantically informative, they disregard the valuable information that is contained in semantic lexicons such as WordNet, FrameNet, and the Paraphrase Database. This paper proposes a method for refining vector space representations using relational information from semantic lexicons by encouraging linked words to have similar vector representations, and it makes no assumptions about how the input vectors were constructed. Evaluated on a battery of standard lexical semantic evaluation tasks in several languages, we obtain substantial improvements starting with a variety of word vector models. Our refinement method outperforms prior techniques for incorporating semantic lexicons into word vector training algorithms.
1 Introduction
Data-driven learning of word vectors that capture lexico-semantic information is a technique of central importance in NLP. These word vectors can in turn be used for identifying semantically related word pairs (Turney, 2006; Agirre et al., 2009) or as features in downstream text processing applications (Turian et al., 2010; Guo et al., 2014). A variety of approaches for constructing vector space embeddings of vocabularies are in use, notably including taking low-rank approximations of cooccurrence statistics (Deerwester et al., 1990) and using internal representations from neural network models of word sequences (Collobert and Weston, 2008).
Because of their value as lexical semantic representations, there has been much research on improving the quality of such vectors. Semantic lexicons, which provide type-level information about the semantics of words, typically by identifying synonymy, hypernymy, hyponymy, and paraphrase relations, should be a valuable resource for improving the quality of word vectors that are trained solely on unlabeled corpora. Examples of such resources include WordNet (Miller, 1995), FrameNet (Baker et al., 1998), and the Paraphrase Database (Ganitkevitch et al., 2013).
Recent work has shown that the quality of word vectors can be improved by incorporating semantic knowledge, either by changing the objective of the word vector training algorithm in neural language models (Yu and Dredze, 2014; Xu et al., 2014; Bian et al., 2014; Fried and Duh, 2014) or by relation-specific augmentation of the cooccurrence matrix in spectral word vector models (Yih et al., 2012; Chang et al., 2013). However, these approaches are tied to particular methods for constructing vectors.
The contribution of this paper is a graph-based learning technique for using lexical relational resources to obtain higher quality semantic vectors, which we call "retrofitting." In contrast to previous work, retrofitting is applied as a post-processing step by running belief propagation on a graph constructed from lexicon-derived relational information to update word vectors (§2). This allows retrofitting to be used on pre-trained word vectors obtained using any vector training model. Intuitively, our method encourages the new vectors to be (i) similar to the vectors of related word types and (ii) similar to their purely distributional representations. The retrofitting process is fast, taking about 5 seconds for a graph of 100,000 words and vector length 300, and its runtime is independent of the original word vector training model.
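To make this intuition concrete, the sketch below shows one plausible instantiation of such a post-processing update: a simple iterative averaging over the lexicon graph, in which each word's vector is repeatedly pulled toward the mean of its neighbours' vectors and toward its original vector. The function name retrofit, the uniform neighbour weighting, and the equal weight given to the original vector are illustrative assumptions; the actual objective and relation-specific weights are defined in §2.

```python
import numpy as np

def retrofit(word_vecs, lexicon, n_iters=10):
    """Nudge each word vector toward the average of its lexicon
    neighbours while staying close to its original (distributional)
    vector. Uniform weights are assumed here purely for illustration."""
    new_vecs = {w: v.copy() for w, v in word_vecs.items()}
    for _ in range(n_iters):
        for word, neighbours in lexicon.items():
            # only update words that have a pre-trained vector and
            # at least one in-vocabulary lexicon neighbour
            neighbours = [n for n in neighbours if n in new_vecs]
            if word not in new_vecs or not neighbours:
                continue
            # average of the current neighbour vectors
            neighbour_avg = sum(new_vecs[n] for n in neighbours) / len(neighbours)
            # equal pull toward the neighbours and toward the original vector
            new_vecs[word] = (neighbour_avg + word_vecs[word]) / 2.0
    return new_vecs

# toy usage: two synonyms are drawn together; "bank" has no neighbours
vecs = {"happy": np.array([1.0, 0.0]),
        "glad": np.array([0.0, 1.0]),
        "bank": np.array([5.0, 5.0])}
synonyms = {"happy": ["glad"], "glad": ["happy"]}
retrofitted = retrofit(vecs, synonyms)
```

In this toy example, the vectors for "happy" and "glad" move toward each other over the iterations, while "bank", which has no lexicon neighbours, retains its original vector.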