Phrase Type Sensitive Tensor Indexing Model for Semantic Composition
Yu Zhao¹, Zhiyuan Liu¹∗, Maosong Sun¹,²
¹Department of Computer Science and Technology, State Key Lab on Intelligent Technology and Systems,
National Lab for Information Science and Technology, Tsinghua University, Beijing 100084, China
²Jiangsu Collaborative Innovation Center for Language Competence, Jiangsu 221009, China
zhaoyu.thu@gmail.com, {liuzy, sms}@tsinghua.edu.cn
∗Corresponding author: Zhiyuan Liu (liuzy@tsinghua.edu.cn).
Abstract
Compositional semantics aims at constructing the meaning of phrases or sentences according to the compositionality of word meanings. In this paper, we propose to synchronously learn the representations of individual words and extracted high-frequency phrases. The representations of extracted phrases are treated as a gold standard for constructing more general operations that compose the representations of unseen phrases. We propose a grammatical-type-specific model that improves composition flexibility by adopting vector-tensor-vector operations. Our model embodies the compositional characteristics of the traditional additive and multiplicative models. Empirical results show that our model outperforms state-of-the-art composition methods on the task of computing phrase similarities.
Introduction
Compositional semantics aims at constructing the meaning of phrases or sentences according to the compositionality of word meanings. Most recently, continuous word representations have been widely used to represent the semantic meaning of words (Turney and Pantel 2010), and have achieved great success in various NLP tasks such as semantic role labeling (Collobert and Weston 2008), paraphrase detection (Socher et al. 2011a), sentiment analysis (Maas et al. 2011) and syntactic parsing (Socher et al. 2013a). Beyond word representation, it is also essential to find appropriate representations for phrases or longer utterances. Hence, compositional distributional semantic models (Marelli et al. 2014; Baroni and Zamparelli 2010; Grefenstette and Sadrzadeh 2011) have been proposed to construct the representations of phrases or sentences based on the representations of the words they contain.
Most existing compositional distributional semantic models can be divided into the following two typical types:
Figure 1: Comparison of different semantic composition models, including (a) vector-vector composition, (b) tensor-vector composition and (c) vector-tensor-vector composition.

Vector-Vector Composition. These models use element-wise composition operations to compose word vectors into phrase vectors, as shown in Figure 1(A). For example, Mitchell and Lapata (2010) propose to use the additive model (z = x + y) and the multiplicative model (z = x ⊙ y). However, both operations are commutative, which may be unreasonable for semantic composition, since the order of the words in a phrase may influence its meaning. For instance, machine learning and learning machine have different meanings, while commutative composition functions return the same representation for both.
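To make these element-wise operations concrete, the following sketch (using NumPy, with toy 3-dimensional vectors whose values are illustrative rather than learned embeddings) computes the additive and multiplicative compositions and checks that both are insensitive to word order:

import numpy as np

# Toy word vectors (illustrative values, not learned embeddings).
machine = np.array([0.2, 0.7, 0.1])
learning = np.array([0.5, 0.3, 0.9])

# Additive model: z = x + y
additive = machine + learning

# Multiplicative model: z = x ⊙ y (element-wise product)
multiplicative = machine * learning

# Both operations are commutative, so the phrases "machine learning"
# and "learning machine" receive identical representations.
assert np.allclose(machine + learning, learning + machine)
assert np.allclose(machine * learning, learning * machine)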
Tensor-Vector Composition. To improve composition capability, more complicated schemes for word representation have been proposed to replace simple vector-space models, including matrices, tensors, or a combination of vectors and matrices (Erk and Padó 2008; Baroni and Zamparelli 2010; Yessenalina and Cardie 2011; Coecke, Sadrzadeh, and Clark 2010; Grefenstette et al. 2013). In this way, semantic composition is conducted via operations like the tensor-vector product, as demonstrated in Figure 1(B)¹. Despite their powerful capability, these methods have several disadvantages that reduce their scalability: (1) They have to learn matrices or tensors for each word, which is time-consuming. Moreover,
¹In accordance with grammatical structure, words with k arguments are represented by rank k + 1 tensors. Hence, the word neural is represented by a rank-2 tensor, i.e., a matrix. An example of words represented by rank-3 tensors is the verb loves in the clause John loves Mary.
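For a concrete, simplified view of tensor-vector composition, the sketch below follows the footnote's convention that a word with k arguments is a rank k + 1 tensor; the dimensionality, the random parameters, and the particular contraction pattern are illustrative assumptions rather than the formulation of any specific cited model.

import numpy as np

d = 4  # toy embedding dimensionality (illustrative)
rng = np.random.default_rng(0)

# Nouns are plain vectors.
network = rng.standard_normal(d)
john = rng.standard_normal(d)
mary = rng.standard_normal(d)

# An adjective such as "neural" takes one argument, so it is a
# rank-2 tensor (a matrix) applied to the noun vector it modifies.
neural = rng.standard_normal((d, d))
neural_network = neural @ network  # phrase vector of shape (d,)

# A transitive verb such as "loves" takes two arguments, so it is a
# rank-3 tensor contracted with the subject and object vectors
# (which axis binds to which argument is an assumption here).
loves = rng.standard_normal((d, d, d))
john_loves_mary = np.einsum('i,ijk,k->j', john, loves, mary)  # shape (d,)

Under this scheme every word that takes arguments carries its own matrix or tensor of parameters, which illustrates why the per-word storage and training cost mentioned above grows quickly with the embedding dimensionality.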