Figure 1: An overview of the dual semantic parsing framework. The primal model (Q2LF) and the dual model (LF2Q) form a closed cycle, yielding two different directed loops depending on whether the cycle starts from a query or from a logical form. A validity reward estimates the quality of the intermediate generation output, and a reconstruction reward is exploited to avoid information loss. The primal and dual models can be pre-trained and fine-tuned with labeled data to keep them effective.
where $v, b_a \in \mathbb{R}^n$, and $W_1 \in \mathbb{R}^{n \times 2n}$, $W_2 \in \mathbb{R}^{n \times n}$ are parameters. Then we compute the vocabulary distribution $P_{\mathrm{gen}}(y_t \mid y_{<t}, x)$ by
$$c_t = \sum_{i=1}^{|x|} a_i^t h_i \tag{5}$$
$$P_{\mathrm{gen}}(y_t \mid y_{<t}, x) = \mathrm{softmax}(W_o\,[s_t; c_t] + b_o) \tag{6}$$
where $W_o \in \mathbb{R}^{|V_y| \times 3n}$, $b_o \in \mathbb{R}^{|V_y|}$, and $|V_y|$ is the output vocabulary size. Generation ends once an end-of-sequence token "EOS" is emitted.
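To make the computation concrete, here is a minimal PyTorch sketch of Eqs. (5) and (6), assuming a bidirectional encoder so that $h_i \in \mathbb{R}^{2n}$ (which is why $W_o$ has width $3n$). The attention scoring line is an assumption built from the parameters $v$, $b_a$, $W_1$, $W_2$ named above, since the scoring equation itself precedes this excerpt; all tensors are random stand-ins.

```python
import torch
import torch.nn.functional as F

n, vocab_size, src_len = 64, 500, 12

h = torch.randn(src_len, 2 * n)   # encoder states h_i in R^{2n} (bidirectional)
s_t = torch.randn(n)              # decoder hidden state at step t

# Assumed attention scorer using the parameters named above; the paper's
# actual scoring equation is not shown in this excerpt.
W_1 = torch.randn(n, 2 * n)       # W_1 in R^{n x 2n}
W_2 = torch.randn(n, n)           # W_2 in R^{n x n}
v, b_a = torch.randn(n), torch.randn(n)
e_t = torch.tanh(h @ W_1.T + (W_2 @ s_t + b_a)) @ v   # scores, shape (src_len,)
a_t = F.softmax(e_t, dim=0)                           # attention weights a_i^t

# Eq. (5): context vector as the attention-weighted sum of encoder states.
c_t = a_t @ h                                         # c_t in R^{2n}

# Eq. (6): distribution over the output vocabulary V_y from [s_t; c_t].
W_o = torch.randn(vocab_size, 3 * n)                  # |V_y| x 3n
b_o = torch.randn(vocab_size)
p_gen = F.softmax(W_o @ torch.cat([s_t, c_t]) + b_o, dim=0)
```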
Copy Mechanism We also include a copy mechanism to improve model generalization, following the implementation of See et al. (2017), a hybrid between Nallapati et al. (2016) and the pointer network (Gulcehre et al., 2016). The predicted token comes either from a fixed output vocabulary $V_y$ or from the raw input words $x$. We use a sigmoid gate function $\sigma$ to make a soft decision between generation and copying at each step $t$:
$$g_t = \sigma\big(v_g^\top [s_t; c_t; \phi(y_{t-1})] + b_g\big) \tag{7}$$
$$P(y_t \mid y_{<t}, x) = g_t\, P_{\mathrm{gen}}(y_t \mid y_{<t}, x) + (1 - g_t)\, P_{\mathrm{copy}}(y_t \mid y_{<t}, x) \tag{8}$$
where $g_t \in [0, 1]$ is the balance score, $v_g$ is a weight vector, and $b_g$ is a scalar bias. The distribution $P_{\mathrm{copy}}(y_t \mid y_{<t}, x)$ is described below.
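Continuing the sketch above, Eqs. (7) and (8) amount to one sigmoid gate and one interpolation. The size of $\phi(y_{t-1})$ is assumed to be $n$ (hence $v_g \in \mathbb{R}^{4n}$), and $P_{\mathrm{copy}}$ is stubbed with a uniform placeholder until its definition below.

```python
phi_prev = torch.randn(n)   # phi(y_{t-1}): previous output embedding (assumed size n)
v_g = torch.randn(4 * n)    # weight vector v_g over [s_t; c_t; phi(y_{t-1})]
b_g = torch.randn(())       # scalar bias b_g

# Eq. (7): soft generate-vs-copy decision.
g_t = torch.sigmoid(v_g @ torch.cat([s_t, c_t, phi_prev]) + b_g)

# Eq. (8): gated mixture of the two distributions over the same vocabulary.
p_copy = torch.full((vocab_size,), 1.0 / vocab_size)   # placeholder; see Entity Mapping
p_t = g_t * p_gen + (1.0 - g_t) * p_copy
assert torch.isclose(p_t.sum(), torch.tensor(1.0))     # still a distribution
```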
Entity Mapping Although the copy mechanism can deal with unknown words, many raw words cannot be directly copied into a logical form. For example, kobe bryant is represented as en.player.kobe_bryant in OVERNIGHT (Wang et al., 2015). It is common for entities to be identified by a Uniform Resource Identifier (URI; Klyne and Carroll, 2006) in a knowledge base. Thus, a mapping from raw words to URIs is applied after copying. Mathematically, $P_{\mathrm{copy}}$ in Eq. (8) is calculated by:
$$P_{\mathrm{copy}}(y_t = w \mid y_{<t}, x) = \sum_{i,j:\, \mathrm{KB}(x_{i:j}) = w} \; \sum_{k=i}^{j} a_k^t$$
where $i < j$, $a_k^t$ is the attention weight of position $k$ at decoding step $t$, and $\mathrm{KB}(\cdot)$ is a dictionary-like function mapping a specific noun phrase to the corresponding entity token in the vocabulary of logical forms.
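The span-summing definition above can be sketched directly. The toy utterance and the $\mathrm{KB}(\cdot)$ dictionary below are illustrative assumptions, reusing the imports from the earlier sketch.

```python
x = ["when", "did", "kobe", "bryant", "retire"]
kb = {("kobe", "bryant"): "en.player.kobe_bryant"}   # hypothetical KB(.) table

a_t = F.softmax(torch.randn(len(x)), dim=0)          # toy attention weights over x

def p_copy_dist(a_t, x, kb):
    """P_copy(y_t = w | y_<t, x): for every inclusive span x_{i:j} (i < j)
    that KB(.) maps to entity token w, sum the attention mass a_k^t for
    k = i..j into the score of w."""
    dist = {}
    for i in range(len(x)):
        for j in range(i + 1, len(x)):               # enforce i < j
            w = kb.get(tuple(x[i:j + 1]))
            if w is not None:
                dist[w] = dist.get(w, 0.0) + float(a_t[i:j + 1].sum())
    return dist

print(p_copy_dist(a_t, x, kb))   # e.g. {'en.player.kobe_bryant': 0.37}
```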
2.2 Dual Model
The dual task (LF2Q) is the inverse of the primal task: it aims to generate a natural language query given a logical form. We can also exploit the attention-based encoder-decoder architecture (with or without the copy mechanism) to build the dual model.
Reverse Entity Mapping Unlike the primal task, we reversely map every possible KB entity $y_t$ of a logical form to a corresponding noun phrase $\mathrm{KB}^{-1}(y_t)$¹ before query generation. Since each KB entity may have multiple aliases in the real world (e.g., kobe bryant has the nickname the black mamba), we make different selections in two cases (see the sketch after the footnote):
• For paired data, we select the noun phrase from $\mathrm{KB}^{-1}(y_t)$ that occurs in the query.
• For unpaired data, we randomly select one from $\mathrm{KB}^{-1}(y_t)$.
¹ $\mathrm{KB}^{-1}(\cdot)$ is the inverse operation of $\mathrm{KB}(\cdot)$, which returns the set of all corresponding noun phrases given a KB entity.
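The two selection rules can be sketched as follows; the alias table and the helper name reverse_map are hypothetical, and only the selection logic mirrors the two cases above.

```python
import random

kb_inv = {  # hypothetical KB^{-1}: entity -> list of noun-phrase aliases
    "en.player.kobe_bryant": ["kobe bryant", "the black mamba"],
}

def reverse_map(entity, paired_query=None):
    aliases = kb_inv[entity]
    if paired_query is not None:
        # Paired data: select the alias that actually occurs in the query.
        for alias in aliases:
            if alias in paired_query:
                return alias
    # Unpaired data: select an alias uniformly at random.
    return random.choice(aliases)

print(reverse_map("en.player.kobe_bryant", "when did kobe bryant retire"))
# -> 'kobe bryant'
print(reverse_map("en.player.kobe_bryant"))   # a random alias
```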