
Adversarial Feature Matching for Text Generation
Yizhe Zhang¹  Zhe Gan¹  Kai Fan¹  Zhi Chen¹  Ricardo Henao¹  Dinghan Shen¹  Lawrence Carin¹

¹Duke University, Durham, NC, 27708. Correspondence to: Yizhe Zhang <yizhe.zhang@duke.edu>.
Abstract
The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We propose a framework for generating realistic text via adversarial training. We employ a long short-term memory network as the generator and a convolutional network as the discriminator. Instead of using the standard objective of GAN, we propose matching the high-dimensional latent feature distributions of real and synthetic sentences via a kernelized discrepancy metric. This eases adversarial training by alleviating the mode-collapsing problem. Our experiments show superior performance in quantitative evaluation and demonstrate that our model can generate realistic-looking sentences.
1. Introduction
Generating meaningful and coherent sentences is central to many natural language processing applications. The general idea is to estimate a distribution over sentences from a corpus, then use it to sample realistic-looking sentences. This task is important because it enables generation of novel sentences that preserve the semantic and syntactic properties of real-world sentences, while being potentially different from any of the examples used to estimate the model. For instance, in the context of dialog generation, it is desirable to generate answers that are more diverse and less generic (Li et al., 2016).
One simple approach consists of first learning a latent space to represent (fixed-length) sentences using an encoder-decoder (autoencoder) framework based on Recurrent Neural Networks (RNNs) (Cho et al., 2014; Sutskever et al., 2014), then generating synthetic sentences by decoding random samples from this latent space. However, this approach often fails to generate realistic sentences from arbitrary latent representations. The reason for this is that, when mapping sentences to their latent representations using an autoencoder, the mappings usually cover a small but structured region of the latent space, which corresponds to a manifold embedding (Bowman et al., 2016). In practice, most regions of the latent space do not necessarily map (decode) to realistic sentences. Consequently, randomly sampling latent representations often yields nonsensical sentences. Recent work by Bowman et al. (2016) has attempted to generate more diverse sentences via RNN-based variational autoencoders. However, they did not address the fundamental problem that the posterior distribution over latent variables does not appropriately cover the latent space.
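To make this setup concrete, below is a minimal PyTorch sketch of the autoencoder-based approach: a GRU encoder maps a sentence to a fixed-length latent vector, and "generation" amounts to decoding a randomly drawn latent vector. All module names and hyperparameters are illustrative, not the implementation of any cited work; as discussed above, samples decoded this way often fall off the learned manifold and read as nonsense.

```python
# Minimal sketch (illustrative names) of sentence generation by decoding
# random samples from an RNN autoencoder's latent space.
import torch
import torch.nn as nn

VOCAB_SIZE, EMB_DIM, HID_DIM, MAX_LEN = 10000, 128, 256, 20

class RNNAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.emb = nn.Embedding(VOCAB_SIZE, EMB_DIM)
        self.encoder = nn.GRU(EMB_DIM, HID_DIM, batch_first=True)
        self.decoder = nn.GRU(EMB_DIM, HID_DIM, batch_first=True)
        self.out = nn.Linear(HID_DIM, VOCAB_SIZE)

    def encode(self, tokens):                  # tokens: (batch, seq_len)
        _, h = self.encoder(self.emb(tokens))  # h: (1, batch, HID_DIM)
        return h

    def decode(self, h, bos_id=1):
        # Greedy decoding from latent state h, feeding predictions back in.
        tok = torch.full((h.size(1), 1), bos_id, dtype=torch.long)
        words = []
        for _ in range(MAX_LEN):
            o, h = self.decoder(self.emb(tok), h)
            tok = self.out(o[:, -1]).argmax(-1, keepdim=True)
            words.append(tok)
        return torch.cat(words, dim=1)

model = RNNAutoencoder()
# "Generate" by decoding an arbitrary latent code; with a plain (trained)
# autoencoder, such a point likely lies off the encoder's learned manifold.
z = torch.randn(1, 1, HID_DIM)
print(model.decode(z))
```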
Another underlying challenge of generating realistic text relates to the nature of the RNN. During inference, the RNN generates words in sequence from previously generated words, in contrast to training, where ground-truth words are used at every step. As a result, errors accumulate in proportion to the length of the sequence: the first few words look reasonable, but quality deteriorates quickly as the sentence progresses. Bengio et al. (2015) coined this phenomenon exposure bias. Toward addressing this problem, Bengio et al. (2015) proposed the scheduled sampling approach. However, Huszár (2015) showed that scheduled sampling is a fundamentally inconsistent training strategy, in that it produces largely unstable results in practice.
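The mismatch can be seen directly in how the two decoding loops are conditioned. The sketch below (PyTorch assumed; `rnn_step`, a single-step decoder returning next-word logits and an updated hidden state, is a hypothetical stand-in) contrasts teacher-forced training with free-running inference.

```python
# Illustrative sketch of the training/inference mismatch behind exposure bias.
import torch
import torch.nn.functional as F

def free_running_generate(rnn_step, h, bos, steps=20):
    # Inference: each step is conditioned on the model's OWN previous
    # sample, so an early mistake is fed back in and compounds.
    tok, out = bos, []
    for _ in range(steps):
        logits, h = rnn_step(tok, h)                       # logits: (batch, vocab)
        tok = torch.multinomial(F.softmax(logits, -1), 1)  # (batch, 1)
        out.append(tok)
    return torch.cat(out, dim=1)

def teacher_forced_loss(rnn_step, h, gold):
    # Training: each step is conditioned on the GROUND-TRUTH previous word
    # (gold: (batch, seq_len)), so the model never sees its own errors.
    loss = 0.0
    for t in range(1, gold.size(1)):
        logits, h = rnn_step(gold[:, t - 1:t], h)
        loss = loss + F.cross_entropy(logits, gold[:, t])
    return loss / (gold.size(1) - 1)
```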
The Generative Adversarial Network (GAN) (Goodfellow et al., 2014) is an appealing and natural answer to the above issues. GAN matches the distributions of synthetic and real data by introducing an adversarial game between a generator and a discriminator. The GAN objective seeks to learn a generator that functionally maps samples from a given (simple) prior distribution to synthetic data that appear realistic.
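Concretely, writing $G$ for the generator and $D$ for the discriminator, the standard objective of Goodfellow et al. (2014), restated here for reference, is the minimax game

$$\min_G \max_D \; \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))],$$

where $p_z$ denotes the simple prior over the generator's input noise and $G(z)$ is a synthetic sample.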
The GAN setup explicitly seeks that the latent representations from real data (via encoding) be distributed in a manner consistent with the specified prior (e.g., Gaussian or uniform). Due to the nature of adversarial training, the discriminator compares real and synthetic sentences, rather than their individual words, which in principle should alleviate the exposure-bias issue. Recent work (Lamb et al., 2016) has incorporated an additional discriminator to train a
sequence-to-sequence language model that better preserves long-term dependencies.