Improving Wasserstein GAN Training: Gradient Penalty for Model Stability and Sample Quality
Title: "Improved Training of Wasserstein GANs"

Description: The paper notes that although generative adversarial networks (GANs) demonstrate powerful generative ability and are widely used in deep learning, they are difficult to train stably. The original Wasserstein GAN (WGAN) was proposed to address this instability, yet it can still produce low-quality samples or fail to converge. The authors identify the weight clipping used in WGAN to enforce a Lipschitz constraint on the critic as a likely cause of these problems.

Observing that this hard weight constraint can make the critic behave pathologically, they propose an alternative: penalizing the norm of the critic's gradient with respect to its input, rather than clipping the weights. Avoiding clipping in this way makes WGAN training considerably more robust. Experiments show that the proposed penalty markedly improves WGAN performance, enabling stable training across a wide range of architectures, including 101-layer ResNets and language models with continuous generators.

A highlight of the paper is that, with almost no hyperparameter tuning, the improved WGAN generates high-quality images on datasets such as CIFAR-10 and LSUN Bedrooms. This suggests a more effective and broadly applicable training strategy, of real significance for the practical adoption and development of GANs.

In short, the core contribution is a new training technique that resolves the difficulties of training Wasserstein GANs in high-dimensional spaces and offers a new route to stable, efficient generation of high-quality samples, improving the practicality and reliability of generative models for researchers and practitioners alike.
Algorithm 1 WGAN with gradient penalty. We use default values of λ = 10, n_critic = 5, α = 0.0001, β_1 = 0, β_2 = 0.9.

Require: The gradient penalty coefficient λ, the number of critic iterations per generator iteration n_critic, the batch size m, Adam hyperparameters α, β_1, β_2.
Require: initial critic parameters w_0, initial generator parameters θ_0.
1: while θ has not converged do
2:     for t = 1, ..., n_critic do
3:         for i = 1, ..., m do
4:             Sample real data x ∼ P_r, latent variable z ∼ p(z), a random number ε ∼ U[0, 1].
5:             x̃ ← G_θ(z)
6:             x̂ ← εx + (1 − ε)x̃
7:             L^(i) ← D_w(x̃) − D_w(x) + λ(‖∇_x̂ D_w(x̂)‖_2 − 1)^2
8:         end for
9:         w ← Adam(∇_w (1/m) Σ_{i=1}^m L^(i), w, α, β_1, β_2)
10:     end for
11:     Sample a batch of latent variables {z^(i)}_{i=1}^m ∼ p(z).
12:     θ ← Adam(∇_θ (1/m) Σ_{i=1}^m −D_w(G_θ(z^(i))), θ, α, β_1, β_2)
13: end while
critic. In each case, the critic trained with weight clipping ignores higher moments of the data dis-
tribution and instead models very simple approximations to the optimal functions. In contrast, our
approach does not suffer from this behavior.
3.2 Exploding and vanishing gradients
We observe that the WGAN optimization process is difficult because of interactions between the
weight constraint and the cost function, which result in either vanishing or exploding gradients
without careful tuning of the clipping threshold c.
To demonstrate this, we train WGAN on the Swiss Roll toy dataset, varying the clipping threshold c in [10^-1, 10^-2, 10^-3], and plot the norm of the gradient of the critic loss with respect to successive
layers of activations. Both generator and critic are 12-layer ReLU MLPs without batch normaliza-
tion. Figure 1b shows that for each of these values, the gradient either grows or decays exponentially
as we move farther back in the network. We find our method results in more stable gradients that
neither vanish nor explode, allowing training of more complicated networks.
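The exponential growth or decay can be reproduced in miniature. The following is an illustrative sketch, not the paper's Swiss Roll experiment: it backpropagates a unit gradient through a deep linear chain whose weights are clipped to [-c, c], where the gradient norm scales roughly like the product of per-layer factors, so a tight threshold shrinks it exponentially while a loose one lets it blow up.

```python
import numpy as np

# Illustrative sketch (hypothetical setup, not the paper's experiment):
# push a unit-norm gradient backward through a 12-layer linear chain with
# weights clipped to [-c, c], recording the gradient norm after each layer.

def backprop_norms(c, depth=12, width=64, seed=0):
    rng = np.random.default_rng(seed)
    g = np.ones(width) / np.sqrt(width)   # unit-norm gradient at the output
    norms = []
    for _ in range(depth):
        W = rng.normal(0.0, 1.0, (width, width))
        W = np.clip(W, -c, c)             # weight clipping, as in the original WGAN
        g = W.T @ g                       # one backward step through a linear layer
        norms.append(np.linalg.norm(g))
    return norms

small = backprop_norms(c=0.01)            # norm collapses toward zero layer by layer
large = backprop_norms(c=1.0)             # norm grows by a large factor per layer
```

With c = 0.01 almost every clipped entry saturates at ±0.01 and the norm decays by roughly the same factor each layer; with c = 1.0 the per-layer factor exceeds one and the norm explodes, mirroring the two failure modes shown in Figure 1b.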
4 Gradient penalty
We now propose an alternative way to enforce the Lipschitz constraint. A differentiable function is 1-Lipschitz if and only if it has gradients with norm at most 1 everywhere, so we consider directly constraining the gradient norm of the critic's output with respect to its input. To circumvent tractability issues, we enforce a soft version of the constraint with a penalty on the gradient norm for random samples x̂ ∼ P_x̂. Our new objective is
L = E_{x̃∼P_g}[D(x̃)] − E_{x∼P_r}[D(x)] + λ E_{x̂∼P_x̂}[(‖∇_x̂ D(x̂)‖_2 − 1)^2].   (3)

The first two terms are the original critic loss; the λ-weighted term is our gradient penalty.
Sampling distribution. We implicitly define P_x̂ by sampling uniformly along straight lines between pairs of points sampled from the data distribution P_r and the generator distribution P_g. This is motivated by the fact that the optimal critic contains straight lines with gradient norm 1 connecting coupled points from P_r and P_g (see Proposition 1). Given that enforcing the unit gradient norm constraint everywhere is intractable, enforcing it only along these straight lines seems sufficient and experimentally results in good performance.
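The sampling procedure behind P_x̂ amounts to a convex combination per pair. A small numpy sketch, with placeholder Gaussians standing in for P_r and P_g (the real distributions would come from data and the generator):

```python
import numpy as np

# Sketch of sampling from the implicit distribution P_xhat: draw paired
# batches from stand-ins for P_r and P_g, then pick a uniform point on the
# straight line between each pair. The Gaussians are placeholders.

rng = np.random.default_rng(0)
m, d = 8, 3
x_real = rng.normal(2.0, 1.0, (m, d))      # stand-in for a batch from P_r
x_fake = rng.normal(-2.0, 1.0, (m, d))     # stand-in for a batch from P_g
eps = rng.uniform(size=(m, 1))             # one epsilon per pair, broadcast over d
x_hat = eps * x_real + (1 - eps) * x_fake  # points on the connecting segments
```

Each row of x_hat lies on the segment between its paired real and fake points; the penalty in Eq. 3 is then evaluated at exactly these interpolates.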
Penalty coefficient All experiments in this paper use λ = 10, which we found to work well across
a variety of architectures and datasets ranging from toy tasks to large ImageNet CNNs.