in Section 3, and experimental results are presented in Section 4. Finally, we
conclude the paper in Section 5.
2 Related Work
Deep generative models attempt to capture the probability distribution of the
given data. Restricted Boltzmann Machines (RBMs), one type of deep generative
model, are the basis of many other hierarchical models, and they have been used
to model the distributions of images [15] and documents [16]. Deep Belief
Networks (DBNs) [17] and Deep Boltzmann Machines (DBMs) [5] extend RBMs. The
most successful application of DBNs is image classification [17], where DBNs
are used to extract feature representations. However, RBMs, DBNs and DBMs all
suffer from intractable partition functions or intractable posterior
distributions, so approximate methods must be used to learn these models.
Another important deep generative model is the Variational Autoencoder (VAE)
[6], a directed model that can be trained with gradient-based optimization
methods. However, VAEs are trained by maximizing a variational lower bound on
the data likelihood, which tends to produce blurry generated images.
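Concretely, with the notation of [6] (encoder $q_\phi(z|x)$, decoder $p_\theta(x|z)$ and prior $p(z)$), the bound maximized by a VAE takes the standard form
$$\log p_\theta(x) \,\ge\, \mathbb{E}_{q_\phi(z|x)}\!\left[\log p_\theta(x|z)\right] - D_{\mathrm{KL}}\!\left(q_\phi(z|x)\,\|\,p(z)\right),$$
where the pixel-wise reconstruction term is one commonly cited source of the blurriness mentioned above.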
Recently, Generative Adversarial Networks (GANs) were proposed by Goodfellow
et al. [7], who formulated GAN learning as a game-theoretic scenario between a
generator and a discriminator. Compared with the above models, training GANs
does not require any approximation method. Like VAEs, GANs can also be trained
end-to-end through differentiable networks. Having shown powerful capability on
unsupervised tasks, GANs have been applied to many specific tasks, such as
image generation [18], image super-resolution [9], text-to-image synthesis [19]
and image-to-image translation [20]. By combining a traditional content loss
with an adversarial loss, super-resolution generative adversarial networks [9]
achieve state-of-the-art performance on the task of image super-resolution.
Reed et al. [19] proposed a model to synthesize images from text descriptions
based on conditional GANs [21]. Isola et al. [20] also use conditional GANs to
translate images from one representation to another. In addition to
unsupervised learning tasks, GANs also show potential for semi-supervised
learning. Salimans et al. [10] proposed a GAN-based framework for
semi-supervised learning, in which the discriminator not only outputs the
probability that an input image comes from the real data but also the
probabilities that it belongs to each class.
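As a point of reference, in the original formulation [7] the generator $G$ and the discriminator $D$ play the two-player minimax game
$$\min_G \max_D V(D, G) = \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\!\left[\log D(x)\right] + \mathbb{E}_{z \sim p_z(z)}\!\left[\log\left(1 - D(G(z))\right)\right],$$
which both networks optimize directly with gradient-based methods; in the semi-supervised setting of [10], the discriminator is extended to output $K+1$ classes, with the extra class reserved for generated samples.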
Despite the great success GANs have achieved, improving the quality of
generated images remains a challenge, and many works have been proposed to
address it. Radford et al. [13] first introduced convolutional layers into the
GAN architecture and proposed a network architecture called deep convolutional
generative adversarial networks (DCGANs). Denton et al. [22] proposed another
framework, the Laplacian pyramid of generative adversarial networks (LAPGAN),
which constructs a Laplacian pyramid to generate high-resolution images
starting from low-resolution ones. Further, Salimans et al. [10] proposed a
technique called feature matching to get better