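Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Image-to-image translation is a class of vision and graphics problems in which the goal is to learn a mapping between an input image and an output image, usually from a training set of paired images. In practice, paired training data is often hard to obtain. To address this, "Unpaired Image-to-Image Translation" proposes a method for learning to translate images from a source domain X to a target domain Y without any paired examples. The core of the method is the Cycle-Consistent Adversarial Network, developed by Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros at the Berkeley AI Research (BAIR) laboratory. It handles unpaired translation tasks such as turning zebras into horses or summer landscapes into winter scenes (see Figure 1).

Cycle consistency is the key property of the model. During training, the model learns not only to translate an image from X to Y, but also to translate the result back so that it is as close as possible to the original image in X. This two-way constraint keeps the translation meaningful: if a translated image is passed through the reverse translation, it should come back to roughly the same input, closing the cycle.

Specifically, the model contains two adversarial networks: a generator G that maps X to Y, paired with a discriminator D_Y that distinguishes real images in Y from images produced by G; and an inverse generator F that maps Y to X, paired with a corresponding discriminator D_X. During training, the two generators try to make their translated images look real, while the discriminators try to tell real images from generated ones, which sets up adversarial learning. By optimizing a cycle-consistency loss together with the adversarial losses, the model can learn image translation without paired data.

The technique has a wide range of applications. In artistic style transfer, for example, users can render their own photographs in the style of famous artists such as Van Gogh, Cezanne, or Monet (as shown in Figure 1). It can also be applied to image restoration, image enhancement, season transfer, object class conversion, and other image processing tasks.

In summary, "Unpaired Image-to-Image Translation" uses Cycle-Consistent Adversarial Networks to address image translation when no paired image data is available, giving computer vision and graphics a new tool for image processing and generation.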

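The two losses named in the summary above can be written out explicitly. The block below is a sketch of the objective as it is commonly stated for this model. The excerpt on this page does not include these equations, so the notation (p_data for the data distributions, lambda for the weight on the cycle term) should be read as an assumption rather than as a quotation.

```latex
\begin{align}
\mathcal{L}_{\mathrm{GAN}}(G, D_Y, X, Y)
  &= \mathbb{E}_{y \sim p_{\mathrm{data}}(y)}\bigl[\log D_Y(y)\bigr]
   + \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\bigl[\log\bigl(1 - D_Y(G(x))\bigr)\bigr] \\
\mathcal{L}_{\mathrm{cyc}}(G, F)
  &= \mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\bigl[\lVert F(G(x)) - x \rVert_1\bigr]
   + \mathbb{E}_{y \sim p_{\mathrm{data}}(y)}\bigl[\lVert G(F(y)) - y \rVert_1\bigr] \\
\mathcal{L}(G, F, D_X, D_Y)
  &= \mathcal{L}_{\mathrm{GAN}}(G, D_Y, X, Y)
   + \mathcal{L}_{\mathrm{GAN}}(F, D_X, Y, X)
   + \lambda\, \mathcal{L}_{\mathrm{cyc}}(G, F)
\end{align}
```

Under this formulation the generators G and F minimize the combined objective while the discriminators D_X and D_Y maximize it. The cycle term is the only part that ties G and F to each other.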
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu*, Taesung Park*, Phillip Isola, Alexei A. Efros

Berkeley AI Research (BAIR) laboratory, UC Berkeley

[Figure 1 panels: Monet and Photos (Monet to photo, photo to Monet); Zebras and Horses (zebra to horse, horse to zebra); Summer and Winter (summer to winter, winter to summer); Photograph rendered in the styles of Van Gogh, Cezanne, Monet, and Ukiyo-e]

Figure 1: Given any two unordered image collections X and Y, our algorithm learns to automatically “translate” an image from one into the other and vice versa. Example application (bottom): using a collection of paintings of a famous artist, learn to render a user’s photograph into their style.

Abstract

Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data will not be available. We present an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples. Our goal is to learn a mapping G : X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss. Because this mapping is highly under-constrained, we couple it with an inverse mapping F : Y → X and introduce a cycle consistency loss to push F(G(X)) ≈ X (and vice versa). Qualitative results are presented on several tasks where paired training data does not exist, including collection style transfer, object transfiguration, season transfer, photo enhancement, etc. Quantitative comparisons against several prior methods demonstrate the superiority of our approach.
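As a rough illustration of the cycle consistency idea in the abstract, the toy sketch below measures how far F(G(x)) lands from x and how far G(F(y)) lands from y. Everything in it is hypothetical: images are flat lists of floats, G, F_good, and F_bad are hand-written affine maps standing in for trained networks, and the mean absolute (L1) difference is an assumed choice of reconstruction error. It is not the authors' implementation and involves no adversarial training.

```python
from typing import Callable, List, Sequence

# A toy "image" is a flat list of pixel intensities (an assumption for
# illustration only; real images would be multi-dimensional arrays).
Image = List[float]
Mapping = Callable[[Sequence[float]], Image]


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute difference between two equally sized toy images."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    return sum(abs(p - q) for p, q in zip(a, b)) / len(a)


def cycle_consistency_loss(
    G: Mapping,
    F: Mapping,
    xs: Sequence[Sequence[float]],
    ys: Sequence[Sequence[float]],
) -> float:
    """Average of |F(G(x)) - x| over xs plus average of |G(F(y)) - y| over ys."""
    forward = sum(l1_distance(F(G(x)), x) for x in xs) / len(xs)
    backward = sum(l1_distance(G(F(y)), y) for y in ys) / len(ys)
    return forward + backward


# Hypothetical stand-ins for trained networks.
def G(x: Sequence[float]) -> Image:
    """Toy forward mapping X -> Y: scale and shift every pixel."""
    return [2.0 * p + 1.0 for p in x]


def F_good(y: Sequence[float]) -> Image:
    """Exact inverse of G, so both cycles return to their starting image."""
    return [(p - 1.0) / 2.0 for p in y]


def F_bad(y: Sequence[float]) -> Image:
    """Undoes the scaling of G but forgets the shift, so the cycles drift."""
    return [p / 2.0 for p in y]


# Two small, unpaired collections of toy images.
xs = [[0.0, 0.5, 1.0], [0.25, 0.75, 0.5]]
ys = [[1.0, 2.0, 3.0], [1.5, 2.5, 2.0]]

loss_good = cycle_consistency_loss(G, F_good, xs, ys)
loss_bad = cycle_consistency_loss(G, F_bad, xs, ys)

print(loss_good)  # 0.0
print(loss_bad)   # 1.5
```

With these toy maps, F_good undoes G exactly and the loss is zero, while F_bad leaves a constant offset in both directions and is penalized. In the real method the same kind of penalty is applied to learned networks alongside the adversarial loss.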

1. Introduction

What did Claude Monet see as he placed his easel by the bank of the Seine near Argenteuil on a lovely spring day in 1873 (Figure 1, top-left)? A color photograph, had it been invented, may have documented a crisp blue sky and a glassy river reflecting it. Monet conveyed his impression of this same scene through wispy brush strokes and a bright palette. What if Monet had happened upon the little harbor in Cassis on a cool summer evening (Figure 1, bottom-left)? A brief stroll through a gallery of Monet paintings makes it easy to imagine how he would have rendered the scene: perhaps in pastel shades, with abrupt dabs of paint, and a somewhat flattened dynamic range.

We can imagine all this despite never having seen a side by side example of a Monet painting next to a photo of the scene he painted. Instead we have knowledge of the set of Monet paintings and of the set of landscape photographs. We can reason about the stylistic differences between these two sets, and thereby imagine what a scene might look like if we were to “translate” it from one set into the other.

* indicates equal contribution
