Dynamic Instance Normalization for Arbitrary Style Transfer

Yongcheng Jing, Xiao Liu, Yukang Ding, Xinchao Wang, Errui Ding, Mingli Song∗, Shilei Wen

Zhejiang University; Department of Computer Vision Technology (VIS), Baidu Inc.; Stevens Institute of Technology

{ycjing, brooksong}@zju.edu.cn, {liuxiao12, dingyukang, dingerrui, wenshilei}@baidu.com, xinchao.wang@stevens.edu

Abstract

Prior normalization methods rely on affine transformations to produce arbitrary image style transfers, whose parameters are computed in a pre-defined way. This manually defined nature eventually results in high-cost, shared encoders for both style and content encoding, making style transfer systems cumbersome to deploy in resource-constrained environments such as mobile terminals. In this paper, we propose a new and generalized normalization module, termed Dynamic Instance Normalization (DIN), that allows for flexible and more efficient arbitrary style transfers. Comprising an instance normalization and a dynamic convolution, DIN encodes a style image into learnable convolution parameters, upon which the content image is stylized. Unlike conventional methods that use shared complex encoders to encode content and style, the proposed DIN introduces a sophisticated style encoder, yet comes with a compact and lightweight content encoder for fast inference. Experimental results demonstrate that the proposed approach yields very encouraging results on challenging style patterns and, to the best of our knowledge, for the first time enables arbitrary style transfer using a MobileNet-based lightweight architecture, reducing computational cost by a factor of more than twenty compared with existing approaches. Furthermore, the proposed DIN provides flexible support for state-of-the-art convolutional operations, and thus enables novel functionalities, such as uniform-stroke placement for non-natural images and automatic spatial-stroke control.
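The contrast drawn in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the 1x1 kernel size, and the weight generator (global average pooling followed by two linear maps, `kernel_proj` and `bias_proj`) are assumptions chosen for brevity, and the projection matrices are random stand-ins for parameters that would normally be learned. The sketch sets an affine transform whose scale and shift are fixed style statistics beside an instance normalization followed by a convolution whose parameters are predicted from the style features.

```python
import numpy as np


def instance_norm(x, eps=1e-5):
    """Normalize each channel of a (C, H, W) feature map over its spatial axes."""
    mean = x.mean(axis=(1, 2), keepdims=True)
    var = x.var(axis=(1, 2), keepdims=True)
    return (x - mean) / np.sqrt(var + eps)


def adain(content, style, eps=1e-5):
    """Affine transform with pre-defined parameters: per-channel style std and mean."""
    gamma = style.std(axis=(1, 2), keepdims=True)
    beta = style.mean(axis=(1, 2), keepdims=True)
    return gamma * instance_norm(content, eps) + beta


def predict_dynamic_params(style, kernel_proj, bias_proj):
    """Hypothetical weight generator (an assumption, not the paper's networks)."""
    pooled = style.mean(axis=(1, 2))  # global average pooling -> (C_style,)
    channels = bias_proj.shape[0]
    kernel = (kernel_proj @ pooled).reshape(channels, channels)  # 1x1 conv kernel
    bias = bias_proj @ pooled
    return kernel, bias


def din(content, kernel, bias, eps=1e-5):
    """Instance normalization followed by a 1x1 convolution with supplied parameters."""
    normalized = instance_norm(content, eps)
    mixed = np.einsum("oc,chw->ohw", kernel, normalized)  # mix channels per pixel
    return mixed + bias[:, None, None]


rng = np.random.default_rng(0)
content = rng.normal(size=(4, 8, 8))
style = rng.normal(loc=2.0, scale=3.0, size=(4, 8, 8))

# Random stand-ins for the learned projections of the hypothetical generator.
kernel_proj = rng.normal(size=(16, 4)) * 0.1
bias_proj = rng.normal(size=(4, 4)) * 0.1

kernel, bias = predict_dynamic_params(style, kernel_proj, bias_proj)
stylized = din(content, kernel, bias)

# Special case: a diagonal kernel holding the style std, with the style mean as
# bias, reproduces the pre-defined affine transform.
gamma = style.std(axis=(1, 2))
beta = style.mean(axis=(1, 2))
same_as_adain = np.allclose(din(content, np.diag(gamma), beta), adain(content, style))
```

Under these assumptions, the pre-defined affine transform is the special case of a diagonal 1x1 kernel, which is one way to read the abstract's description of DIN as a generalized normalization module. A learned generator is free to produce non-diagonal or larger kernels.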

Introduction

Image stylization has been a long-standing research topic. It has been studied in the domain of computer graphics, or more specifically, the area of Non-Photorealistic Rendering (NPR) (Gooch and Gooch 2001; Rosin and Collomosse 2012). In the field of computer vision, image stylization is studied as a generalized problem of texture synthesis (Efros and Leung 1999). Built upon the recent progress in visual texture modelling (Gatys, Ecker, and Bethge 2015) and image reconstruction (Mahendran and Vedaldi 2015), Gatys et al. (Gatys, Ecker, and Bethge 2016) propose to exploit Convolutional Neural Networks (CNNs) to render a content image in different styles, pioneering a new field called Neural Style Transfer (NST) (Jing et al. 2019).

Figure 1 panels: (a) Content; (b) Chen et al.; (c) Huang et al.; (d) Ours (VGG); (e) Style; (f) Li et al.; (g) Sheng et al.; (h) Ours (MobileNet).

Figure 1: Existing Arbitrary-Style-Per-Model (ASPM) methods either barely transfer style to the target (Chen et al., Huang et al.), or produce distorted style patterns (Li et al., Sheng et al.) while relying on high-cost encoders. By contrast, the proposed DIN achieves superior performance using the same architecture (Ours (VGG)), and for the first time enables a much smaller lightweight network to transfer arbitrary styles (Ours (MobileNet)).

∗ Corresponding author

© 2020, Association for the Advancement of Artificial Intelligence

The inspiring work of Gatys et al. is, however, built upon an iterative image optimization in the pixel space, which turns out to be computationally expensive due to the online optimization. To address this efficiency issue, model-optimization-based NST algorithms are proposed, which optimize feed-forward models in an offline training manner. The earliest model-optimization-based NST algorithms, namely Per-Style-Per-Model (PSPM), train a separate style-specific model for each particular style, and are therefore burdensome to adopt in real-world applications (Johnson, Alahi, and Fei-Fei 2016; Ulyanov et al. 2016; Li and Wand 2016). To address this issue, Multiple-Style-Per-Model (MSPM) algorithms are proposed by incorporating multiple styles into one single model (Zhang and Dana 2017; Chen et al. 2017; Li et al. 2017b; Dumoulin, Shlens, and Kudlur 2017). Unfortunately, MSPM also suffers from

arXiv:1911.06953v1 [cs.CV] 16 Nov 2019
