Real-Time Neural Video Style Transfer: Maintaining Consistency
"Real-Time Neural Style Transfer for Videos" is a research paper on applying deep learning to style transfer for real-time video. The authors, Haozhi Huang and colleagues from Tsinghua University and Tencent AI Lab, investigate how to perform video style transfer with feed-forward convolutional neural networks (CNNs) while maintaining temporal consistency between frames.

Image style transfer research has already demonstrated the potential of deep learning; this work extends the idea to video. The authors propose a feed-forward network architecture designed to stylize video frames quickly while keeping the content and style of adjacent frames coherent. To this end, they develop a hybrid loss function that combines the content information of the input frames, the style information of a given style image, and the temporal information of consecutive frames.

A key innovation is a novel two-frame synergic training mechanism that computes a temporal loss during training, optimizing the model for consistent stylization across consecutive frames. The network must learn not only to stylize each frame on its own, but also to carry the style coherently from one frame to the next, preserving the visual flow of the video.

Unlike directly applying an existing image style transfer method frame by frame, this approach is tailored to the characteristics of video data, which is of real practical value in real-time scenarios. It lets stylized video retain its artistic effect while looking natural and coherent, which matters for video content creation, film effects, and real-time video editing.

In short, the paper provides a new deep-learning framework for video style transfer that advances both speed and consistency, opening up new possibilities for creative expression and real-time processing of video content.
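To make the hybrid loss concrete, here is a minimal PyTorch sketch of the three terms. This is an illustration of the general recipe rather than the authors' implementation: the `vgg` and `warp` helpers, the choice of feature layers, and the loss weights are all assumptions made for this example.

```python
import torch
import torch.nn.functional as F

def gram_matrix(feat):
    # feat: (B, C, H, W) -> normalized Gram matrix (B, C, C), used for style.
    b, c, h, w = feat.size()
    f = feat.view(b, c, h * w)
    return torch.bmm(f, f.transpose(1, 2)) / (c * h * w)

def hybrid_loss(out_t, out_prev, frame_t, style_feats, vgg, warp,
                alpha=1.0, beta=10.0, gamma=100.0):
    """Content + style + temporal loss for one pair of consecutive frames.

    out_t, out_prev: stylized frames t and t-1 from the feed-forward network.
    frame_t:         input frame t (the content target).
    style_feats:     precomputed feature maps of the style image.
    vgg:             frozen feature extractor returning a list of feature
                     maps (hypothetical helper).
    warp:            warps the previous output to frame t via optical flow
                     (hypothetical helper).
    alpha/beta/gamma are illustrative weights, not the paper's values.
    """
    feats_out = vgg(out_t)
    feats_in = vgg(frame_t)

    # Content loss: match features of the input frame at one chosen layer.
    content_loss = F.mse_loss(feats_out[2], feats_in[2])

    # Style loss: match Gram matrices of the style image across layers.
    style_loss = sum(F.mse_loss(gram_matrix(fo), gram_matrix(fs))
                     for fo, fs in zip(feats_out, style_feats))

    # Temporal loss: the current output should agree with the previous
    # output warped to the current frame.
    temporal_loss = F.mse_loss(out_t, warp(out_prev))

    return alpha * content_loss + beta * style_loss + gamma * temporal_loss
```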
Real-Time Neural Style Transfer for Videos
Haozhi Huang†‡∗, Hao Wang‡, Wenhan Luo‡, Lin Ma‡, Wenhao Jiang‡, Xiaolong Zhu‡, Zhifeng Li‡, Wei Liu‡∗
†Tsinghua University  ‡Tencent AI Lab
∗Correspondence: huanghz08@gmail.com, wliu@ee.columbia.edu
Abstract
Recent research endeavors have shown the potential of using feed-forward convolutional neural networks to accomplish fast style transfer for images. In this work, we take one step further to explore the possibility of exploiting a feed-forward network to perform style transfer for videos and simultaneously maintain temporal consistency among stylized video frames. Our feed-forward network is trained by enforcing the outputs of consecutive frames to be both well stylized and temporally consistent. More specifically, a hybrid loss is proposed to capitalize on the content information of input frames, the style information of a given style image, and the temporal information of consecutive frames. To calculate the temporal loss during the training stage, a novel two-frame synergic training mechanism is proposed. Compared with directly applying an existing image style transfer method to videos, our proposed method employs the trained network to yield temporally consistent stylized videos which are much more visually pleasant. In contrast to the prior video style transfer method which relies on time-consuming optimization on the fly, our method runs in real time while generating competitive visual results.
1. Introduction
Recently, great progress has been achieved by applying deep convolutional neural networks (CNNs) to image transformation tasks, where a feed-forward CNN receives an input image, possibly equipped with some auxiliary information, and transforms it into a desired output image. This kind of task includes style transfer [12, 27], semantic segmentation [19], super-resolution [12, 7], colorization [11, 31], etc.
A natural way to extend image processing techniques to videos is to perform a certain image transformation frame by frame. However, this scheme inevitably brings temporal inconsistencies and thus causes severe flicker artifacts. The second row in Fig. 1 shows an example of directly applying the feed-forward network based image style transfer method of Johnson et al. [12] to videos. It can be observed that the zoom-in content marked by white rectangles is stylized into different appearances between two consecutive frames, therefore creating flicker artifacts. The reason is that slight variations between adjacent video frames may be amplified by the frame-based feed-forward network and thus result in obviously different stylized frames. In the literature, one solution to retain temporal coherence after video transformation is to explicitly consider temporal consistency during the frame generation or optimization process [18, 1, 14, 22]. While effective, these are case-specific methods and thus cannot be easily generalized to other problems. Among them, the method of Ruder et al. [22] is specifically designed for video style transfer. However, it relies on time-consuming optimization on the fly, and takes about three minutes to process a single frame even with pre-computed optical flows. Another solution to maintaining temporal consistency is to apply post-processing [15, 2]. A draw-

Figure 1: Video style transfer without and with temporal consistency. The first row displays two consecutive input frames and a given style image. The second row shows the stylized results generated by the method of Johnson et al. [12]. The zoom-in regions in the middle show that the stylized patterns are of different appearances between the consecutive frames, which creates flicker artifacts. The third row shows the stylized results of our method, where the stylized patterns maintain the same appearance.