Learning for Video Super-Resolution through HR Optical Flow Estimation
Longguang Wang, Yulan Guo, Zaiping Lin, Xinpu Deng, and Wei An
School of Electronic Science, National University of Defense Technology
Changsha 410073, China
{wanglongguang15, yulan.guo, linzaiping, dengxinpu, anwei}@nudt.edu.cn
Abstract
Video super-resolution (SR) aims to generate a sequence
of high-resolution (HR) frames with plausible and tempo-
rally consistent details from their low-resolution (LR) coun-
terparts. The generation of accurate correspondence plays
a significant role in video SR. Traditional video SR methods have demonstrated that super-resolving images and optical flows simultaneously yields more accurate correspondences and better SR results. However, existing deep learning based methods use only LR optical flows for correspondence generation. In this paper, we propose an end-
to-end trainable video SR framework to super-resolve both
images and optical flows. Specifically, we first propose
an optical flow reconstruction network (OFRnet) to infer
HR optical flows in a coarse-to-fine manner. Then, mo-
tion compensation is performed according to the HR optical
flows. Finally, compensated LR inputs are fed to a super-
resolution network (SRnet) to generate the SR results. Ex-
tensive experiments demonstrate that HR optical flows pro-
vide more accurate correspondences than their LR coun-
terparts and improve both accuracy and consistency per-
formance. Comparative results on the Vid4 and DAVIS-
10 datasets show that our framework achieves the state-
of-the-art performance. The code will be released soon at:
https://github.com/LongguangWang/SOF-VSR-Super-Resolving-Optical-Flow-for-Video-Super-Resolution-.
1. Introduction
Super-resolution (SR) aims to generate high-resolution
(HR) images or videos from their low-resolution (LR) coun-
terparts. As a typical low-level computer vision problem,
SR has been widely investigated for decades [23, 5, 7]. Recently, the prevalence of high-definition displays has further advanced the development of SR. For single image SR, image
details are recovered using the spatial correlation in a sin-
gle frame. In contrast, inter-frame temporal correlation can
further be exploited for video SR.
Since temporal correlation is crucial to video SR, the key to success lies in accurate correspondence generation.
Figure 1. Temporal profiles under ×4 configuration for VSRnet [13], TDVSR [20] and our SOF-VSR on Calendar and City. Purple boxes represent corresponding temporal profiles. Our SOF-VSR produces finer details in temporal profiles, which are more consistent with the groundtruth.
Numerous methods [6, 19, 22] have demonstrated that the
correspondence generation and SR problems are closely in-
terrelated and can boost each other’s accuracy. Therefore,
these methods integrate the SR of both images and opti-
cal flows in a unified framework. However, current deep
learning based methods [18, 13, 35, 2, 20, 21] mainly focus
on the SR of images, and use LR optical flows to provide
correspondences. Although LR optical flows can provide
sub-pixel correspondences in LR images, their limited ac-
curacy hinders the performance improvement for video SR,
especially for scenarios with large upscaling factors.
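To make the notion of flow-based correspondence concrete, the standard compensation step is backward warping: each pixel of the reference view samples the neighboring frame at its flow-displaced position, with bilinear interpolation providing sub-pixel reads. The sketch below is our own illustration of this generic operation (function name and layout are our assumptions, not the paper's implementation):

```python
import numpy as np

def backward_warp(frame, flow):
    """Warp a grayscale `frame` (H x W) toward the reference view using a
    dense optical flow `flow` (H x W x 2, in pixels), with bilinear
    sampling. Sub-pixel flow values read between integer pixel positions,
    which is why flow accuracy directly bounds correspondence accuracy."""
    h, w = frame.shape
    ys, xs = np.mgrid[0:h, 0:w].astype(np.float64)
    # Flow-displaced sampling positions, clamped to the image border.
    x = np.clip(xs + flow[..., 0], 0, w - 1)
    y = np.clip(ys + flow[..., 1], 0, h - 1)
    x0, y0 = np.floor(x).astype(int), np.floor(y).astype(int)
    x1, y1 = np.minimum(x0 + 1, w - 1), np.minimum(y0 + 1, h - 1)
    wx, wy = x - x0, y - y0
    # Bilinear blend of the four neighboring pixels.
    top = frame[y0, x0] * (1 - wx) + frame[y0, x1] * wx
    bot = frame[y1, x0] * (1 - wx) + frame[y1, x1] * wx
    return top * (1 - wy) + bot * wy
```

With an LR flow, these sampling positions are only as precise as the LR grid allows; an HR flow supplies finer displacements, which motivates super-resolving the flow itself.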
In this paper, we propose an end-to-end trainable video
SR framework to generate both HR images and optical
flows. The SR of optical flows provides accurate correspon-
dences, which not only improves the accuracy of each HR
image, but also achieves better temporal consistency. We
first introduce an optical flow reconstruction net (OFRnet)
to reconstruct HR optical flows in a coarse-to-fine manner.
These HR optical flows are then used to perform motion
compensation on LR frames. A space-to-depth transforma-
tion is therefore used to bridge the resolution gap between
HR optical flows and LR frames. Finally, the compensated
LR frames are fed to a super-resolution net (SRnet) to gen-
erate each HR frame. Extensive evaluation is conducted
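The space-to-depth transformation mentioned above is a generic rearrangement that folds each r × r spatial block into the channel dimension, so an HR-grid tensor matches the LR spatial size. A minimal sketch of this operation (our own illustration, not the authors' code; assumes a channels-last array with spatial sizes divisible by r):

```python
import numpy as np

def space_to_depth(x, r):
    """Rearrange an (H, W, C) array into (H/r, W/r, C*r*r) by folding
    each r x r spatial block into channels. This lets HR-resolution
    quantities be stacked with LR frames of the same spatial size."""
    h, w, c = x.shape
    x = x.reshape(h // r, r, w // r, r, c)       # split into r x r blocks
    x = x.transpose(0, 2, 1, 3, 4)               # group block offsets last
    return x.reshape(h // r, w // r, c * r * r)  # fold offsets into channels
```

The operation is lossless and invertible (its inverse is the depth-to-space shuffle commonly used in SR upsampling layers).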
arXiv:1809.08573v2 [cs.CV] 25 Oct 2018