sparse matching algorithm to find matching seed pixels. We then use a propagation strategy to compute rough motion vectors for the image. Finally, we apply the Lucas–Kanade method to obtain the final motion vectors.
It is widely understood that unpredictable factors, such as blurring and noise, can cause ill-posed problems in super-resolution reconstruction.^{3,15} Because super-resolution can be translated into a search for an optimized solution,^{3,5} regularization is widely used in the optimization process. Many Tikhonov-based and total variation (TV)-based regularization methods for super-resolution^{3,16,17} have been proposed to solve such ill-posed problems. By using TV regularization, which is widely employed in denoising and deblurring,^{15,18} the ill-posed super-resolution problem becomes optimizable. This method has the advantage of preserving edges while not severely penalizing steep local gradients; it can, therefore, be reasonably employed in a wide range of applications. In addition, many regularization-based multivideo super-resolution reconstruction methods have been proposed.^{3,19} The basic steps of multivideo super-resolution involve the space–time alignment and reconstruction of multiple images. However, managing the alignment parameters of two cameras is challenging.
In this paper, we apply a robust super-resolution algorithm that solves an $\ell_1$-norm minimization comprising data-fusion and regularization terms. Whereas the data-fusion term accounts for the motion, blur, and downsampling degradation factors, the regularization term mainly serves to preserve edges.
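For orientation, the overall objective described above typically takes the form below; this is only an illustrative sketch, in which $A_k$ stands for the combined degradation (motion, blur, and downsampling) applied to the HR frame $F$, and the weight $\lambda$ and regularizer $R(\cdot)$ are generic placeholders rather than the exact terms used later in this paper:

\[
\hat{F} = \arg\min_{F} \sum_{k} \left\| A_k F - Y_k \right\|_1 + \lambda\, R(F),
\]

where the first (data-fusion) term enforces $\ell_1$ fidelity to the observed LR frames $Y_k$ and $R(F)$ is an edge-preserving regularizer such as total variation.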
The remainder of this paper is organized as follows. In
Sec. 2, we introduce the LR video observation and super-
resolution model. In Sec. 3, we present our registration
algorithm, and in Sec. 4, we introduce our super-resolution
algorithm. In Sec. 5, we describe the numerous experiments
we performed to verify our registration accuracy and the
effectiveness of the super-resolution. Our conclusions are
presented in Sec. 6.
2 Low-Resolution Video Observation and
High-Resolution Video Reconstruction Model
In the LR video observation and HR video reconstruction model, the original HR dynamic frame is denoted as $F$. It can be assumed that, after subpixel and subframe shifting, spatial blurring, downsampling, and the introduction of noise, the HR video is degraded to an LR video (Fig. 1). In model (1), $D_k$ represents the spatial decimation matrix associated with the $k$'th LR video frame, specifically a two-scale downsampling operation in the spatial domain, $D = \frac{1}{4}\begin{bmatrix} 1 & 1 \\ 1 & 1 \end{bmatrix}$. $T_k$, which is represented as a map $[v_{(i,j)}]$ indicating the motion direction and position of every pixel, is the geometric motion operator between the HR scene and the $k$'th LR frame $Y_k$; $H_k$ is the camera point spread function (PSF) model. This degradation can be expressed by^{3,20,21}
\[
Y_k = D_k H_k T_k F + n_k, \qquad \text{frame number } k = 1, \ldots, N. \tag{1}
\]
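As a concrete illustration of Eq. (1), the sketch below simulates the degradation of a single HR frame in Python; the Gaussian PSF width, the noise level, and the per-pixel translation map are illustrative assumptions rather than parameters specified by the paper, while the 2 × 2 averaging implements the decimation operator $D$ given above.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, map_coordinates

def degrade_frame(F, flow, psf_sigma=1.0, noise_std=2.0, rng=None):
    """Simulate Y_k = D_k H_k T_k F + n_k for one frame (Eq. (1)).

    F         : HR frame, 2-D float array with even height and width.
    flow      : per-pixel motion map [v_(i,j)], shape (2, H, W) holding (dy, dx).
    psf_sigma : width of the Gaussian standing in for the camera PSF H_k.
    """
    rng = np.random.default_rng() if rng is None else rng
    H, W = F.shape

    # T_k: warp the HR frame along the per-pixel motion map (geometric motion operator).
    yy, xx = np.mgrid[0:H, 0:W].astype(float)
    warped = map_coordinates(F, [yy + flow[0], xx + flow[1]], order=1, mode='nearest')

    # H_k: camera PSF, modeled here as an isotropic Gaussian blur.
    blurred = gaussian_filter(warped, sigma=psf_sigma)

    # D_k: two-scale decimation, averaging each 2x2 block (the (1/4)[[1,1],[1,1]] operator).
    decimated = 0.25 * (blurred[0::2, 0::2] + blurred[1::2, 0::2]
                        + blurred[0::2, 1::2] + blurred[1::2, 1::2])

    # n_k: additive noise.
    return decimated + rng.normal(0.0, noise_std, decimated.shape)
```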
For a camera-obtained video sequence, it is often assumed that redundant information between adjacent frames can be used to reconstruct the current frame. We rewrite the multiframe super-resolution estimator as the following minimization:^{3,5,20,21}
\[
\tilde{F} = \arg\min_{F} \sum_{k=s}^{t} \left\| D_k H_k T_k F - Y_k \right\|_p^p, \tag{2}
\]
where $1 \le p \le 2$, each $p$ corresponds to an $L_p$-norm estimator, and $p \to 1$ gives the most robust cost function. In this paper, the choice of the parameter $p$ is not our research priority; for simplicity, $p = 1$ is applied. Frames $s$ to $t$, acting as a sliding window, are used to reconstruct the current frame $F$; the sliding-window model of Ref. 11 is adopted in this paper.
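To make Eq. (2) concrete for $p = 1$, the following sketch runs a plain subgradient descent on the $\ell_1$ data term over the frames of a sliding window. The forward and adjoint operators are passed in as callables, and the step size and iteration count are arbitrary example values; the paper's actual solver also includes the regularization term discussed in Sec. 4.

```python
import numpy as np

def l1_superres(Y_frames, forward_ops, adjoint_ops, hr_shape, mu=0.05, n_iters=50):
    """Subgradient descent on sum_k ||A_k F - Y_k||_1, i.e., Eq. (2) with p = 1.

    Y_frames    : list of LR frames Y_s..Y_t inside the sliding window.
    forward_ops : list of callables; forward_ops[k](F) applies D_k H_k T_k.
    adjoint_ops : list of callables; adjoint_ops[k](r) applies (D_k H_k T_k)^T.
    hr_shape    : shape of the HR estimate F.
    """
    F = np.zeros(hr_shape)
    for _ in range(n_iters):
        grad = np.zeros(hr_shape)
        for Yk, A, At in zip(Y_frames, forward_ops, adjoint_ops):
            # Subgradient of ||A_k F - Y_k||_1 is A_k^T sign(A_k F - Y_k).
            grad += At(np.sign(A(F) - Yk))
        F -= mu * grad  # plain subgradient step; a regularization term would be added here
    return F
```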
3 Joint Propagation and Lucas–Kanade Image
Registration
3.1 Seed Selection for Propagation
The frames of a video are typically dynamic: the movement of both the camera and the objects in the scene causes differences in frame content. The camera motion between the acquisition of two digital frames of a flat scene can be approximated by an affine mapping. The apparent deformation of a planar scene is a planar homographic transform, which is smooth. Simplified local perspective effects for any scene area can therefore be modeled by a six-parameter local transform of image coordinates:^{11}
\[
\begin{bmatrix} x' \\ y' \end{bmatrix} =
\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}
\begin{bmatrix} x \\ y \end{bmatrix} +
\begin{bmatrix} \Delta x \\ \Delta y \end{bmatrix}. \tag{3}
\]
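A minimal sketch of the coordinate mapping in Eq. (3); the rotation angle and translation offsets in the usage example are arbitrary values chosen for illustration.

```python
import numpy as np

def transform_coords(x, y, theta, dx, dy):
    """Map (x, y) to (x', y') with the rotation-plus-translation of Eq. (3)."""
    c, s = np.cos(theta), np.sin(theta)
    x_new = c * x - s * y + dx
    y_new = s * x + c * y + dy
    return x_new, y_new

# Example: rotate pixel coordinates by 5 degrees and shift by (2.5, -1.0).
xs, ys = np.meshgrid(np.arange(640), np.arange(480))
xp, yp = transform_coords(xs, ys, np.deg2rad(5.0), 2.5, -1.0)
```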
The Harris and Hessian affine invariant detectors^{22,23} are two methods that normalize the six parameters of the affine transform. They first detect key points in scale space and then apply affine normalization to estimate the parameters of elliptical regions. In this paper, the Harris affine invariant detector is used for region detection. Because SIFT matching normalizes rotations, translations, and scaling, it is the only fully scale-invariant detector. Hence, it is a suitable method for finding the initial matches between two frames even when they are not adjacent or when the camera is moving during shooting.
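As an illustration of this seed-selection step, the sketch below obtains initial seed matches between two grayscale frames with OpenCV; using SIFT's own detector (rather than the Harris affine detector adopted in this paper) and a 0.7 ratio-test threshold are assumptions made for the example.

```python
import cv2

def initial_seed_matches(frame_a, frame_b, ratio=0.7):
    """Find initial matching seed pixels between two grayscale frames."""
    sift = cv2.SIFT_create()
    kps_a, desc_a = sift.detectAndCompute(frame_a, None)
    kps_b, desc_b = sift.detectAndCompute(frame_b, None)

    # Brute-force matching with Lowe's ratio test to keep only reliable seeds.
    matcher = cv2.BFMatcher(cv2.NORM_L2)
    seeds = []
    for m, n in matcher.knnMatch(desc_a, desc_b, k=2):
        if m.distance < ratio * n.distance:
            seeds.append((kps_a[m.queryIdx].pt, kps_b[m.trainIdx].pt))
    return seeds  # list of ((x_a, y_a), (x_b, y_b)) seed correspondences
```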
3.2 Propagation Scheme for Coarse Motion
Registration
After matching the image seeds, which are obtained from the original seeds selected in Sec. 3.1, the regions around
Fig. 1 Low-resolution (LR) video observation model.