上采样导向的帧率降低方法

157 浏览量更新于2024-08-26 收藏 1.9MB PDF 举报

"面向上采样的帧速率降低" 在视频处理领域，帧速率的调整是一个重要的技术环节，尤其在需要节省存储空间或传输带宽时。本文"面向上采样的帧速率降低"是一篇研究论文，由Yongbing Zhang、Haoqian Wang和Debin Zhao等人撰写，来自中国清华大学深圳研究生院和哈尔滨工业大学计算机科学系。文章探讨了如何在降低帧速率的同时，减少信息损失并提高视频质量。传统的帧速率降低方法是直接下采样，即按照固定的间隔保留原始输入序列中的帧，这种方法简单易行，但存在一个主要问题：它忽略了时间上采样过程中对插值帧的影响。当帧速率降低后，需要通过时间上采样来恢复丢失的帧，而直接下采样方法无法充分考虑这一过程，导致上采样后的插值帧质量下降，信息损失严重。论文提出了一种创新的“面向上采样的帧速率降低”方法，该方法着重于在下采样阶段就考虑到后续的上采样过程。通过在下采样时保留更多的信息，尤其是与帧间运动相关的细节，以便在上采样时进行更准确的运动补偿帧插值。这种方法旨在改善直接下采样方法中插值帧的质量，从而在降低帧速率的同时，尽可能保持视频的视觉效果。运动补偿帧插值是一种常见的视频处理技术，用于生成两帧之间的中间帧。论文中提出的方案可能会利用这种技术，通过分析相邻帧间的物体运动来预测和插入丢失的帧。通过这种方式，可以减轻由于帧速率降低而导致的视觉不连续性，使视频看起来更加平滑和自然。这篇论文提出了一个新的策略，将时间上采样过程考虑进帧速率降低的步骤中，以期优化视频压缩和传输的性能。这种面向上采样的方法有望在视频编码、流媒体服务和移动设备的视频处理中找到应用，实现高效且高质量的视频处理。

ðn

Þ. To capture the varying property of frame contents,

in MCFI the whole frame is usually divided into a number

of blocks S, and each block has a motion vector v

¼ðv

, v

with the horizontal component v

and vertical component

, respectively. And then X

ðn

Þ can be formulated as

ðn

Þ¼w

ðn

Þþw

ðn

¼ w

t1

ðn

þv

Þþw

t þ 1

ðn

þv

Þð1Þ

where w

and w

are the relative weights of the forward

predicted block P

and the backward predicted block P

and v

represent the motion vectors in the forward

and backward reference frames. For the majority cases,

þw

¼ 1 and w

¼ w

¼ 1=2. More generally, v

and v

may be any fractional numbers [17]. If the motion vectors

are of sub-pixel accuracy, Eq. (1) is applied to the

corresponding references with fractional-pixel accuracy

to yield the up-sampled signals accordingly.

When a ﬁnite impulse response (FIR) ﬁlter with 2M-tap

is used for the 2-D separate interpolation, the reference

signals with motion vectors of horizontally, vertically and

diagonally half-pixel accuracy in each prediction direction

can be yielded by

Pðn

Þ¼

u ¼M þ 1

hðuÞX

ðn

þ v

þu, n

þ v



Þð2Þ

Pðn

Þ¼

u ¼M þ 1

hðuÞX

ðn

þ v

bc, n

þ v



þuÞð3Þ

and

Pðn

Þ¼

¼M þ 1

hðu



¼M þ 1

hðu

ÞX

ðn

þ v

þu

, n

þ v



þu

ð4Þ

where

represents the operation rounded to the nearest

integer pixel position towards minus inﬁnity and hðuÞ

represents the tap coefﬁcient. The interpolated values

at the horizontal and vertical half-pixel positions are

obtained by applying a one-dimensional 2M-tap FIR

ﬁlter horizontally and vertically using Eqs. (2) and (3),

respectively. For the diagonally half-pixel position, one-

dimensional 2M-tap FIR ﬁlter needs to be performed

horizontally ﬁrstly and then vertically using Eq. (4). The

half pixels and full pixels are then utilized to interpolate

the quarter-pixels via bilinear method. Fig. 2 illustrates

the 1:2 frame rate up conversion process with horizon-

tally half-pixel accuracy in both directions when a FIR

ﬁlter h with 6-tap is used. The corresponding interpola-

tions are ﬁrst used to generate the forward and backward

prediction blocks P

and P

using Eq. (2), respectively. And

then the up-sampled pixel can be yielded by Eq. (1).

2.2. Derivation of the optimal down-sampled frame

Traditional MCFI usually tries to ﬁnd the most faithful

motion vectors for each block to be interpolated. Actually,

it is easy to observe from Eq. (1) that the quality of up-

sampled frames depends on not only the accuracy of

motion vectors but also the information contained in the

forward and backward reference frames. More informa-

tion about the frame to be interpolated embedded in the

forward and backward reference frames, up-sampled

frames with much higher quality can be obtained. To

transfer more information about the frame to be inter-

polated to the down-sampled frames, an up-sampling

oriented frame rate reduction is proposed in this subsec-

tion. Here, we will take MCFI [1] as an example to

describe the derivation of the optimal down-sampled

frame, and it can be easily extended to other frame rate

up conversion algorithms.

Deﬁne X

as the original frame in the input video at

time instance t in a vector form and the corresponding

up-sampled frame is

. For simplicity, we will take 1:2

MCFI as an example to derive the optimal solution of the

frame rate reduction problem. And of course, it can be

easily extended to arbitrary ratio MCFI. The goal of the

proposed frame rate reduction is to generate a high

quality interpolated frame while at the same time make

the down-sampled sequence faithful to the input one.

Consequently, the optimal down-sampled frame should

(

)

(

−1

)

(

−2

)

(

)

(

)

(

)

(

)

(

−1

)

(

−2

)

(

)

(

)

(

)

Full-pixel

sample

Fractional-pixel

sample

Upsampled-pixel

sample

t-1

t+1

...

Interpolation with

(

)

Fig. 2. 1:2 frame rate up conversion when v

¼1=2, 0



and v

¼ 1=2,0



Y. Zhang et al. / Signal Processing: Image Communication 28 (2013) 254–266256

剩余12页未读，继续阅读

weixin_38638309

粉丝: 3
资源: 943

上采样导向的帧率降低方法

CAN总线协议讲解

视频序列保存

2022年上半年网络工程师 综合知识

2011上半年软考网络工程师试题与答案.doc

工具变量城市供应链创新试点数据（2007-2023年）.xlsx

基于Python django-simpleui开发的博客系统详细文档+资料齐全.zip

嵌入式开发 操作系统教程 全部PPT课件 共8个章节.rar

基于Python Django教学资源管理系统网站+源码案例设计详细文档+资料齐全.zip

＜项目代码＞YOLOv8 建筑工地楼层空洞识别＜目标检测＞

【路径规划】未来搜索算法栅格地图机器人最短路径规划【含Matlab仿真 2868期】.zip

最新资源

2022年上半年网络工程师综合知识

嵌入式开发操作系统教程全部PPT课件共8个章节.rar