Cross-View Down/Up-Sampling: A New Strategy for Multiview Depth Video Coding


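"This research paper proposes a cross-view down/up-sampling (CDU) method for multiview depth video coding that uses cross-view information to assist up-sampling at the decoder. During down-sampling, odd-even interlaced extraction preserves more confident information from the original depth video; at the decoder, cross-view information is used to up-sample the reconstructed depth video. An iterative interpolation process removes the effect of compression distortion on the up-sampling. Experiments show gains of up to 3.88 dB for the algorithm, along with better quality of synthesized views. Keywords: cross-view, down/up-sampling, multiview depth video, view synthesis."

The paper targets the reduced-resolution framework for multiview depth video coding. In the down-sampling stage, odd-even interlaced extraction aims to keep as much of the key information in the original depth video as possible while lowering the resolution, so that important detail survives the reduction. At the decoder, CDU brings in cross-view information to assist depth up-sampling: depth maps from different viewpoints are correlated, and that correlation can be used to fill in and repair what compression has distorted. To limit the impact of compression artifacts, the paper proposes an iterative interpolation process that progressively corrects and refines the up-sampled result, bringing the reconstructed depth video closer to the original.

According to the experiments, CDU improves peak signal-to-noise ratio (PSNR) by up to 3.88 dB over conventional methods, and views synthesized from its output are of higher quality than those of the compared methods. This matters for free view video (FVV) and multiview viewing, where high-quality view synthesis is key to a realistic, smooth experience. By combining cross-view information with iterative interpolation, the method improves both the reconstruction quality of reduced-resolution depth video and the visual quality of synthesized views, and it has potential value for future multiview video transmission and virtual reality applications.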

IEEE SIGNAL PROCESSING LETTERS, VOL. 19, NO. 5, MAY 2012 295

Cross-View Down/Up-Sampling Method for Multiview Depth Video Coding

Qiong Liu, Member, IEEE, You Yang, Member, IEEE, Rongrong Ji, Yue Gao, and Li Yu

Abstract—In this letter, we propose a cross-view down/up-sampling (CDU) method for the framework of reduced resolution multiview depth video coding, which exploits cross-view information to assist the up-sampling at the decoder. In the down-sampling procedure of CDU, the odd-even interlaced extraction is employed to preserve more confident information of the original depth video with reduced resolution. In the decoder, the cross-view information is exploited for up-sampling the reconstructed depth video. An iterative interpolation process is proposed to eliminate the effect of compression distortion on this up-sampling. Experimental results demonstrate the gains of up to 3.88 dB for the proposed algorithm and better quality of synthesized views.

Index Terms—Cross-view, down/up-sampling, multiview depth video, view synthesis.

I. INTRODUCTION

Free view video (FVV) represented by Multi-view plus Depth (MVD) is an attractive future video application characterized by enabling users to freely select their desired viewpoint [1]. MVD is a data format consisting of multiview texture video and associated depth video. Depth video records the scene and represents the relative distance from the camera to objects in 3-D space. It is widely used in depth image-based rendering (DIBR), which can also be applied in multiple-view based object retrieval [2], [3] and video summarization [4]. With DIBR methods, an arbitrary viewpoint can be synthesized for FVV realization. Unlike other 3-D video applications, FVV usually requires a wide range of viewing angles for user interaction. Therefore, multiview depth video (MDV) is required for high quality DIBR. On top of the multiview texture videos, the huge volume of MDV causes a vital problem for data storage and transmission. High performance compression techniques for MDV are urgently needed to make FVV practical in the near future.

Manuscript received January 04, 2012; revised March 02, 2012; accepted March 02, 2012. Date of publication March 14, 2012; date of current version April 03, 2012. This work was supported by the NSFC under Grant 61170194. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Ali Bilgin.

Q. Liu and L. Yu are with the Department of Electronic and Information Engineering, Huazhong University of Science and Technology, Wuhan, China.

Y. Yang and Y. Gao are with the Department of Automation, Tsinghua University, Beijing, China (e-mail: yangyou@ieee.org).

R. Ji is with the Department of Electrical Engineering, Columbia University, New York, NY 10027 USA.

Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/LSP.2012.2190060

Fig. 1. Framework of multiview depth video coding.

Resolution reduction was recently proposed as an efficient approach for MDV compression [5], [6]. As illustrated in Fig. 1, MDVs are first down-sampled and then encoded at reduced resolution, at a much lower bit-rate than when encoded at full resolution. At the decoder side, the MDV is first reconstructed from the bitstream and then up-sampled to high resolution. Despite the bit-rate saving, down/up-sampling on MDV in this framework leads to quality losses, especially at sharp object boundaries. Therefore, an effective down/up-sampling method is needed for the reduced resolution framework of MDV coding.
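The pipeline of Fig. 1 and the boundary loss it causes can be illustrated with a minimal Python sketch. This is not the letter's codec: plain decimation stands in for the down-sampler, uniform quantization is a crude stand-in for compression distortion, and pixel replication stands in for the up-sampler. The function names, the synthetic 64x64 depth map, and the quantization step are all assumptions made for illustration.

```python
import numpy as np

def downsample(depth, factor=2):
    """Plain decimation: keep every `factor`-th pixel in both directions."""
    return depth[::factor, ::factor]

def simulate_compression(depth, step=8):
    """Uniform quantization, a crude stand-in for codec distortion (assumption)."""
    q = np.round(depth.astype(np.float64) / step) * step
    return np.clip(q, 0, 255).astype(np.uint8)

def upsample_nearest(depth, factor=2):
    """Nearest-neighbor up-sampling by pixel replication."""
    return np.repeat(np.repeat(depth, factor, axis=0), factor, axis=1)

def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two same-sized images."""
    diff = reference.astype(np.float64) - test.astype(np.float64)
    mse = np.mean(diff ** 2)
    return float("inf") if mse == 0 else float(10.0 * np.log10(peak ** 2 / mse))

def make_test_depth(size=64):
    """Synthetic depth map: a near object (200) on a far background (50).
    The object's left edge sits on an odd column, so decimation misplaces it."""
    depth = np.full((size, size), 50, dtype=np.uint8)
    depth[16:48, 15:47] = 200
    return depth

# Down-sample, "encode/decode", up-sample, then measure the loss.
depth = make_test_depth()
decoded = simulate_compression(downsample(depth))
restored = upsample_nearest(decoded)
print(f"PSNR after down/up-sampling: {psnr(depth, restored):.2f} dB")
```

If the quantization step is bypassed, the only mismatches in this toy case are the 64 pixels along the object's two vertical edges, which is the kind of boundary damage that motivates a depth-specific down/up-sampling design.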

Traditional filters, including linear and uniform up-sampling filters [7], are not designed specifically for MDV. Distortions in MDV caused by compression quantization and the up-sampling filter result in visual artifacts in DIBR, such as boundary crackles. To handle these distortions, a joint trilateral filter [8] and an adaptive bilateral filter [9] were proposed for in-loop filtering and post-filtering on reconstructed depth images. In addition, a Joint Bilateral Up-sampling (JBU) filter [10] was proposed that uses auxiliary information from high resolution images. This filter extends the concept of Gaussian smoothing by weighting the filter coefficients with their corresponding relative pixel intensities, which helps preserve edges. The texture information from the color image can be used in JBU for high quality MDV up-sampling, since each depth image has a corresponding color image in the MVD data format.
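The weighting described above can be sketched in a few lines of Python. This is a minimal, unoptimized illustration in the spirit of JBU [10], assuming a single-channel guidance image and Gaussian spatial and range kernels; the function name and the parameters `radius`, `sigma_s`, and `sigma_r` are illustrative choices, not values taken from the letter.

```python
import math
import numpy as np

def joint_bilateral_upsample(depth_lr, guide_hr, factor=2,
                             radius=2, sigma_s=1.0, sigma_r=10.0):
    """Up-sample `depth_lr` to the size of `guide_hr`.

    Each output pixel is a weighted mean of nearby low-resolution depth
    samples. The spatial weight is a Gaussian on distance measured in
    low-resolution coordinates; the range weight is a Gaussian on the
    intensity difference in the high-resolution guidance image.
    """
    H, W = guide_hr.shape
    h, w = depth_lr.shape
    guide = guide_hr.astype(np.float64)
    out = np.zeros((H, W), dtype=np.float64)
    for y in range(H):
        for x in range(W):
            yl, xl = y / factor, x / factor          # p in low-res coordinates
            cy, cx = int(round(yl)), int(round(xl))  # window center
            num = den = 0.0
            for qy in range(max(cy - radius, 0), min(cy + radius, h - 1) + 1):
                for qx in range(max(cx - radius, 0), min(cx + radius, w - 1) + 1):
                    ds2 = (qy - yl) ** 2 + (qx - xl) ** 2
                    # Guidance intensity at the full-res position of sample q.
                    gy, gx = min(qy * factor, H - 1), min(qx * factor, W - 1)
                    dr = guide[y, x] - guide[gy, gx]
                    wgt = (math.exp(-ds2 / (2.0 * sigma_s ** 2))
                           * math.exp(-dr ** 2 / (2.0 * sigma_r ** 2)))
                    num += wgt * float(depth_lr[qy, qx])
                    den += wgt
            out[y, x] = num / max(den, 1e-12)        # guard against underflow
    return out

# Toy example: a vertical step edge shared by the guidance image and the depth.
guide = np.zeros((16, 16))
guide[:, 8:] = 255
depth_hr = np.full((16, 16), 50.0)
depth_hr[:, 8:] = 200
upsampled = joint_bilateral_upsample(depth_hr[::2, ::2], guide)
```

Because the range weight is computed from the guidance image alone, low-resolution samples across an intensity edge contribute almost nothing, so a depth edge that coincides with a texture edge stays sharp. By the same mechanism, texture edges that have no counterpart in depth also shape the output.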

However, the texture-assisted JBU filter for depth images suffers from the texture copy problem. More accurate and confident depth values are needed to improve the quality of up-sampled MDV. In this letter, we propose a cross-view down/up-sampling (CDU) method for MDV coding. In this method, the confident depth values of neighbor views are preserved by odd-even interlaced extraction for down-sampling at the encoder side. These confident depth values from the neighbor view are projected to the current view to improve the quality of up-sampling at the decoder side.
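The text available here does not spell out the extraction rule, so the following Python sketch is only a toy reading of the idea, not the letter's scheme. It assumes that adjacent views keep complementary column phases (even columns in even-indexed views, odd columns in odd-indexed views) and that the neighbor's samples have already been projected into the current view. The two views are taken as identical, which sidesteps both the DIBR projection and the compression distortion that the actual method has to handle. All function names are hypothetical.

```python
import numpy as np

def interlaced_downsample(depth, view_index):
    """Keep the columns whose parity matches the view index (assumed rule)."""
    phase = view_index % 2
    return depth[:, phase::2]

def merge_cross_view(own_lr, own_view_index, neighbor_lr_projected):
    """Rebuild a full-width depth map for the current view.

    Own samples go back to their original column phase; samples from the
    neighbor view (assumed already projected into this view) fill the
    complementary phase.
    """
    if own_lr.shape != neighbor_lr_projected.shape:
        raise ValueError("both views must have the same reduced size")
    h, w = own_lr.shape
    phase = own_view_index % 2
    full = np.empty((h, 2 * w), dtype=own_lr.dtype)
    full[:, phase::2] = own_lr
    full[:, 1 - phase::2] = neighbor_lr_projected
    return full

# Idealized case: both views see the same depth map (zero disparity, no coding loss).
scene = np.arange(32, dtype=np.uint8).reshape(4, 8)
view0_lr = interlaced_downsample(scene, view_index=0)   # even columns
view1_lr = interlaced_downsample(scene, view_index=1)   # odd columns
rebuilt = merge_cross_view(view0_lr, 0, view1_lr)
print(np.array_equal(rebuilt, scene))
```

Under these idealized assumptions the two half-resolution views together carry every original sample, which is the intuition behind using neighbor views as a source of confident depth values. With real camera geometry and coded video, the projected samples are incomplete and distorted.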

II. PROPOSED ALGORITHM

In this section, we first introduce the odd-even interlaced extraction for down-sampling. Then we define the cross-view iterative interpolation for up-sampling. Finally, the convergence analysis for up-sampling is given.

A. The Odd-Even Interlaced Extraction for Down-Sampling

In the framework of depth video coding depicted in Fig. 1, a down-sampling procedure is performed before encoding. Therefore, the down-sampling in this framework should be designed

