Enhanced Multi-view Prediction and Bit Allocation
Da Liu¹, Li Wang¹, Yuncai Hao¹, Fang Yin², Chunyan Li¹, Jun Zhang¹
¹Beijing Institute of Control Engineering, Beijing, China
²Beihang University, Beijing, China
lovda@outlook.com
Abstract—In this paper, an extended dual frame motion compensation (DFMC) structure is first proposed, and the high quality frame (HQF) jump period in the extended DFMC is presented. Considering the temporal and inter-view prediction structure, the HQF locations are determined. Based on the HQF, an enhanced low quality frame (LQF) is proposed. Then, considering the HQF and the enhanced LQF, an improved inter-view prediction is proposed. Finally, bit allocation in the proposed multi-view structure is presented. Experimental results show that the proposed method achieves better performance than previous schemes.
Keywords-dual frame; motion compensation; multiview; bit
allocation
I. INTRODUCTION
Figure 1. Dual frame motion compensation.
In multi-view video coding, several cameras simultaneously capture the video streams, so multiple camera views of the same scene are created. This approach requires a large amount of data to transmit the video streams, so efficient compression techniques are essential for this kind of application. In the temporal direction, the correlation between frames within a short period of time is high, so the coding efficiency can be further improved. In the inter-view direction, all cameras capture the same scene from different viewpoints, so a large amount of inter-view statistical dependency exists. Improved temporal/inter-view motion compensation can greatly reduce the prediction error, and thus the coding efficiency is improved.
In a single view, a motion compensation method called jump update dual frame motion compensation (JU-DFMC) has been reported recently [1]-[3]. In JU-DFMC, two kinds of frames exist from the perspective of bit allocation. One has relatively higher quality and is called the high quality frame (HQF), such as the (i-1)-th frame f^{HQ}_{i-1} and the (i-N-1)-th frame f^{HQ}_{i-N-1} in Fig. 1; the other has relatively lower quality and is called the low quality frame (LQF), such as the (i+k)-th frame (k = -N, -N+1, -N+2, -N+3, ..., -2, 0, 1). For each current frame, two reference buffers are utilized for motion compensation, as shown in Fig. 1: the first reference buffer contains the most recently decoded frame, which is called the short-term reference frame (STR), and the second one contains an HQF from the past, which is named the long-term reference frame (LTR).
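To make this buffer handling concrete, the following sketch is a hypothetical Python illustration (not the paper's implementation); the jump period N and the is_hqf rule are assumptions chosen for illustration. It maintains the two reference buffers described above: the STR always slides to the most recently decoded frame, while the LTR jumps only when a new HQF is decoded.

    class DualFrameBuffers:
        """Holds the STR/LTR pair used for dual frame motion compensation."""

        def __init__(self, jump_period_n: int):
            self.n = jump_period_n   # assumed: one HQF every N+1 frames
            self.str_frame = None    # short-term reference: last decoded frame
            self.ltr_frame = None    # long-term reference: last decoded HQF

        def is_hqf(self, frame_index: int) -> bool:
            # Illustrative placement rule for the periodically inserted HQF.
            return frame_index % (self.n + 1) == 0

        def references(self):
            # Both buffers are offered to motion compensation for the current frame.
            return [ref for ref in (self.str_frame, self.ltr_frame) if ref is not None]

        def update(self, frame_index: int, decoded_frame):
            # The STR always slides to the newly decoded frame.
            self.str_frame = decoded_frame
            # The LTR jumps only when an HQF is decoded, then stays fixed
            # until the next HQF arrives.
            if self.is_hqf(frame_index):
                self.ltr_frame = decoded_frame

In an encoder loop, calling references() before coding each LQF and update() after decoding it would correspond to the dual-frame prediction of Fig. 1.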
A number of DFMC-related works have been reported. In [4], bits were unevenly allocated among frames periodically, and the frames allocated more bits were utilized as the long-term reference frames for the other frames. In further work [1], the update period of the LTR was set to ten frames, and the PSNR of the nine frames that follow the LTR frame was utilized to determine the bit allocation of the LTR. In [2], the LTR was selected with simulated annealing; the overall visual quality is better, at the cost of relatively high computational complexity. At the same time, some DFMC-related error resilience work has also been done. In [5] and [6], the recursive optimal per-pixel estimate (ROPE) algorithm was utilized in DFMC, and error propagation is restrained. In [7], a binary decision tree was utilized at the decoder to choose the reference frame, so that the LTR and STR are adaptively chosen for error concealment. The results show that this brings better performance than just using the short-term median-MV block.
In recent years, some temporal/inter-view motion compensation schemes have been reported. In [8], the problem of coding N multi-view video sequences was theoretically studied, and the impact of both inaccurate disparity compensation and the temporal GOP size K on the overall rate-distortion efficiency was discussed. In [9], a geometric prediction methodology for accurate disparity vector prediction was proposed to reduce the disparity compensation cost. In [10], an inter-view direct mode was proposed to signal to the decoder that the motion of a macroblock can be obtained from the coded view without any coding bits. In [11], the statistical dependencies from both temporal and inter-view reference pictures are exploited for motion-compensated prediction. In [12], the main view of the multi-view video is encoded using an MPEG-4 encoder and the auxiliary views are encoded by joint disparity and motion compensation. In [13], view-temporal prediction structures that are adjusted to various characteristics of general multi-view video were proposed.
In this paper, a multi-view coding structure with extended DFMC is proposed. Firstly, the extended DFMC is presented. Secondly, the HQF jump period in the temporal direction and a further adjustment for inter-view prediction are proposed. Thirdly, bit allocation in the proposed multi-view structure is determined.
The rest of the paper is organized as follows. In Section II,