分层帧视频压缩感知：时空恢复方法

166 浏览量更新于2024-08-26 1 收藏 123KB PDF 举报

"本文提出了一种基于分层帧的视频压缩感知框架，该框架通过更好地利用与参考帧的帧相关性、不同层之间帧的不等采样子比率设置以及减少错误传播来超越传统框架。它考虑了视频序列的空间和时间相关性，提出了一种基于空间-时间稀疏表示的恢复方法。当前帧和恢复的参考帧中的相似块组成一个空间-时间组，定义为稀疏表示的单位。通过对每个组的低维子空间描述进行利用，视频压缩感知恢复转化为低秩矩阵近似问题，可以通过优化算法解决。" 在视频压缩领域，压缩感知（Compressed Sensing, CS）是一种革命性的技术，它允许以低于奈奎斯特定理所规定的速率对信号进行采样，仍能重构信号。基于分层帧的视频压缩感知框架扩展了这一概念，特别关注于视频数据的时间连续性和空间冗余特性。传统视频压缩方法通常采用逐帧压缩，而新提出的框架引入了层次结构，将视频帧分为不同的层，每一层都具有不同程度的细节信息。这种分层方式有助于更有效地利用帧间的相关性，尤其是在运动物体和背景之间的变化上。通过在不同层间设置不等的采样子比率，可以更高效地处理关键帧和插值帧，从而减少带宽需求，同时保持图像质量。为了进一步提高恢复效果，文章提出了基于空间-时间相关性的恢复策略。视频序列中的相邻帧和同一帧内的相邻块往往存在相似性，这被称为空间和时间相关性。通过识别和组合这些相似块，形成空间-时间组，可以构建一个稀疏表示，即用少量非零元素来描述大量的数据。这降低了数据的复杂度，使得恢复过程更加高效。每个空间-时间组被表示为一个低秩矩阵，这是因为相似块在时间和空间上的局部变化可以用低维度的子空间来描述。将视频CS恢复问题转换为求解低秩矩阵的近似问题，可以通过矩阵分解或优化算法（如凸优化、交替方向乘子法等）来解决。这种方法可以有效地减少错误传播，提高恢复的准确性和稳定性。这篇研究论文探索了一种新颖的基于分层帧的视频压缩感知方法，它结合了空间和时间信息，优化了采样策略，并利用低秩矩阵理论来改进视频恢复的质量。这种方法对于视频压缩和传输具有重要的实际应用价值，尤其是在有限带宽条件下的高分辨率视频处理。

SPATIAL-TEMPORAL RECOVERY FOR HIERARCHICAL FRAME BASED VIDEO

COMPRESSED SENSING

Wenbin Che, Xinwei Gao, Xiaopeng Fan, Feng Jiang, Debin Zhao

Dept. of Computer Science and Technology, Harbin Institute of Technology, Harbin, China

{chewenbin, xwgao.cs, fxp, fjiang, dbzhao}@hit.edu.cn

ABSTRACT

In this paper, the hierarchical frame based video compressed

sensing (CS) framework is proposed, which outperforms

the traditional framework through the better exploitation of

frames correlation with reference frames, the unequal sample

subrates setting among frames in different layers and the re-

duction of the error propagation. By considering the spatial

and temporal correlations of the video sequence, a spatial-

temporal sparse representation based recovery is proposed

for this framework. The similar blocks in both the current

frame and these recovered reference frames are composed

as a spatial-temporal group, which is deﬁned as the unit of

the sparse representation. By exploiting the low dimensional

subspace description of each group, the video CS recovery

is converted as a low-rank matrix approximation problem,

which can be solved by exploiting the hard thresholding

and the gradient descent. Experimental results show that

the proposed method achieves better performance against

both the state-of-art still-image CS recovery algorithms and

the existing residual domain based video CS reconstruction

approaches.

Index Terms— Video compressed sensing, hierarchical

structure framework, spatial-temporal sparse representation

1. INTRODUCTION

As a new methodology of signal-sampling and recovery, com-

pressed sensing(CS) has been extensively studied in recent

years. As applied to video frames, this theory makes the sam-

pling process faster than traditional sampling methods. Sig-

niﬁcant process in video CS has been made with a single-

pixel cameras[1], based on representing a video in the Fouri-

er domain or the wavelet domain. However, video CS faces

challenges including high recovery quality at a relatively low

subrate[2]. Low subrate which makes it easier to capture

video sequences at a high speed by camera will result in a

poor recovery performance using the still-image CS recovery

This work has been supported in part by the Major State Basic Research

Development Program of China (973 Program 2015CB351804), the National

Science Foundation of China under Grant No. 61272386.

algorithms. By considering the spatial and temporal correla-

tions, it is possible to achieve a high-quality even employing

a low subrate[3]. Mun et al.[4] proposed a residual recov-

ery based on Motion Compensation(MC), which utilized the

temporal redundancy and residual sparse property in video se-

quence. Two subtrates are used in sampling stage of the resid-

ual recovery, where high subrate is adopted for key frames

and low subrate for non-key frames.

In the CS theory, the signal can be well recovered if it is

sparse enough in some domain. Mun et al.[5] cast the CS re-

construction in the base of contourlet transform or complex-

valued dual-tree wavelet transform(DWT), resulting in bet-

ter performance compared to the conventional ﬁxed domain

based recovery methods. However, it is almost impossible

to ﬁnd a universal domain in which all kinds of signals are

sparse. As an alternative to the CS reconstruction scheme,

the iterative algorithms based on non-local patches have been

proposed recently (e.g.[6, 7]). In [6], the number of nonzeros

3-D transformation coefﬁcients of a group, which is stacked

by the non-local patches, was used to measure the non-local

sparsity. Additionally, the collaborative sparsity measure was

established in [6], enforcing local smoothness and non-local

sparsity simultaneously. A group sparse representation (GSR)

modeling was further developed in [7], using the non-local

grouping technique as well. In essence, this modeling efﬁ-

ciently utilized the intrinsic low-rank property of natural im-

ages, which also exhibits the patch similarity among patch

group. Also, GSR modeling improves the performance of re-

covery over conventional ﬁxed domain based recovery meth-

ods.

In this paper, we consider the Block Compressed Sens-

ing(BCS) recovery of video sequences in which the hierarchi-

cal structure and group sparse representation based method

are used to aid the recovery process. We employ different

subrates for different layers. The 3D patch matching model-

ing, the hard thresholding and the gradient descent are also

adopted to the recovery stage. It can be found in experimen-

tal simulations that the proposed CS recovery based on hier-

archical structure outperforms the state-of-art still-image re-

covery method. Additionally, the proposed technique exceeds

the quality of residual domain based reconstruction by a large

margin.

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38723683

粉丝: 6
资源: 908

分层帧视频压缩感知：时空恢复方法

基于压缩感知的分布式视频编码框架matlab代码

基于多重假设的视频压缩感知分层重建

论文研究-TA-chord2：基于分层DHT的拓扑感知流媒体体系.pdf

一种基于分层AP的视频关键帧提取方法研究 (2016年)

基于两层压缩感知的联合频率和DOA估计方法

基于分层曲线简化的运动捕获数据关键帧提取

基于随机阵列的降维压缩感知三维成像方法 (2016年)

认知无线电网络中基于信誉的分层协作频谱感知方案

基于分层贝叶斯Lasso的稀疏ISAR成像算法：压缩感知与优化策略

基于分层B帧的多视点视频编码快速运动与视差估计算法

最新资源