基于块SW-SPIHT的可扩展分布式视频编码

193 浏览量更新于2024-08-27 收藏 248KB PDF 举报

“Scalable Distributed Video Coding based on Block SW-SPIHT”是关于分布式视频编码（DVC）的一篇学术文章，由Anhong Wang、Yao Zhao、Zhenfeng Zhu和Hao Wang共同撰写，发表在2007年的《Chinese Optics Letters》杂志上。该研究关注的是在不采用分层编码的情况下实现可伸缩性DVC方案，旨在满足当前网络通信中不断增长的新需求。分布式源编码和分布式视频编码由于其易于编码的特性而受到越来越多的关注。分布式视频编码利用编码器和解码器之间的信源相关性，降低了编码复杂度，提高了编码效率。然而，随着网络通信需求的发展，比特流的可伸缩性成为实际应用中的新焦点。该文章提出了一种新的可伸缩DVC方案，该方案继承了DVC的易编码性和鲁棒性，并同时整合了可伸缩性的特点。文章的核心是基于块级Slepian-Wolf集合分割的层次树（SW-SPIHT）方法。这是一种对Wyner-Ziv帧进行编码的方式，以生成可伸缩的比特流。Slepian-Wolf编码是一种分布式信源编码技术，它允许在没有解码器辅助信息的情况下，对相关数据源进行高效编码。SPIHT（Set Partitioning In Hierarchical Trees）是用于图像压缩的算法，以其高效和优良的视觉质量著称。通过将这两种技术结合，研究人员能够构建一个既能保持编码效率，又能适应不同带宽或质量需求的可伸缩系统。此外，文章还涉及二进制运动搜索，这是视频编码中的关键步骤，用于寻找最佳的运动矢量估计，以减少运动补偿的误差，提高视频压缩效率。二进制搜索通常比全搜索更快，但可能牺牲一些精度，不过在分布式编码环境下，它可以有效地平衡编码速度和性能。这篇论文介绍的可伸缩DVC方案结合了分布式编码的优势和可伸缩性，对于处理现代网络通信中的动态带宽分配和多质量需求问题具有重要的理论和实践意义。

336 CHINESE OPTICS LETTERS / Vol. 5, No. 6 / June 10, 2007

Scalable distributed video coding based on

block SW-SPIHT

Anhong Wang (



ËËË



)

1,2

, Yao Zha o (



)

, Zhenfeng Zhu (

ýýý¨¨¨



)

, and Hao Wang (





)

Institute of Information Science, Beijing Jiaotong University, Beijing 100044

Taiyuan University of Science and Technology, Taiyuan 030024

Received September 5, 2006

Nowadays, distributed source coding (DSC) and distributed video coding (DVC) have been receiving more

and more attention due to the distinct contributions to the easy encoding. At the same time, with more

new requirements coming forth in the current network communication, the scalability of bit stream has

been a new focus in the real applications. A scalable DVC scheme is presented without requiring layered

coding in which the main attributions of D VC, namely the capabilities of easy encoding and robustness,

are inherited remarkably and the property of scalability is also integrated simultaneously. Based on the

block Slepian-Wolf set partitioning in hierarchical trees (SW-SPIHT), the Wyner-Ziv frames are enco ded

to get the scalable bit stream. In addition, the binary motion searching is explored at the decoder with

the help of a rate-variable ‘hash’ from the encoder to improve the performance of the whole system. The

final experimental results show that our system has higher peak signal-to-noise ratio (PSNR) than the

pixel-domain DVC at the high bit rate. What is more, the scalability in signal-to-noise ratio (SNR) is also

achieved satisfactorily.

OCIS codes: 100.2000, 040.7290, 330.7310, 100.7410.

Presently, the easy encoding is required by the friendly

up-linking multimedia services. Conventional MPEG

and H.26

∗

cannot meet this need because of the complex

motion estimation at the enco der. Based on the Slepian-

Wolf

[1]

and Wyner-Ziv

[2]

theories, which have set solid

foundation for easy encoding, distributed source coding

(DSC) and distributed video coding (DVC) have shown

great potential and achieved almost the same coding

performance by exploiting dependences between sources

at the decoder. Since then, lots of related works have

been put forward. In Ref. [3], a syndrome-based PRISM

scheme was proposed. The similar scheme taken by Anne

and Girod can be referred to Ref. [4]. Based on the works

in Refs. [3,4], some improvements have been exploited as

shown in Refs. [5,6]. But unluckily, the aforementioned

strategies only show DVC’s efficiency in the view of easy

encoding and robustness without considering the scala-

bility of bit stream.

As a matter of fact, the scalability of bit stream has

been considered as a crux in many real applications,

for example, a set o f heterogeneous mobile receivers

may have various computational and display capabilities

and/or channel capacities. However, only some tentative

schemes have been propos ed for scalable DVC, such as

Refs. [7—9]. And these schemes are all built on a lay-

ered video framework, in which one standar d video cod-

ing scheme is treated as the base layer. Particularly, the

non-complete intra-frame encoding with motion estima-

tion at the base layer is still adopted, which will bring

some negative influences inevitably on the property of

easy encoding at the encoder. In addition to this, the

demerit of fragility to the lossy channel at the base layer

is distinctly obvious because of the prediction shift in mo-

tion compensation.

In this paper, we will give more considerations to the

scalable DVC and try to preserve the properties of easy

encoding and robustness. A complete intra-frame en-

coding model based on the block Slepian-Wolf set par-

titioning in hierarchical trees (SW-SPIHT) is proposed

for Wyner-Ziv frames. Similar to SPIHT, the block SW-

SPIHT is provided with the embedded bit stream. And

this embedded bit strea m can possess more flexibly trun-

cated rates than that in the layered coding. Enlightened

by Ref. [10], which has applied SW-SPIHT to distributed

hyp erspectral imagery successfully and shown better per-

formance than intra-frame SPIHT, we extend the idea of

SW-SPIHT to wavelet block and develop a block SW-

SPIHT technique. Additionally, a binary motion search-

ing (BMS) at deco der with rate-adaptive ‘hash’ is pro-

posed for block SW-SPIHT to improve the performance

of the whole system. The rate-adaptive ‘hash’ in our case

is based on some parity bits from a rate compatible chan-

nel coding, which is different from the fixed-rate ‘hash’ in

Ref. [11]. Moreover, the complete intra-frame encoding

takes on property of robustness. What we should note is

that the ‘hash’ here refers to a kind of encoding-related

information representation. Once sent to decoder, those

assistant information contained in ‘hash’ can be expected

reliably to b e great helpful for motion searching at de-

co der.

The propo sed scalable DVC for Wyner-Ziv frame is

showninFig.1,inwhichtheevenframeX

is the

Wyner-Ziv frame and the odd frames X

2i−1

and X

2i+1

act as the key frames. To the key frames, the conven-

tional SPIHT can b e used, while to the Wyner-Ziv frame,

the coding process is based on the following steps.

1) Intraframe encoding.

Step 1. Generating the wavelet blocks (WBs). The

module of ‘Generating WBs’ refers to rearranging the

wavelet coefficients to form cross-scale wavelet block

(WB) as shown in Fig. 2. That is to say, a 3-scale (it

can be extended to multi-scale easily) discrete wavelet

1671-7694/2007/060336-04

 2007 Chinese Optics Letters

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38664532

粉丝: 9
资源: 945

基于块SW-SPIHT的可扩展分布式视频编码

论文研究-Resolution-Scalable Motion Vector Coding Based on Inter-Resolution Prediction.pdf

Scalable-Distributed-Decision-Trees-in-Spark-Made-Das-Sparks-Talwalkar

AVCC sequence header

xapp1317-scalable-matrix-inverse-hls

NoC-based SoC Design

wrox professional node.js building javascript based scalable software (e-boo

<meta name="viewport" content="width=device-width, initial-scale=0, maximum-scale=0, user-scalable=yes,shrink-to-fit=no">解释下这段代码里每个属性的作用

<meta name="viewport" content="initial-scale=1.0, user-scalable=no, width=device-width">

Apache Mesos frameworks

是什么

最新资源