高效H.264/AVC转码算法：实时高清视频优化

18 浏览量更新于2024-08-26 收藏 628KB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

"本文提出了一种高效的H.264/AVC实时高清视频转码中的帧间模式决策算法，旨在降低比特率。该算法通过模式转换、模式预检测、搜索中心选择和SAD(Sum of Absolute Differences，差值绝对和)重用来提升编码效率和模式决策的准确性。实验结果显示，与UMHexagonS算法相比，该算法在运动估计上提高了0.45dB的Y-PSNR（峰值信噪比），同时减少了73%的搜索点，从而在保持画质的同时显著降低了计算复杂度。" 在H.264/AVC视频编码标准中，运动估计是至关重要的一步，它涉及到帧间的像素匹配，以确定最佳预测模式，进而减少冗余信息，达到更高的压缩效率。实时高清视频转码面临的主要挑战是如何在有限的计算资源下，实现高效的比特率降低，以适应网络传输的需求。针对这一问题，本文提出的算法做出了以下四个方面的改进： 1. **模式转换**：通过对已有编码模式的智能转换，优化编码过程，减少不必要的计算。 2. **模式预检测**：预先评估可能的预测模式，避免了对所有可能模式的全面搜索，减少了计算量。 3. **搜索中心选择**：采用更有效的搜索策略来确定运动矢量，这有助于快速找到最佳预测位置，进一步节省计算资源。 4. **SAD重用**：利用前一帧的SAD信息，减少当前帧的SAD计算，降低了计算复杂度，提高了转码速度。实验结果证明了这些策略的有效性，尤其是在运动估计方面，0.45dB的Y-PSNR提升意味着图像质量得到了改善，而搜索点的大幅度减少则显著降低了计算需求，这对于实时转码应用尤其重要。此外，由于H.264/AVC编码器在处理高清视频时的高计算负荷，这种优化对于移动和互联网应用来说是极其关键的。该算法为H.264/AVC实时高清视频转码提供了一种高效解决方案，它在保持或提升视频质量的同时，大幅降低了所需的比特率，对于解决网络带宽限制问题具有重要意义。未来的研究可以在此基础上进一步探索如何在更低的计算复杂度下，实现更高质量的视频转码，以满足不断增长的高清视频传输需求。

资源详情

资源推荐

Session-1-mm13-133

Abstract—In this paper, we propose an efficient inter frame

mode decision algorithm for H.264/AVC transcoding mainly

applied to HD video bit-rate reduction. In the proposed algorithm,

four measures including mode conversion, mode pre-detection,

search centre selection, and SAD reuse are presented for

improving the coding efficiency and mode decision accuracy. The

experimental results show that the proposed algorithm has about

0.45dB Y-PSNR enhancement and 73% less search points in

motion estimation compared with the UMHexagonS algorithm.

Index Terms—H.264/AVC, HD video transcoding, mode

decision, bit-rate reduction, motion estimation

I. I

NTRODUCTION

ITH the development of multimedia techniques,

high-definition (HD) videos have been widely used in

broadcasting and network areas. However, network bandwidth

resource is still limited especially for internet and mobile

applications, which requires higher compression and low

bit-rate for HD videos. Thus, efficient HD video transcoding

technologies for bit-rate reduction on H.264/AVC [1] have

been paid much attention.

The most straightforward way for bit-rate reduction is to

decode the video bit-stream and re-encode the reconstructed

video sequence at a new bit-rate. However, this process which

is named as complex cascaded pixel domain transcoding

(CCPDT) is quite complicated and time consuming. In the

earlier works, researchers have proposed four methods for

bit-rate reduction [2]. The first method is cutting the AC

coefficients, but discarding high-frequency coefficients would

lead to losing image details and producing blocking artifacts.

Manuscript received January 6, 2013. This work was supported Chinese

National Key Science and Technology Special Program High-quality TV Image

Display Processing Chip R&D and Small Batch Applications

(No.2013ZX01033001-002-002), Techniques for Complicated Scene

Modeling and Super-high Resolution Rendering NSFC key project

(No.61133009), Shanghai Key Laboratory of Digital Media Processing and

Transmission (STCSM 12DZ2272600), National Natural Science Foundation

of China (61221001), the 111 Project (B07022) and the Shanghai Key

Laboratory of Digital Media Processing and Transmissions.

Sai Yin, Xiaoyun Zhang, Zhiyong Gao, and Yingqi Chen are with the

Institute of Image Communication and Network Engineering, Shanghai Key

Laboratory of Digital Media Processing and Transmission, Shanghai Jiao Tong

University, Shanghai 200240, China. (E-mail: saiyin@sjtu.edu.cn,

xiaoyun.zhang@sjtu.edu.cn, zhiyong.gao@sjtu.edu.cn, clenny@163.com).

The second method is to increase quantization steps, this

method is easy to adjust the video bit streams to an appropriate

bit-rate but cause a picture drift error accumulation [3]. The

third bit-stream scaling method is executed by re-encoding the

reconstructed pictures with motion vectors and coding decision

modes extracted from the original high-quality bit-stream, this

algorithm is referred to as simple cascade pixel domain

transcoding (SCPDT) algorithm. The last one is re-encoding

the reconstructed pictures with motion vectors extracted from

the original high quality bit-stream, but new coding decisions

are needed to be computed based on reconstructed pictures.

The third and the forth methods effectively accelerate the

transcoding process, but they have a severe degradation in the

coding performance due to the motion vectors and modes

mismatched problems. To reduce the computational

complexity while preserving the video quality simultaneously,

in this paper, we propose an efficient inter frame mode decision

algorithm for H.264/AVC transcoding, which is realized by

selectively reusing the original information which is obtained

from the decoding process and adding several fast motion

estimation schemes in the re-encoding process. The

experimental results indicate that the proposed algorithm can

maintain the coding performance with much lower

computational cost.

H.264/AVC supports seven macroblock partitions including

16x16, 16x8, 8x16, 8x8, 8x4, 4x8, and 4x4. Despite of I slice

and B slice, there are five macroblock types and four

sub-macroblock types including the skip mode and other types

in accordance with the partition size mentioned above. These

macroblock partitions are divided into two levels. The first

level L1 includes modes of 16x16, 16x8, 8x16, 8x8, and the

skip mode; while the second level L2 includes modes of 8x8,

8x4, 4x8, and 4x4.

During the process of transcoding for bit-rate reduction,

quantization parameter (QP) influences the coding

characteristic significantly. A non-homogeneous region with a

small QP can be re-encoded to be a homogeneous one by

increasing the quantization step size [4]. Usually, in the process

of low bit-rate encoding, the adoption of modes of level L2 just

increases a negligible compression rate while consumes lots of

computation time, which is particularly prominent in HD

sequences. Considering the coding efficiency, our re-encoding

process in HD videos does not contain the macroblock modes

An Efficient Mode Decision Algorithm for

Real-Time High-Deﬁnition H.264/AVC

Transcoding

Sai Yin, Xiaoyun Zhang, Zhiyong Gao, and Yingqi Chen

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38499553

粉丝: 11
资源: 904

高效H.264/AVC转码算法：实时高清视频优化

H.264/avc经典教程

新一代视频压缩编码标准-H.264_AVC(第二版).pdf

H.264/AVC、H.265/HEVC、VP8、VP9、AV1的对比

h.265/hevc:视频编码新标准及其扩展 pdf

H.264/265的码流格式

使用 H.264/AVC 压缩格式对视频流进行压缩，请详细写一下流程以及具体代码实现

NAL单元\和NALU单元

h264 NALU 全程是什么

什么是avc layer

常用的视频传输标准都有哪些

h264的profile是怎么组成的

怎么学习H264编码标准

视频视频格式解码最快

h265的VPS\PPS\SPS

H.264图像压缩算法是啥

H.264、H.265、H.266

列举视频流压缩格式，并分析各个格式的优劣

H.265 H.264基础知识

H.264、H.265、smartH.265

最新资源