SLIC超像素与JND高斯滤波：HEVC视频预处理新策略

65 浏览量更新于2024-08-29 收藏 316KB PDF 举报

"基于JND的超像素高斯滤波的视频预处理" 本文提出了一种针对高效视频编码（HEVC）的预处理技术，利用了人类视觉系统的感知特性（Just Noticeable Difference, JND）并结合超像素分割与高斯滤波。此方法旨在在不降低视觉质量的前提下，最大程度地减少视频的比特率，从而实现更高效的视频压缩。首先，文章介绍了一种名为简单线性迭代聚类（Simple Linear Iterative Clustering, SLIC）的超像素分割算法。SLIC是一种改进的k均值聚类方法，能够快速地将图像像素分组成具有相似颜色和纹理特征的超像素区域。这种方法使得处理过程中能以更粗粒度的超像素进行操作，而非原始像素，从而降低了计算复杂度，同时保持了图像的结构信息。接着，文章探讨了如何根据JND理论来确定适合的高斯滤波器参数。JND是视觉感知理论中的一个重要概念，它描述了人眼能够察觉到的最小变化。在视频预处理中，通过计算超像素内各像素点的亮度差异的加权平均值，可以估计出对人眼来说几乎不可察觉的变化。以此为基础，可以调整高斯滤波器的参数，使得滤波后的视频在视觉上保持原有的质量，同时尽可能地减少冗余信息，进而降低比特率。实验结果显示，采用这种基于JND的超像素高斯滤波方法进行预处理，可以在不损害视觉质量的前提下，使视频比特率最高降低29%。这表明该方法在HEVC编码标准下，能显著提升视频压缩效率，对实际的视频编码应用有着重要的价值。关键词：HEVC，JND，超像素，视频预处理，高斯滤波该研究结合了超像素分割、JND理论以及高斯滤波技术，为HEVC视频编码提供了一种有效的预处理手段，通过优化参数，能够在保留视觉质量的同时，大幅降低视频数据的传输和存储需求。这对于高清晰度、高带宽要求的视频应用，如在线视频流、高清电视和远程监控等，都具有显著的实用意义。

Video pre-processing with JND-based Gaussian filtering of

superpixels

Lei Ding, Ge Li*, Ronggang Wang, Wenmin Wang

School of Electronic and Computer Engineering, Shenzhen Graduate School, Peking University

ABSTRACT

In this paper, an innovative method of HEVC video pre-processing is proposed. The method applies a simple linear

iterative clustering (SLIC), which adapts a k-means clustering to group pixels into perceptually meaningful atomic

regions of superpixels. By calculating the average of weighted average of luminance differences around each pixel in the

superpixel, a suitable parameter of Gaussian filter for the superpixel is determined. Experimental results show that bit

rate can be reduced up to 29% without loss in visual quality.

Keywords: HEVC, JND, superpixel, video pre-processing, Gaussian filtering

1. INTRODUCTION

The recently developed HEVC [1], high efficiency video coding standard, is becoming more and more popular. Its

improved compression performance relative to the existing standard is in the range of 50% bit rate reduction.The human

eyes perceive images through the human visual system (HVS), which provides a possibility to get a higher video

compression ratio. Extensive research has been conducted to improve the performance of encoder conformable with

standards.

Video pre-processing [2] can improve the subjective quality of a reconstructed video or reduce the bit rate in the

generation of a compressed bit stream. The usual video pre-processing adopted video data spatial filtering, temporal

filtering and image sharpening [3]. By applying a visual perception threshold (PTHD) with just noticeable distortion

(JND), one can achieve a compression gain up to 10% to 15% by exploiting video data perceptual redundancy [4-6].The

Gaussian filter has been widely used for de-noising in image processing. By applying Gaussian filtering, the new value

of pixel (x, y) is the weighted average of the pixels around itself. Gaussian filtering makes the image smoother, which

means the deviation of the pixels in a coding block will become smaller, and thus will reduce the bit rate in the process

of motion estimation, transformation, scaling and quantization. Smooth areas can withstand strong filtering without

being noticed, while edge areas or textured areas will be blurred, thus these areas should be filtered slightly or not at all.

The superpixel is an area with similar texture, contour, colour, etc. Superpixel algorithms group pixels into perceptually

meaningful atomic regions. They capture image redundancy and provide a convenient primitive from which to compute

image features. Algorithms for generating superpixels can be broadly categorized as either graph-based or gradient

ascent methods. Graph-based approaches to superpixel generation treat each pixel as a node in a graph. Edge weights

between two nodes are proportional to the similarity between neighbouring pixels. The superpixels are created by

minimizing a cost function defined over the graph. Whereas the gradient-ascent-based methods start from a rough initial

clustering of pixels, iteratively refine the cluster until some convergence criterion is met to form superpixels. SLIC [7], a

state-of-the-art method for generating superpixels based on K-means clustering, has been shown to outperform existing

superpixel methods.

Human eyes cannot perceive any changes below the JND threshold of around a pixel due to their underlying

spatial/temporal masking properties [8]. The sensitivity of distortion by human eyes can vary significantly in different

areas of a frame, upon which the JND model is set. The major factors that contribute to the JND model are spatial

contrast sensitivity function, luminance adaption [9-10] etc. These factors reflect the texture, edge and boundary of the

frame, and the frame can be filtered according to these factors.

Based on the related works above, we present a new approach of incorporating SLIC and JND for video pre-processing

in order to reduce the bit rate without loss of visual quality. Subjective and objective evaluation is carried out to verify

the effectiveness of the proposed approach.

Visual Information Processing and Communication VI, edited by Amir Said, Onur G. Guleryuz,

Robert L. Stevenson, Proc. of SPIE-IS&T Electronic Imaging, SPIE Vol. 9410, 941004

Proc. of SPIE-IS&T Vol. 9410 941004-1

Downloaded From: http://proceedings.spiedigitallibrary.org/ on 04/24/2015 Terms of Use: http://spiedl.org/terms

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38531017

粉丝: 8

SLIC超像素与JND高斯滤波：HEVC视频预处理新策略

jnd777.github.io

JND-Pano：用于JPEG压缩全景图像的明显差异的数据库

利用JND进行图像增强Python

利用JND进行图像增强Python实现：读图像，计算JND，计算灰度映射函数，得到灰度处理的图像，显示图像A，B

如何使用matlab计算JND

matlab计算JND

JND损失

怎么理解和实现jnd中的回调函数，注册回调，回调函数执行的流程请举例说明

在设计无刷直流电机的电子调速器时，如何正确选择MOSFET，并确保其在驱动电路中的可靠工作？

如何理解子网掩码在计算机网络中的作用，并通过一个实例来展示其如何影响IP地址的网络部分和主机部分划分？

最新资源