L. Du, A.T.S. Ho and R. Cong Signal Processing: Image Communication 81 (2020) 115713
Fig. 4. Examples of tampering localization results for the QFT based method proposed
in [34]: (a) source image; (b) normalized reconstructed image corresponding to (a);
(c) tampered image with attack; (d) normalized reconstructed image corresponding to
(c); (e) binary map; (f) detected tampered region.
defined in polar coordinates, in which $f_r$, $f_g$ and $f_b$ represent the red, green
and blue channels of the image $f(x, y)$, respectively. They first defined
a quaternion image that combined multiple features and reconstructed
a stable image by quaternion low-pass filter. After obtaining the stable
image, they divided the image into nonoverlapping blocks and used
a local binary coding to represent each block. For tamper detection,
a multiscale difference map fusion approach was investigated to fuse
difference maps, resulting from the analysis of the subtraction between
two binary maps with different sliding windows.
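The block-wise coding step above can be sketched as follows. This is only a minimal illustration of binarizing nonoverlapping blocks; the block size and the mean-threshold rule are assumptions of ours, not the exact local binary coding used in [34].

```python
import numpy as np

def block_binary_codes(img, block=8):
    """Split an image into nonoverlapping blocks and binarize each block
    against its own mean gray value (an assumed thresholding rule,
    standing in for the paper's local binary coding)."""
    h, w = img.shape
    h, w = h - h % block, w - w % block   # drop ragged edges
    codes = []
    for r in range(0, h, block):
        for c in range(0, w, block):
            b = img[r:r + block, c:c + block].astype(float)
            codes.append((b >= b.mean()).astype(np.uint8))
    return codes
```

Subtracting the binary maps of two images block by block would then yield the difference maps that the fusion step operates on.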
Remarks. These approaches mainly depend on the properties of the
applied transforms: the input image undergoes a frequency transformation
so that the extracted features depend on the values of its frequency
coefficients in the transform space. Currently, most features are robust
against only one or a few types of attacks, and it may not be feasible
to extract a single, absolutely robust feature that satisfies all
scenarios. It is worth mentioning that image feature hash construction
via QFT is an effective way to fulfill the requirement of processing
different features in a holistic manner. A brief summary of invariant
feature transform based methods is presented in Table 3.
3.2. Local feature points based methods
Local feature patterns are a group of important robust features for
generating image hashes. Local feature patterns usually include edges,
corners, blobs, salient regions and so on. Since image hashes should be
invariant to content-preserving processing, robust and repeatable features
with low computational cost are desired.
Yang et al. [38] proposed content based image hashing using companding
and Gray code. Morlet wavelet coefficients were used at feature
points to generate robust image features. Then, they combined a robust
feature point detector and a robust content singularity descriptor at these
feature points. Finally, the Morlet wavelet coefficients were quantized
and coded using companding and Gray code. The Morlet wavelet is a
continuous wavelet, a single-frequency complex sinusoid modulated by a
Gaussian envelope, and is used to detect linear structures perpendicular
to the orientation of the wavelet. The 2D Morlet wavelet is defined as
\[
\varphi_M(R) = \left(e^{iV_0 \cdot R} - e^{-\frac{1}{2} V_0^2}\right) e^{-\frac{1}{2} R^2}
\tag{14}
\]
where $R = (r_1, r_2)$ denotes the two-dimensional spatial coordinates, and
$V_0 = (v_1, v_2)$ is the wave-vector of the mother wavelet. Liu et al. [39] proposed
a SIFT operator based hash algorithm, which was mainly focused
on the robustness against geometric attacks. For decision making, a
generalized set distance based matching operation was designed. Then,
Lv et al. [40] proposed a novel shape contexts based image hashing
approach using robust local feature points. They first used scale in-
variant feature transform (SIFT) to detect robust feature points and
incorporated the Harris criterion to select the most stable points. To
characterize local information, they introduced the shape contexts into
hash generation to represent the geometric distribution of the detected
feature points. They used a descriptor to represent these feature points
as a unique signature. Local extremum search is performed on a series
of difference-of-Gaussian (DOG) images in the scale space 𝜎, and local
feature points are obtained as candidate points for scale-invariant key
points.
The construction of the DOG is as follows. The image $I(x, y)$ is first
convolved with a series of Gaussian kernel functions $G(x, y, \sigma)$ with
continuously increasing scales $\sigma_1 < \sigma_2 < \cdots < \sigma_n$:
\[
L(x, y, \sigma) = G(x, y, \sigma) * I(x, y)
\tag{15}
\]
Then, a DOG image is generated from two Gaussian-blurred images at nearby
scales $c\sigma$ and $\sigma$ as
\[
D(x, y, \sigma) = L(x, y, c\sigma) - L(x, y, \sigma)
= (G(x, y, c\sigma) - G(x, y, \sigma)) * I(x, y)
\tag{16}
\]
It provides a close approximation of the scale-normalized Laplacian of
Gaussian:
\[
G(x, y, c\sigma) - G(x, y, \sigma) \approx (c - 1)\sigma^2 \nabla^2 G
\tag{17}
\]
Substituting (17) into (16) and using the properties of convolution, we
obtain
\[
D(x, y, \sigma) \approx (c - 1)\sigma^2 \nabla^2 G * I(x, y)
= (c - 1)\sigma^2 G * \nabla^2 I(x, y)
\tag{18}
\]
where $\nabla^2 = \partial^2/\partial x^2 + \partial^2/\partial y^2$ is the Laplacian operator.
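The DOG construction of Eqs. (15)–(16) can be sketched as follows. This is a minimal illustration; the kernel truncation radius, the separable implementation, and the scale factor $c = \sqrt{2}$ are our own assumptions.

```python
import numpy as np

def gaussian_kernel(sigma):
    """1D Gaussian kernel, truncated at ~3*sigma and normalized to sum to 1."""
    radius = int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x ** 2 / (2 * sigma ** 2))
    return k / k.sum()

def gaussian_blur(img, sigma):
    """Separable 2D Gaussian convolution: L(x, y, sigma) = G * I (Eq. 15)."""
    k = gaussian_kernel(sigma)
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, mode="same"), 1, img)
    return np.apply_along_axis(lambda c: np.convolve(c, k, mode="same"), 0, rows)

def difference_of_gaussians(img, sigma, c=np.sqrt(2)):
    """D(x, y, sigma) = L(x, y, c*sigma) - L(x, y, sigma) (Eq. 16)."""
    return gaussian_blur(img, c * sigma) - gaussian_blur(img, sigma)
```

Local extrema of $D$ over both space and the scale index are then kept as candidate key points.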
Wang et al. [41] proposed an image forensic signature for content
authenticity analysis. In the proposed method, adaptive Harris corner
detection algorithm was used to extract image feature points. Corner
detection is a method used in computer vision systems to extract
specific types of features and infer image content. Among the various corner
detection methods, a typical algorithm is the Harris operator. Denote an
image as $I$, and let $I_x$ and $I_y$ represent the gradients of the image
gray value in the horizontal and vertical directions, respectively. The
discrete two-dimensional zero-mean Gaussian kernel function is
\[
G(\sigma) = \frac{1}{2\pi\sigma^2} \exp\left(-\frac{x^2 + y^2}{2\sigma^2}\right)
\tag{19}
\]
Let $A = G(\sigma) \otimes I_x^2$, $B = G(\sigma) \otimes I_y^2$, and
$C = D = G(\sigma) \otimes I_x I_y$. The Harris corner detection function is
\[
R = A \times B - C \times D - c \cdot (A + B)^2
\tag{20}
\]
Here, $\sigma$ is a scale parameter, $\otimes$ denotes the convolution operator, and $c$ is a constant.
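The Harris response of Eq. (20) can be sketched as follows. This is a minimal illustration in which the gradient operator (central differences via `np.gradient`) and the constant $c = 0.04$ are assumed choices, not necessarily those of the adaptive detector in [41].

```python
import numpy as np

def _gaussian_blur(img, sigma):
    """Separable Gaussian smoothing, standing in for G(sigma) ⊗ (·)."""
    radius = int(3 * sigma + 0.5)
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x ** 2 / (2 * sigma ** 2))
    k /= k.sum()
    rows = np.apply_along_axis(lambda r: np.convolve(r, k, mode="same"), 1, img)
    return np.apply_along_axis(lambda col: np.convolve(col, k, mode="same"), 0, rows)

def harris_response(img, sigma=1.0, c=0.04):
    """Harris response R = A*B - C*D - c*(A+B)^2 per Eq. (20)."""
    Iy, Ix = np.gradient(img.astype(float))   # gray-value gradients
    A = _gaussian_blur(Ix * Ix, sigma)        # A = G(sigma) ⊗ Ix^2
    B = _gaussian_blur(Iy * Iy, sigma)        # B = G(sigma) ⊗ Iy^2
    C = _gaussian_blur(Ix * Iy, sigma)        # C = D = G(sigma) ⊗ IxIy
    return A * B - C * C - c * (A + B) ** 2

# A bright square on a dark background: the response peaks near the
# square's corners and vanishes in flat regions.
img = np.zeros((20, 20))
img[8:14, 8:14] = 1.0
R = harris_response(img)
```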
For each feature point, they defined a circular feature point neigh-
borhood, and computed the mean and variance of the vector con-
structed by pixel gray value in the neighborhood. These statistics of
feature point neighborhood, together with the position coordinates
of feature point, were used to construct forensic signature by using
Huffman coding. By using the Fisher criterion, it provided an adaptive
method to generate the signature matching threshold value. However,
just as with other feature point based image hashing methods, the hash size
depended on the image size and texture.
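The neighborhood statistics step of [41] can be sketched as follows. The circular radius and the handling of image borders are assumptions of ours; only the mean and variance of the neighborhood gray values come from the description above.

```python
import numpy as np

def neighborhood_stats(img, point, radius=5):
    """Mean and variance of the gray values inside a circular neighborhood
    around a feature point (radius is an assumed parameter; pixels outside
    the image are simply excluded by the mask)."""
    y0, x0 = point
    ys, xs = np.ogrid[:img.shape[0], :img.shape[1]]
    mask = (ys - y0) ** 2 + (xs - x0) ** 2 <= radius ** 2
    values = img[mask].astype(float)
    return values.mean(), values.var()
```

These statistics, together with the point coordinates, would then be packed into the forensic signature via Huffman coding.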
Image hashing using feature points has limitations under large-scale
distortions such as additive noise and blurring, because the key points
detected from the distorted image are not exactly the same as those of the
original image. In order to address these limitations, Yan et al. [42,43] proposed
a multi-scale image hashing method by using the location-context in-
formation of the features generated by adaptive local feature extraction
techniques. Firstly, they produced multiple content-preserving attacked
images, and then extracted SIFT features. The SIFT feature points
were matched with the corresponding feature points extracted from
the host image by a matching algorithm. Thus, the adaptive feature
points together with their corresponding descriptor were generated,
which were more robust for hash generation. Finally, the Round