双向非局部模型提升图像去噪效果：探索列行间相似性

194 浏览量更新于2024-08-27 收藏 964KB PDF 举报

图像去噪是一项关键的计算机视觉任务，其目标是从含有噪声的数据中恢复清晰的图像。传统的去噪方法往往基于局部统计，然而，自然图像中的非局部相似性提供了更强大的去噪能力。本文探讨的"图像去噪的双向非局部模型"（Two-Direction Nonlocal Model，简称TDNL）正是利用这一特性，旨在提升去噪效果。在TDNL模型中，作者指出，当一组相似的图像块被组织成矩阵时，不仅列间存在相似性，行间也有。这种双方向的非局部性意味着模型能够同时考虑上下文信息，不仅依赖于像素与其周围邻域的关系，还考虑到整个图像中相似区域的全局信息。这种策略使得模型能够捕捉到更广泛的图像结构和纹理模式，有助于减少噪声的影响。模型的解决方案由三部分组成：首先，是对原始图像的一个缩放版本进行处理，这可能是为了适应不同的尺度和分辨率；其次，通过分析列间的相似性，模型计算每个补丁的非局部均值类估计，这一步使用了一组聚类系数而非简单的成对相似性，从而更准确地估计像素的噪声背景；最后，行间的相似性被用来获取类似补丁中心像素的非局部自回归估计，进一步增强了去噪的效果。相比于传统的单向非局部模型，TDNL模型引入了额外的维度，这要求一种创新的最小化算法来求解优化问题。实验结果表明，TDNL模型在图像去噪性能上达到了相当高的水平，甚至可以与当前最先进的降噪方法相媲美。这种方法的优点在于它能更好地保留图像细节，减少失真，并且在处理彩色图像时也能展现出色的性能，如颜色平面插值、最优恢复和颜色差异的方差等技术在其中起到了辅助作用。参考文献中引用的研究，如Gunturk et al. (2002)关于交替投影的彩色平面插值，Muresan and Parks (2005)的最优恢复方法，Li (2005)的逐步逼近法，以及Chung和Chan (2006)基于颜色差方差的彩色去马赛克，都是研究者在探索图像处理领域内非局部性的不同应用。这些工作为TDNL模型提供了理论基础和技术支撑，证明了非局部性在图像处理任务中的显著价值。图像去噪的双向非局部模型是利用自然图像的非局部相似性进行深度学习和优化的一种创新方法，它在提升去噪效果、保持图像细节和增强整体一致性方面展现出强大潜力。

408 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 22, NO. 1, JANUARY 2013

[17] B. K. Gunturk, Y. Altunbask, and R. M. Mersereau, “Color plane

interpolation using alternating projections,” IEEE Trans. Image Process.,

vol. 11, no. 9, pp. 997–1013, Sep. 2002.

[18] D. D. Muresan and T. W. Parks, “Demosaicing using optimal recovery,”

IEEE Trans. Image Process., vol. 14, no. 2, pp. 267–278, Feb. 2005.

[19] X. Li, “Demosaicing by successive approximations,” IEEE Trans. Image

Process., vol. 14, no. 3, pp. 370–379, Mar. 2005.

[20] K.-H. Chung and Y.-H. Chan, “Color demosaicing using variance of

color differences,” IEEE Trans. Image Process., vol. 15, no. 10, pp.

2944–2955, Oct. 2006.

Two-Direction Nonlocal Model for Image Denoising

Xuande Zhang, Xiangchu Feng, and Weiwei Wang

Abstract—Similarities inherent in natural images have been

widely exploited for image denoising and other applications. In

fact, if a cluster of similar image patches is rearranged into a

matrix, similarities exist both between columns and rows. Using

the similarities, we present a two-directional nonlocal (TDNL)

variational model for image denoising. The solution of our model

consists of three components: one component is a scaled version

of the original observed image and the other two components

are obtained by utilizing the similarities. Speciﬁcally, by using

the similarity between columns, we get a nonlocal-means-like

estimation of the patch with consideration to all similar patches,

while the weights are not the pairwise similarities but a set of

clusterwise coefﬁcients. Moreover, by using the similarity between

rows, we also get nonlocal-autoregression-like estimations for the

center pixels of the similar patches. The TDNL model leads to

an alternative minimization algorithm. Experiments indicate that

the model can perform on par with or better than the state-of-

the-art denoising methods.

Index Terms—Image denoising, similarity, two-direction non-

local model.

I. INTRODUCTION

Denoising is a fundamental and widely studied problem in image

processing. Various denoising methods have been proposed following

different disciplines such as statistics, variational theory, etc. Most of

these methods exploit the local correlation of image pixels. Recently,

the introduction of NLM opens the ﬂoodgate to the exploitation of

nonlocal similarities inherent in natural images for denoising and

other applications [1], [2]. The NLM estimates each pixel by the

weighted average of many pixels in the image, and the weights

are respectively evaluated according to pair-wise similarity between

two patches. The advantage of NLM is that it greatly reduces the

interference of noise and well preserves the details such as edges

and textures in the denoised image.

Manuscript received September 9, 2011; revised June 20, 2012; accepted

August 1, 2012. Date of publication September 19, 2012; date of current

version December 20, 2012. This work was supported by the National

Science Foundation of China under Grant 61001156, Grant 61105011, Grant

11101292, and Grant 60872138. The associate editor coordinating the review

of this manuscript and approving it for publication was Prof. Sina Farsiu.

X. Zhang is with the Department of Applied Mathematics, School of

Science, Xidian University, Xi’an 710071, China, and also with the School of

Mathematics and Computer Science, Ningxia University, Yinchuan 750021,

China (e-mail: love_truth@126.com).

X. Feng and W. Wang are with the Department of Applied Mathemat-

ics, School of Science, Xidian University, Xi’an 710071, China (e-mail:

xcfeng@mail.xidian.edu.cn; wwwang@mail.xidian.edu.cn).

Color versions of one or more of the ﬁgures in this paper are available

online at http://ieeexplore.ieee.org.

Digital Object Identiﬁer 10.1109/TIP.2012.2214043

Another line of work developed in recent years is built on sparse

representation. As early as in wavelet era, it is recognized that natural

image has a sparse representation in wavelet basis and its directional

extensions, including curvelet, contourlet and bandelet [3], while

noise does not. This discriminative property is exploited in various

wavelet shrinkage denoising methods. To alleviate the problems

caused by using the ﬁxed transformation, several authors propose

to use learned dictionary, which is data adaptive and can characterize

image structures more efﬁciently. Elad et al. [4], [5] introduced the

K-SVD algorithm to learn the overcomplete dictionary for image

representation and denoising. Zhang et al. [6] presented an adaptive

shrinkage algorithm based on locally learned principle component

analysis (PCA) basis for image denoising. All these patch-based,

adaptive learning methods show promising denoising performance.

All the methods mentioned above use certain prior information

of the given data. For example, NLM exploits similarity inherent

in images, wavelet shrinkage and K-SVD utilize sparsity of images

in certain domain. By combining the nonlocal similarity and the

sparsity, Dabov et al. [7] presented a block matching based, two-stage

3D ﬁltering algorithm (BM3D) for image denoisng, which achieves

so far the best denoising performance. In this algorithm, similar

patches are stacked into a 3D array, and the array is transformed

into wavelet domain by a 3D separable wavelet transform, then a

ﬁltering is performed in the wavelet domain. The employment of Haar

wavelet on the third dimension contributes much to the denoising

performance of BM3D. Note that the Haar wavelet has only a zero-

order vanishing moment, which implies BM3D implicitly assumes

that similar patches have similar sparse representation. Explicitly

using this assumption, Mairal et al. [8] proposed a mixed-sparse

model with learned redundant dictionaries for image denoising and

color demosaicking. Dong et al. [9] proposed a centralized sparse

model with appropriately learned and chosen principle component

analysis (PCA) basis for image denoising. The latter two methods

achieve comparable denoising performances with BM3D.

In [10] and [11], the authors studied the performance bounds on

image denoising from an estimation theory perspective and provided

the fundamental limits of the problem. In [12], the same authors

proposed the patch-based locally optimal wiener ﬁlter (PLOW)

for image denoising with motivation to achieve the near optimal

performance. This method uses both geometrically and photomet-

rically similar patches to estimate the ﬁlter parameters and achieves

equal or slightly better performance than BM3D.

In this brief, we provide a different point of view. Similar to the

work in [7]–[9], for each patch centered at the pixel in consideration,

a cluster of similar patches is collected and rearranged into a matrix.

Then similarity between both columns and rows of the matrix will

be exploited in different ways. Speciﬁcally, the similarity between

columns is actually that between patches. We will design an NLM-

like ﬁltering to get a denoised patch from all the columns. Different

from NLM, the weights will be jointly obtained by solving a

minimization problem involving all the similar patches. In this sense,

we say the weights are cluster-wise while the weights of NLM are

pair-wise similarities. Motivated by the assumption in [13], here we

assume that, the central pixel in each patch can be linearly represented

by neighboring pixels through autoregression, and furthermore, the

central pixels of all similar patches, corresponding to a row in the

matrix, have the same AR coefﬁcients. The similarity between rows

needs to be understood in this sense. Combining the two-directional

similarity, we present a two-directional nonlocal (TDNL) variatiotnal

model for image denoising. The model can be viewed as a two-

direction regression method, an enhancement to NLM, a special

case of dictionary learning methods, or an improvement of PCA

下载后可阅读完整内容，剩余4页未读，立即下载

weixin_38744207

粉丝: 344
资源: 2万+

双向非局部模型提升图像去噪效果：探索列行间相似性

TV图像去噪模型_TV图像去噪_图像去噪_去噪图像_TV模型_图像去噪模型TV

图像去噪图像去噪的非局部均值 NLM滤波器研究Matlab代码.rar

图像去噪 TV模型程序

双向非局部模型提升图像插值性能：利用自相似性创新算法

非局部图像去噪：深度解析与‘求同存异’策略

双向增强扩散滤波在图像去噪中的应用

Matlab图像去噪教程：形态学权重自适应法

Matlab图像去噪实战教程：快速跨尺度小波降噪方法

基于PDE的图像处理程序与BSCB模型实现

双目视觉的3阶段策略：基于双向双极线的匹配与去噪

最新资源