深度去噪先验在图像恢复中的应用

需积分: 0 110 浏览量更新于2024-07-01 收藏 9.54MB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

"题目4论文41 - 插拔式图像恢复与深度去噪先验" 这篇论文"题目4论文41"主要探讨了在图像恢复领域中的插拔式（Plug-and-Play）方法，该方法利用深度去噪器作为图像先验来解决基于模型的方法中的逆问题。近年来，这种方法因其能够结合模型基方法的灵活性和学习基方法的有效性而受到关注。特别是，当去噪器通过深度卷积神经网络（CNN）进行判别性学习时，其强大的建模能力可以显著提升图像恢复效果。然而，尽管更深层次和更大的CNN模型在图像处理领域越来越受欢迎，现有的插拔式图像恢复方法却因缺乏合适的去噪器先验而限制了性能。论文指出，随着CNN模型复杂度的增加，如何有效地利用这些模型成为了一个挑战。为了突破插拔式图像恢复的局限，研究者建立了一个基准深度去噪先验，训练了一个高度灵活且高效的CNN去噪器。这个深度去噪器被设计成一个模块化部分，可以嵌入到基于半二次分割（Half Quadratic Splitting）的迭代算法中，用于解决各种图像恢复问题。半二次分割算法是一种常用于优化非凸、非线性问题的方法，通过分解复杂问题为更易于管理的子问题来逐步逼近解决方案。论文中可能涉及的知识点包括： 1. 插拔式图像恢复：这是一种允许在不同恢复任务中替换或“插拔”去噪模块的方法，它使得预训练的去噪模型可以应用于其他逆问题，如图像去噪、超分辨率、去雾等。 2. 深度去噪器：利用深度学习，特别是CNN，训练出的高效去噪模型，能够学习到复杂的图像特征并进行有效的噪声去除。 3. 判别性学习：在训练深度模型时，通过监督学习的方式让模型学习区分不同类别的样本，以增强模型的区分能力和泛化能力。 4. 半二次分割算法：一种优化策略，将非凸、非线性的优化问题转化为一系列简单的半凸优化子问题，通过迭代求解。 5. CNN模型的复杂度：更深层次和更大的模型通常能捕获更复杂的图像模式，但也可能导致计算量增加和训练难度提高。 6. 图像恢复的逆问题：图像恢复通常涉及到从受损或失真的图像中恢复原始图像的过程，如去噪、去模糊、超分辨率等，这些问题通常被视为逆问题，因为它们需要从观测数据反推原始信号。 7. 模块化设计：深度去噪器被设计为可复用的模块，可以方便地与其他恢复算法集成，增强了算法的通用性和适应性。这篇论文不仅提出了一个创新的深度去噪器设计，还展示了如何将其应用于插拔式图像恢复框架，以改善当前方法的性能，推动了图像恢复领域的技术边界。

资源详情

资源推荐

TABLE 1

Average PSNR(dB) results of different methods with noise levels 15, 25 and 50 on the widely-used Set12 and BSD68 [3], [44], [49] datasets. The

best and second best results are highlighted in red and blue colors, respectively.

Datasets

Noise

BM3D WNNM DnCNN

Net

NLRN RNAN FOCNet IRCNN FFDNet DRUNet

Level

15 32.37 32.70 32.86 – 33.16 – 33.07 32.77 32.75 33.25

Set12 25 29.97 30.28 30.44 30.55 30.80 – 30.73 30.38 30.43 30.94

50 26.72 27.05 27.18 27.43 27.64 27.70 27.68 27.14 27.32 27.90

15 31.08 31.37 31.73 – 31.88 – 31.83 31.63 31.63 31.91

BSD68 25 28.57 28.83 29.23 29.30 29.41 – 29.38 29.15 29.19 29.48

50 25.60 25.87 26.23 26.39 26.47 26.48 26.50 26.19 26.29 26.59

design for better restoration. However, these methods learn

a separate model for each noise level. Perhaps the most

suitable denoiser for plug-and-play IR is FFDNet [18] which

can handle a wide range of noise levels by taking the

noise level map as input. Nevertheless, FFDNet only has

a comparable performance to DnCNN and IRCNN, thus

lacking effectiveness to boost the performance of plug-and-

play IR. For this reason, we propose to improve FFDNet

by taking advantage of the widely-used U-Net [20] and

ResNet [19] for architecture design.

3.1 Denoising Network Architecture

It is well-known that U-Net [20] is effective and efﬁcient for

image-to-image translation, while ResNet [19] is superior

in increasing the modeling capacity by stacking multiple

residual blocks. Following FFDNet [18] that takes the noise

level map as input, the proposed denoiser, namely DRUNet,

further integrates residual blocks into U-Net for effective

denoiser prior modeling. Note that this work focuses on

providing a ﬂexible and powerful pre-trained denoiser to

beneﬁt existing plug-and-play IR methods rather than de-

signing new denoising network architecture. Actually, the

similar idea of combining U-Net and ResNet can also be

found in other works such as [61], [62].

The architecture of DRUNet is illustrated in Fig. 1. Like

FFDNet, DRUNet has the ability to handle various noise

levels via a single model. The backbone of DRUNet is

U-Net which consists of four scales. Each scale has an

identity skip connection between 2 × 2 strided convolution

(SConv) downscaling and 2 × 2 transposed convolution

(TConv) upscaling operations. The number of channels in

each layer from the ﬁrst scale to the fourth scale are 64,

128, 256 and 512, respectively. Four successive residual

blocks are adopted in the downscaling and upscaling of

each scale. Inspired by the network architecture design for

super-resolution in [63], no activation function is followed

by the ﬁrst and the last convolutional (Conv) layers, as well

as SConv and TConv layers. In addition, each residual block

only contains one ReLU activation function.

It is worth noting that the proposed DRUNet is bias-

free, which means no bias is used in all the Conv, SConv

and TConv layers. The reason is two-fold. First, bias-free

network with ReLU activation and identity skip connection

naturally enforces scaling invariance property of many im-

age restoration tasks, i.e., f (ax) = af(x) holds true for any

scalar a ≥ 0 (please refer to [64] for more details). Second,

we have empirically observed that, for the network with

bias, the magnitude of bias would be much larger than that

of ﬁlters, which in turn may harm the generalizability.

3.2 Training Details

It is well known that CNN beneﬁts from the availability of

large-scale training data. To enrich the denoiser prior for

plug-and-play IR, instead of training on a small dataset that

includes 400 Berkeley segmentation dataset (BSD) images

of size 180×180 [9], we construct a large dataset consisting

of 400 BSD images, 4,744 images of Waterloo Exploration

Database [65], 900 images from DIV2K dataset [66], and

2,750 images from Flick2K dataset [63]. Because such a

dataset covers a larger image space, the learned model can

slightly improve the PSNR results on BSD68 dataset [3]

while having an obvious PSNR gain on testing datasets from

a different domain.

As a common setting for Gaussian denoising, the noisy

counterpart y of clean image x is obtained by adding

AWGN with noise level σ. Correspondingly, the noise level

map is a uniform map ﬁlled with σ and has the same spatial

size as noisy image. To handle a wide range of noise levels,

the noise level σ is randomly chosen from [0, 50] during

training. Note that the noisy images are not clipped into the

range of [0, 255]. The reason is that the clipping operation

would change the distribution of the noise, which in turn

will give rise to inaccurate solution for plug-and-play IR.

The network parameters are optimized by minimizing the

L1 loss rather than L2 loss between the denoised image and

its ground-truth with Adam algorithm [67]. Although there

is no direct evidence on which loss would result in better

performance, it is widely acknowledged that L1 loss is more

robust than L2 loss in handling outliers [68]. Regarding

to denoising, outliers may occur during the sampling of

AWGN. In this sense, L1 loss tends to be more stable than

L2 loss for denoising network training. The learning rate

starts from 1e-4 and then decreases by half every 100,000

iterations and ﬁnally ends once it is smaller than 5e-7. In

each iteration during training, 16 patches with patch size

of 128×128 were randomly sampled from the training data.

We separately learn a denoiser model for grayscale image

and color image. It takes about four days to train the model

with PyTorch and an Nvidia Titan Xp GPU.

3.3 Denoising Results

3.3.1 Grayscale Image Denoising

For grayscale image denoising, we compared the proposed

DRUNet denoiser with several state-of-the-art denoising

methods, including two representative model-based meth-

ods (i.e., BM3D [23] and WNNM [10]), ﬁve CNN-based

methods which separately learn a single model for each

noise level (i.e., DnCNN [44], N

Net [60], NLRN [59],

剩余15页未读，继续阅读

Xhinking

粉丝: 29
资源: 320

深度去噪先验在图像恢复中的应用

Matlab的各校ppt历年题目及获奖论文-建模题目及论文3.rar

云计算毕业论文题目免费参考——毕业论文写作攻略.docx

基于matlab的毕业论文题目.docx

学校公共设施维护与管理平台论文题目的阐述和观点

请帮我修改一个更吸引人的论文题目：长期激励有效期差异影响企业增长方式的研究

帮我写一篇3000字论文；题目：多边主义：世界发展的破局之策；知网论文形式

出10个简单的金融类型的本科毕业论文题目

推荐几个关于人脸性别和年龄预测的毕业相关论文题目

帮我找一篇论文，论文题目为“Magnetic compensation of MAD equipped aircraft”

围绕计算机科学与技术专业，论文题目自拟。论文正文字数不低于10000字。论文要求中心突出、内容充实、证据充分、数据可靠、格式规范、层次分明、文字通畅、结论明确

想使用激励相容理论和政策工具智能识别方法写一篇论文，请给出一个论文题目和研究思路

毕业论文进度安排 题目：60kv营口降压变电所电气部分设计

生成论文题目、作者、摘要、关键词、发表时间等数据的csv文件

关于人脸分割，有没有好的研究方向，创新点，论文题目及其他衍生的好写论文的计算机视觉研究方向

在maskformer出现后，可不可以帮我想几个关于语义分割或实例分割或全景分割或视频方向的分割的论文题目以及创新点研究方向

怎么做爬取论文网的论文题目、作者、摘要、关键词、发表时间等数据的词云图

美赛2022题目pdf

写一篇The Use of Social Media in Distance Learning题目的7000字论文

最新资源

毕业论文进度安排题目：60kv营口降压变电所电气部分设计