Chen X A, et al. Sci China Inf Sci
sparse and low-rank decomposition via Principal Component Pursuit based on the assumption that a
batch of aligned images should form a low-rank matrix. Suppose $I_1^0, \ldots, I_n^0 \in \mathbb{R}^{w \times h}$ are $n$ well-aligned grayscale images of some objects. The function $\mathrm{vec}: \mathbb{R}^{w \times h} \to \mathbb{R}^{m}$ stacks each of the above images as a vector. Then the matrix
$$L = [\mathrm{vec}(I_1^0) \,|\, \cdots \,|\, \mathrm{vec}(I_n^0)] \in \mathbb{R}^{m \times n} \quad (1)$$
should be approximately low-rank.
should be approximately low-rank. This kind of assumption is very common. For instance, Ref. [20]
assumes that a rank-9 approximation suffices when the images $I_i^0$, $i = 1, \ldots, n$, are obtained from some Lambertian objects under varying illumination.
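The construction of $L$ in Eq. (1) can be sketched numerically. The following toy example (synthetic data, not from the paper) builds images lying in a low-dimensional subspace, stacks their vectorizations as columns, and confirms the resulting matrix is low-rank:

```python
import numpy as np

# Illustrative sketch: n synthetic "well-aligned" images spanning an
# r-dimensional subspace, mimicking a batch of images of one object.
rng = np.random.default_rng(0)
w, h, n, r = 20, 15, 30, 3      # image size, batch size, true rank
m = w * h

basis = rng.standard_normal((m, r))
coeffs = rng.standard_normal((r, n))
images = [(basis @ coeffs[:, i]).reshape(w, h) for i in range(n)]

# vec(.) stacks each w x h image into a length-m column vector; the
# columns together form the matrix L of Eq. (1).
L = np.column_stack([img.reshape(-1) for img in images])  # shape (m, n)

singular_values = np.linalg.svd(L, compute_uv=False)
numerical_rank = int(np.sum(singular_values > 1e-8 * singular_values[0]))
print(L.shape, numerical_rank)  # (300, 30) 3
```

Although $n = 30$ images are stacked, the numerical rank of $L$ is only 3, which is exactly the low-rank structure the model exploits.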
However, in practice, the object in image $i$ is often partially occluded or corrupted by some noise $\varepsilon_i$. So it may be more appropriate to assume that we observe $I_1 = I_1^0 + \varepsilon_1, \ldots, I_n = I_n^0 + \varepsilon_n$ instead of $I_1^0, \ldots, I_n^0$. Thus, the original data $D$ can be represented as
$$D = L + S. \quad (2)$$
Here, $S$ is the noise matrix and is generally regarded as sparse.
The above model explicitly requires the original data to be well aligned. For real data, in order to
compensate for the misalignments, a certain transformation $\tau = [\tau_1 \,|\, \cdots \,|\, \tau_n] \in \mathbb{R}^{p \times n}$ is applied to the original data. Then Eq. (2) can be changed to
$$D \circ \tau = L + S, \quad (3)$$
where $D \circ \tau$ means applying the transformation $\tau_i$ to each misaligned image $I_i$.
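As an illustration of the notation $D \circ \tau$, the sketch below warps each image by its own transformation and stacks the results. Parametrizing each $\tau_i$ as a 2D affine map with $p = 6$ parameters is an assumption made for this example, not the paper's choice:

```python
import numpy as np
from scipy.ndimage import affine_transform

# Illustrative sketch of D ∘ τ: warp each misaligned image I_i by τ_i,
# then stack the warped images as the columns of D ∘ τ. The affine
# parametrization (p = 6) is an assumption for this example.
def apply_tau(images, tau):
    """images: list of w x h arrays; tau: (p, n) = (6, n), one column per image."""
    warped = []
    for img, t in zip(images, tau.T):
        A = t[:4].reshape(2, 2)   # linear part of τ_i
        b = t[4:]                 # translation part of τ_i
        warped.append(affine_transform(img, A, offset=b))
    return np.column_stack([w_img.reshape(-1) for w_img in warped])

rng = np.random.default_rng(1)
imgs = [rng.standard_normal((16, 16)) for _ in range(4)]
tau = np.tile([1.0, 0.0, 0.0, 1.0, 0.0, 0.0], (4, 1)).T  # identity τ_i
D_tau = apply_tau(imgs, tau)
print(D_tau.shape)  # (256, 4): m = 16*16 rows, n = 4 columns
```

In the actual alignment algorithms, $\tau$ is an optimization variable, so each iteration re-warps the images with the current estimate of $\tau_i$.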
When taking both the corruption and misalignment into consideration, the Lagrange form of the batch
image alignment model can be formulated as follows:
$$\min_{L,S,\tau} \ \mathrm{rank}(L) + \lambda \|S\|_0 \quad \text{s.t.} \quad D \circ \tau = L + S. \quad (4)$$
Here, the $\ell_0$-norm $\|\cdot\|_0$ counts the number of nonzero entries in the sparse error matrix $S$.
The above optimization problem (4) is NP-hard and thus not directly tractable. Following Robust PCA, the highly nonconvex objective function in (4) is relaxed to its convex surrogate, i.e., the rank and the $\ell_0$-norm are replaced with the nuclear norm and the $\ell_1$-norm, respectively. Then the model can be modified as follows:
$$\min_{L,S,\tau} \ \|L\|_* + \lambda \|S\|_1 \quad \text{s.t.} \quad D \circ \tau = L + S. \quad (5)$$
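For a fixed alignment $\tau$, the inner problem in (5) is standard Principal Component Pursuit, solvable by alternating singular value thresholding (the proximal step for the nuclear norm) and soft thresholding (the proximal step for the $\ell_1$-norm). The following is a minimal sketch of that iteration; the defaults for $\lambda$ and the penalty parameter $\mu$ are common choices from the RPCA literature, not values specified in this paper:

```python
import numpy as np

def shrink(X, t):
    """Soft thresholding: proximal operator of the ℓ1-norm."""
    return np.sign(X) * np.maximum(np.abs(X) - t, 0.0)

def svt(X, t):
    """Singular value thresholding: proximal operator of the nuclear norm."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ np.diag(shrink(s, t)) @ Vt

def pcp(D, lam=None, mu=None, n_iter=500, tol=1e-7):
    """Principal Component Pursuit for D = L + S (alignment τ held fixed).

    Defaults lam = 1/sqrt(max(m, n)) and mu = m*n/(4*||D||_1) follow
    common RPCA practice; they are assumptions, not this paper's values.
    """
    m, n = D.shape
    lam = 1.0 / np.sqrt(max(m, n)) if lam is None else lam
    mu = m * n / (4.0 * np.abs(D).sum()) if mu is None else mu
    L = np.zeros_like(D); S = np.zeros_like(D); Y = np.zeros_like(D)
    for _ in range(n_iter):
        L = svt(D - S + Y / mu, 1.0 / mu)      # nuclear-norm prox step
        S = shrink(D - L + Y / mu, lam / mu)   # ℓ1-norm prox step
        R = D - L - S                          # constraint residual
        Y += mu * R                            # dual update
        if np.linalg.norm(R) <= tol * np.linalg.norm(D):
            break
    return L, S

# Recover a synthetic rank-3 matrix corrupted by ~5% sparse gross errors.
rng = np.random.default_rng(0)
L0 = rng.standard_normal((60, 3)) @ rng.standard_normal((3, 40))
S0 = np.zeros((60, 40))
mask = rng.random((60, 40)) < 0.05
S0[mask] = 10.0 * rng.standard_normal(mask.sum())
L_hat, S_hat = pcp(L0 + S0)
rel_err = np.linalg.norm(L_hat - L0) / np.linalg.norm(L0)
print(rel_err)  # small: the low-rank component is recovered accurately
```

The full alignment methods alternate a step of this kind with an update of $\tau$ (typically via local linearization of $D \circ \tau$), which this sketch omits.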
In [21], the model of (5) is successfully extended to unsupervised subspace discovery. The main goal there is to deal with a much less constrained problem, in which the common pattern (object) exhibits large-scale differences at unknown locations in the images. It is a learning and matching algorithm, but is not concerned with image alignment. Ref. [22] proposes a new method for face alignment and recognition by integrating sparse representation based classification (SRC) [23] into the model of (5), which achieves comparably good recognition rates. Ref. [24] proposes coupling alignments with recognition for still-to-video face recognition based on the theory of sparse representation and subspace segmentation. From a different aspect, Ref. [25] applies the form of model (5) to generate transformation invariant low-rank textures (TILT), in which a single image, rather than an image sequence, is rectified, based on the assumption that the textures of the rectified image are usually symmetric patterns and consequently form low-rank matrices.
In these studies, the low-rank component is required to be exactly low-rank and the sparse component is required to be exactly sparse, as in [18]. Moreover, although solving (5) can produce good image alignment results in many situations, there are specific applications in which the convex-relaxation-based approach (5) cannot provide desirable results. Therefore, it would be better to replace the convex nuclear norm and $\ell_1$-norm with folded-concave penalties to remedy the drawbacks of the convex penalization method.
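One concrete way to realize such a replacement, sketched below under illustrative parameter choices, is to swap the soft-thresholding step for the thresholding operator of a folded-concave penalty. Here the MCP (minimax concave penalty) is used as the example, with parameters `lam` and `gamma` chosen for illustration rather than taken from this paper; its key property is that large entries are left unshrunk, avoiding the bias of $\ell_1$ shrinkage:

```python
import numpy as np

def soft_threshold(x, lam):
    """ℓ1 proximal step: shrinks every surviving entry by lam (biased)."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)

def mcp_threshold(x, lam, gamma=3.0):
    """Proximal operator of the MCP folded-concave penalty (gamma > 1)."""
    x = np.asarray(x, dtype=float)
    return np.where(
        np.abs(x) <= gamma * lam,
        gamma / (gamma - 1.0) * soft_threshold(x, lam),
        x,  # entries beyond gamma*lam are kept exactly: no shrinkage bias
    )

x = np.array([-5.0, -1.2, 0.3, 2.0, 8.0])
print(soft_threshold(x, 1.0))  # every surviving entry pulled toward zero by 1
print(mcp_threshold(x, 1.0))   # small entries thresholded, large ones untouched
```

Applied to the singular values of $L$ and the entries of $S$, such operators keep the strong low-rank and sparse signals intact while still suppressing small perturbations, which is the motivation for the folded-concave formulation.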