where H is the label matrix with binary entries (0 or 1), and α and β are positive scalars.
$Q = [q_1, q_2, \dots, q_N] \in \mathbb{R}^{K \times N}$ are the "discriminative" sparse codes for X, where $q_i$ is an ideal "discriminative" sparse code for $x_i$ if the nonzero values of $q_i$ occur at those indices where $x_i$ and atom $d_k$ share the same label [9]. Suppose that $X = [x_1, \dots, x_9]$ has 3 classes, where $x_1, \dots, x_4$ and atoms $d_1, d_2$ are from class 1, $x_5, x_6$ and $d_3, d_4$ are from class 2, and the rest from class 3; the "discriminative" sparse codes matrix Q is then defined as

$$Q=\begin{bmatrix}1&1&1&1&0&0&0&0&0\\1&1&1&1&0&0&0&0&0\\0&0&0&0&1&1&0&0&0\\0&0&0&0&1&1&0&0&0\\0&0&0&0&0&0&1&1&1\\0&0&0&0&0&0&1&1&1\end{bmatrix}.\quad (3)$$
Thus, the term ||Q − AS||_F^2 is the discriminative sparse-code error, where A converts the original sparse codes in S to be more discriminative in a feature subspace. This term can encourage label consistency in the resulting codes, but Q is arbitrarily defined, so it can neither preserve local information nor inherit the structure information of the samples. Setting all nonzero entries to ones is also too hard, since the coefficients in S are essentially soft, i.e., a large value s_{i,j} means that the contribution of atom d_i to reconstructing x_j is large, and a small value means it is small.
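For concreteness, here is a minimal NumPy sketch of how such a Q can be built from sample and atom label vectors; the labels are the hypothetical ones of the example above, and the sketch reproduces the matrix in Eq. (3) exactly:

```python
import numpy as np

# Hypothetical labels matching the example of Eq. (3):
# 9 samples and 6 atoms drawn from 3 classes.
sample_labels = np.array([1, 1, 1, 1, 2, 2, 3, 3, 3])  # labels of x_1,...,x_9
atom_labels = np.array([1, 1, 2, 2, 3, 3])             # labels of d_1,...,d_6

# Q[k, i] = 1 iff atom d_k and sample x_i share the same label.
Q = (atom_labels[:, None] == sample_labels[None, :]).astype(float)
print(Q)  # the 6 x 9 binary matrix of Eq. (3)
```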
Note that D-KSVD and LC-KSVD can be equivalent under uniform atom allocation [36], provided that the initialization conditions and optimization methods are identical. By this equivalence, we can reformulate LC-KSVD as

$$\left(D^{*},S^{*},W^{*}\right)=\arg\min_{D,S,W}\left\|\begin{bmatrix}X\\H\end{bmatrix}-\begin{bmatrix}D\\W\end{bmatrix}S\right\|_{F}^{2},\;\; s.t.\;\left\|s_{i}\right\|_{0}\le T,\; i=1,2,\dots,N,\quad (4)$$

which is just the problem of D-KSVD with the same number of dictionary atoms allocated to each class. The analysis in [36] also shows that D-KSVD is preferable to the LC-KSVD algorithm because of its simplicity and efficiency.
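To make the reformulation concrete, the stacking in Eq. (4) can be sketched as follows; the shapes and random initialization are hypothetical, and any off-the-shelf K-SVD routine is assumed for the actual optimization:

```python
import numpy as np

d, K, N, c = 64, 6, 9, 3                       # hypothetical sizes
X = np.random.randn(d, N)                      # training data
H = np.eye(c)[:, np.random.randint(0, c, N)]   # binary label matrix (c x N)
D = np.random.randn(d, K)                      # initial dictionary
W = np.random.randn(c, K)                      # initial classifier

# Eq. (4): D-KSVD is plain K-SVD run on the augmented pair below.
X_aug = np.vstack([X, H])                      # (d + c) x N
D_aug = np.vstack([D, W])                      # (d + c) x K
D_aug /= np.linalg.norm(D_aug, axis=0)         # K-SVD expects unit-norm atoms

# After K-SVD converges on (X_aug, D_aug), the top d rows of the learned
# D_aug give D and the bottom c rows give W (up to column renormalization).
```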
B. Review of LCLE-DL
LCLE-DL is another related DL method, so we also briefly revisit it. LCLE-DL calculates a discriminative dictionary D by jointly combining the label embedding of atoms and the locality constraint of atoms. The problem of LCLE-DL is defined as
$$\min_{D,S,V}\left\|X-DS\right\|_{F}^{2}+\alpha\,\mathrm{tr}\!\left(S^{T}LS\right)+\left\|X-DV\right\|_{F}^{2}+\beta\,\mathrm{tr}\!\left(V^{T}UV\right)+\lambda\left\|S-V\right\|_{F}^{2},\;\; s.t.\;\left\|d_{i}\right\|_{2}=1,\; i=1,\dots,K,\quad (5)$$
where S and V denote the coefficient matrices, ||X − DS||_F^2 and ||X − DV||_F^2 denote the reconstruction errors, and λ||S − V||_F^2 is the regularization used to transfer the label constraint tr(V^T UV) to/from the locality constraint tr(S^T LS). U is the scaled label matrix constructed from the labels of the dictionary atoms. α, β and λ are parameters. L is a graph Laplacian matrix defined as
$$L=G-M,\;\;\text{where}\;\; G=\mathrm{diag}\left(g_{1},g_{2},\dots,g_{K}\right),\;\; g_{i}=\sum_{j=1}^{K}M_{i,j},\quad (6)$$
where the nearest neighbor graph is weighted by M as
$$M_{i,j}=\begin{cases}\exp\!\left(-\left\|d_{i}-d_{j}\right\|_{2}^{2}/\sigma\right), & \text{if}\; d_{j}\in kNN\!\left(d_{i}\right)\\0, & \text{else}\end{cases},\quad (7)$$
which is defined by the Gaussian function, where σ is the kernel width, kNN(d_i) is the k-nearest-neighbor set of atom d_i, G is a diagonal matrix, and M_{i,j} encodes the similarity between atoms d_i and d_j. By the above definitions, ||X − DS||_F^2 + α tr(S^T LS) encodes the reconstruction error with the locality constraint, where tr(S^T LS) inherits the manifold structure of the training set. ||X − DV||_F^2 + β tr(V^T UV) encodes the reconstruction error with label embedding, where tr(V^T UV) forces the intra-class atoms in D to have similar profiles. The term λ||S − V||_F^2 is a regularization over the coefficients, which ensures the mutual transformation between the label embedding and the locality constraint.
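As an illustration of Eqs. (6)-(7), here is a minimal NumPy sketch that builds the atom graph Laplacian; the symmetrization step is an added assumption, since the raw kNN weights of Eq. (7) need not be symmetric:

```python
import numpy as np

def atom_laplacian(D, k=3, sigma=1.0):
    """Graph Laplacian L = G - M over the atoms (columns of D), per Eqs. (6)-(7)."""
    K = D.shape[1]
    # Pairwise squared distances ||d_i - d_j||^2 between atoms.
    dist2 = np.sum((D[:, :, None] - D[:, None, :]) ** 2, axis=0)
    M = np.zeros((K, K))
    for i in range(K):
        nbrs = np.argsort(dist2[i])[1:k + 1]          # kNN(d_i), excluding d_i
        M[i, nbrs] = np.exp(-dist2[i, nbrs] / sigma)  # Gaussian weights, Eq. (7)
    M = np.maximum(M, M.T)       # symmetrize the kNN graph (added assumption)
    G = np.diag(M.sum(axis=1))   # G = diag(g_1,...,g_K), g_i = sum_j M_{i,j}
    return G - M                 # Eq. (6)
```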
III. ROBUST FLEXIBLE DISCRIMINATIVE DICTIONARY
LEARNING (RFDDL)
A. Problem Formulation
The presented RFDDL model improves the robustness of discriminative DL to noise and sparse errors in two ways. First, it calculates the robust discriminative dictionary and codes in the recovered clean data and atom subspaces. Specifically, RFDDL decomposes the original X and dictionary D in each iteration to recover the underlying clean data X_new and clean dictionary D_new, and models the errors E and E_D at the same time in terms of X = X_new + E and D = D_new + E_D, where the L_{2,1}-norm is imposed on E and E_D so that the sparse errors in data and atoms can be corrected jointly. Then, RFDDL performs the discriminative DL over the clean X_new and D_new for accurate representations. Second, our RFDDL encodes the locality and defines a Laplacian matrix by computing discrimination-promoting reconstruction weights over the recovered clean atoms, which can encourage intra-class samples to have similar sparse codes and inter-class samples to have different ones, as detailed in the next subsection. By combining the joint subspace recovery with the Laplacian-regularized reconstruction error, the discriminative sparse-code error and the data classification error, the initial problem of RFDDL is given as
$$\begin{aligned}\min_{D_{new},\,S,\,L,\,E,\,E_{D},\,W}\;&\left\|X_{new}-D_{new}S\right\|_{F}^{2}+\alpha\left(\left\|E\right\|_{2,1}+\left\|E_{D}\right\|_{2,1}\right)+\beta\left\|Q-LS\right\|_{F}^{2}\\&+\lambda\left\|SH_{e}\right\|_{F}^{2}+\gamma\left(\left\|H-WLS\right\|_{F}^{2}+\left\|W^{T}\right\|_{2,1}\right),\\ s.t.\;\;& X=X_{new}+E,\;\; D=D_{new}+E_{D},\end{aligned}\quad (8)$$
where ||E||_{2,1} + ||E_D||_{2,1} is the L_{2,1}-norm based error, and α, β, γ and λ are the parameters. Note that the L_{2,1}-norm can ensure that the regularized matrix is sparse in rows, and the L_{2,1}-norm based metric is robust to noise and outliers in data and atoms [2], [7], [27]. The L_{2,1}-norm based classifier regularization ||W^T||_{2,1} can force the columns of W to be sparse so that the discriminative soft labels can be predicted in the latent sparse subspace. Q and H are defined similarly to LC-KSVD, ||SH_e||_F^2 is the Frobenius-norm based coefficients penalty, and H_e = I − (1/N)ee^T is the "centering matrix", that is, SH_e can be considered as the normalized coding coefficients. It is clear that the discriminative Laplacian matrix L is associated with the learning of both the codes and the classifier, which can potentially obtain more accurate codes, a discriminative dictionary and a powerful discriminative classifier jointly.
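To clarify the ingredients specific to Eq. (8), here is a small NumPy sketch of the L_{2,1}-norm, of the row-wise shrinkage commonly used to optimize such terms, and of the centering matrix; the shapes are hypothetical, and H_e = I − ee^T/N is the standard centering matrix assumed from the "normalized coding coefficients" remark:

```python
import numpy as np

def l21_norm(A):
    # L_{2,1}-norm: the sum of the L2 norms of the rows of A,
    # which is what makes the penalized matrix sparse in rows.
    return np.sum(np.linalg.norm(A, axis=1))

def l21_prox(A, lam):
    # Row-wise soft-thresholding, the proximal operator of lam * ||.||_{2,1};
    # one common way terms like ||E||_{2,1} and ||E_D||_{2,1} are optimized.
    norms = np.linalg.norm(A, axis=1, keepdims=True)
    return A * np.maximum(0.0, 1.0 - lam / np.maximum(norms, 1e-12))

K, N, c = 6, 9, 3                  # hypothetical sizes
e = np.ones((N, 1))
H_e = np.eye(N) - (e @ e.T) / N    # centering matrix H_e = I - ee^T / N

S = np.random.randn(K, N)          # coding coefficients
S_norm = S @ H_e                   # SH_e: each row of S is shifted to zero mean
assert np.allclose(S_norm.mean(axis=1), 0.0)

W = np.random.randn(c, K)          # classifier
penalty = l21_norm(W.T)            # ||W^T||_{2,1}: promotes sparse columns of W
```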
Please note that the linear reconstruction X_new = D_new S may overfit, especially when the number of training data is limited. To enable DL to handle data sampled from a nonlinear manifold and to avoid overfitting, our RFDDL proposes a flexible reconstruction residue motivated