Adaptive semi-supervised dimensionality reduction with sparse representation using pairwise constraints

Jia Wei^a, Meng Meng^a, Jiabing Wang^a, Qianli Ma^a, Xuan Wang^b

^a School of Computer Science and Engineering, South China University of Technology, Guangzhou, China
^b Computer Application Research Center, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
Article info

Article history:
Received 20 August 2014
Received in revised form 29 September 2015
Accepted 19 November 2015
Available online 2 December 2015
Communicated by Feiping Nie
Keywords:
Semi-supervised learning
Dimensionality reduction
Pairwise constraints
Sparse representation
abstract
With the rapid accumulation of high dimensional data, dimensionality reduction plays an increasingly important role in practical data processing and learning tasks. This paper studies semi-supervised dimensionality reduction using pairwise constraints. In this setting, domain knowledge is given in the form of pairwise constraints, each of which specifies whether a pair of instances belongs to the same class (must-link constraint) or to different classes (cannot-link constraint). In this paper, a novel semi-supervised dimensionality reduction method called Adaptive Semi-Supervised Dimensionality Reduction with Sparse Representation (ASSDR-SR) is proposed, which obtains an optimized low dimensional representation of the original data by adaptively adjusting the weights of the pairwise constraints and simultaneously optimizing the graph construction using the ℓ1 graph of sparse representation. Experiments on clustering and classification tasks show that ASSDR-SR is superior to several existing dimensionality reduction methods.
© 2015 Elsevier B.V. All rights reserved.
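The abstract's ℓ1 graph can be illustrated with a minimal sketch: each sample is reconstructed as a sparse linear combination of the other samples, and the absolute coefficients serve as graph edge weights. This is one common sparse-representation graph construction; the solver choice, penalty value, use of absolute values, and symmetrization below are our assumptions, not the authors' exact formulation.

```python
# Hedged sketch of an l1 (sparse-representation) graph construction.
# Each row of W holds the sparse Lasso coefficients that reconstruct
# sample x_i from the remaining samples (the "dictionary").
import numpy as np
from sklearn.linear_model import Lasso

def l1_graph(X, alpha=0.05):
    """X: (n_samples, n_features) data matrix.
    Returns a symmetric (n, n) nonnegative weight matrix with zero diagonal."""
    n = X.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        others = np.delete(X, i, axis=0)       # dictionary: every sample but x_i
        lasso = Lasso(alpha=alpha, max_iter=5000)
        # Solve min_w ||x_i - D w||^2 + alpha * ||w||_1 with D = others.T
        lasso.fit(others.T, X[i])
        W[i, np.arange(n) != i] = np.abs(lasso.coef_)  # |coefficients| as edge weights
    return (W + W.T) / 2                       # symmetrize for an undirected graph

X = np.random.RandomState(0).randn(20, 5)
W = l1_graph(X)
```

Because the ℓ1 penalty drives most coefficients to zero, each node connects only to the few samples that best reconstruct it, which is what makes the resulting graph sparse and adaptive to the data rather than fixed by a k-nearest-neighbor rule.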
1. Introduction
The goal of dimensionality reduction is to reduce the complexity of the input data while preserving some desired intrinsic information. Two of the most popular methods for
dimensionality reduction are Principal Component Analysis (PCA)
[1] and Linear Discriminant Analysis (LDA) [2], which are unsu-
pervised and supervised respectively.
In many real world applications such as image segmentation,
web page classification and gene-expression clustering, a labeling
process is costly and time-consuming; in contrast, unlabeled
examples can be easily obtained. Therefore, in such situations,
it may be beneficial to incorporate the information which is
contained in unlabeled examples into a learning problem, i.e.,
Semi-Supervised Learning (SSL) [3] should be applied instead
of supervised learning. Meanwhile, dimensionality reduction in the semi-supervised setting has also attracted increasing attention [4,5].
However, in many cases, people cannot tell which category an instance belongs to; that is, we do not know the exact label of an instance, only the constraint information of whether a pair of instances belongs to the same class (must-link constraint) or to different classes (cannot-link constraint) [6]. Such pairwise constraint information is called "side information" [7]. Constraint information is more general than label information, because constraints can be derived from labels but not the other way around [8].
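The labels-to-constraints direction noted above is mechanical, which a short sketch makes concrete (the function and variable names are ours, not from the paper): every pair of labeled instances yields exactly one must-link or cannot-link constraint.

```python
# Hedged sketch: deriving pairwise constraints from a partially labeled set.
# Labels -> constraints is straightforward; the reverse mapping is not generally
# possible, which is why constraints are considered more general than labels.
from itertools import combinations

def constraints_from_labels(labels):
    """labels: dict {instance_index: class_label} for the labeled subset.
    Returns (must_link, cannot_link) as lists of index pairs."""
    must_link, cannot_link = [], []
    for (i, yi), (j, yj) in combinations(sorted(labels.items()), 2):
        (must_link if yi == yj else cannot_link).append((i, j))
    return must_link, cannot_link

ml, cl = constraints_from_labels({0: 'A', 1: 'A', 2: 'B'})
# ml == [(0, 1)], cl == [(0, 2), (1, 2)]
```

Note the asymmetry: from the constraints alone one can recover at best a partition of the labeled instances, not the class names, so the original labeling cannot be reconstructed.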
Some related works have been proposed to make use of the
pairwise constraints to extract low dimensional structure in high
dimensional data. Bar-Hillel et al. proposed Relevant Component
Analysis (RCA) which can make use of the must-link constraints
for semi-supervised dimensionality reduction [9]. Xing et al. [7],
Tang et al. [10], Yeung et al. [11] and An et al. [12] proposed some
constraints based semi-supervised dimensionality reduction
methods, which can make use of both the must-link constraints
and cannot-link constraints. Zhang et al. proposed Semi-
Supervised Dimensionality Reduction (SSDR) [13] and Chen et al.
used SSDR in hyperspectral image classification [14]. SSDR can use
the pairwise constraints as well as preserve the global covariance
structure of the unlabeled data in the projected low dimensional
subspace. Cevikalp et al. proposed Constrained Locality Preserving
Projections (CLPP) [15] which is the semi-supervised version of
LPP [16]. This method can make use of the information provided by the pairwise constraints and can also exploit the unlabeled data by preserving the local structure used in LPP. Wei et al. proposed
Neighborhood Preserving based Semi-Supervised Dimensionality
Reduction (NPSSDR) [17] by using the pairwise constraints and
preserving the neighborhood structure used in LLE [18]. Baghshah
et al. used the idea of NPSSDR in metric learning and used a
heuristic search algorithm to solve the proposed constrained trace
ratio problem [19]. Davidson proposed a graph driven constrained
http://dx.doi.org/10.1016/j.neucom.2015.11.048
E-mail address: csjwei@scut.edu.cn (J. Wei).
Neurocomputing 177 (2016) 564–571