762 IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 6, NO. 4, OCTOBER 2009
Ensemble Classification Algorithm for Hyperspectral
Remote Sensing Data
Mingmin Chi, Member, IEEE, Qian Kun, Jón Atli Benediktsson, Fellow, IEEE, and Rui Feng
Abstract—In real applications, it is difficult to obtain a sufficient number of training samples for the supervised classification of hyperspectral remote sensing images. Furthermore, the available training samples may not represent the real distribution of the whole feature space. To address these problems, an ensemble algorithm that combines generative (mixture of Gaussians) and discriminative (support cluster machine) models for classification is proposed. Experimental results obtained on a hyperspectral data set collected by the reflective optics system imaging spectrometer sensor validate the effectiveness of the proposed approach.
Index Terms—Ensemble classification, hyperspectral remote
sensing images, mixture of Gaussians (MoGs), support cluster
machine (SCM).
I. INTRODUCTION
HYPERSPECTRAL remote sensing images are very important for the discrimination of spectrally similar land-cover classes. In order to obtain a reliable classifier, a larger number of representative training samples is necessary for hyperspectral data than for multispectral remote sensing data.
In real applications, it is difficult to obtain a sufficient number of training samples for supervised learning. Furthermore, the training samples may not represent the real distribution of the whole space. These issues result in a quantity problem for the training samples in the design of a robust supervised classifier.
In recent years, semisupervised learning (SSL) methods [1]–[3] have usually been exploited to overcome the problems caused by small numbers of labeled samples in the classification of hyperspectral remote sensing images; examples include self-labeling approaches [1], low-density separation SSL approaches [2], and label-propagation SSL approaches [3]. The aforementioned methods usually exploit either generative or discriminative approaches, where an estimation criterion is used for adjusting the parameters and/or structure of the classifier.
There is little literature on the use of both generative and discriminative models for the quantity problem. In [4], the authors worked on a generative model and adopted a discriminative model to correct the bias of a generative classifier learnt from a small training set.

Manuscript received December 22, 2008; revised April 10, 2009. First published July 28, 2009; current version published October 14, 2009. This work was supported in part by the Natural Science Foundation of China under Contract 60705008, by the Ph.D. Programs Foundation of the Ministry of Education of China under Contract 20070246132, and by the Research Fund of the University of Iceland.
M. Chi, Q. Kun, and R. Feng are with the School of Computer Science, Fudan University, Shanghai 200433, China (e-mail: mmchi@fudan.edu.cn; 0314018@fudan.edu.cn; fengrui@fudan.edu.cn).
J. A. Benediktsson is with the Faculty of Electrical and Computer Engineering, University of Iceland, 107 Reykjavik, Iceland (e-mail: benedikt@hi.is).
Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/LGRS.2009.2024624
In this letter, we propose an ensemble algorithm that combines the advantages of both generative and discriminative models to deal with the quantity problem in the classification of hyperspectral remote sensing images. In particular, both labeled and unlabeled data are represented with a generative model [i.e., a mixture of Gaussians (MoGs)]. Then, the estimated model is used for discriminative learning. This is motivated by a recently proposed discriminative classification approach, the support cluster machine (SCM) [5]. The SCM was originally used to address large-scale supervised learning problems. The main idea of the SCM is that the labeled data are first modeled using a generative model. Then, the kernel, i.e., the similarity measure between Gaussians, is defined by probability product kernels (PPKs) [6]. The obtained PPK kernel matrix is then used to train support vector machines (SVMs), where the learned models contain support clusters rather than support vectors (hence the name SCM).
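For intuition, the PPK in [6] admits a closed form between two Gaussian components: with the kernel exponent set to 1 (the expected likelihood kernel), the integral of the product of two Gaussians reduces to a single Gaussian density evaluation. The sketch below illustrates this special case only; the exponent choice and the toy inputs are illustrative assumptions, not the letter's exact configuration.

```python
import numpy as np
from scipy.stats import multivariate_normal


def ppk_gaussian(mu_p, cov_p, mu_q, cov_q):
    """PPK between two Gaussians for exponent rho = 1 (expected
    likelihood kernel), using the closed form
    K(p, q) = N(mu_p; mu_q, cov_p + cov_q)."""
    return multivariate_normal.pdf(mu_p, mean=mu_q, cov=cov_p + cov_q)


# Toy check in one dimension: two identical Gaussians yield a larger
# kernel value (higher similarity) than two well-separated ones.
k_same = ppk_gaussian(np.zeros(1), np.eye(1), np.zeros(1), np.eye(1))
k_far = ppk_gaussian(np.zeros(1), np.eye(1), 5.0 * np.ones(1), np.eye(1))
assert k_same > k_far
```

Evaluating this kernel over all pairs of mixture components yields the kernel matrix that is then passed to a standard SVM solver.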
In the SCM, the number of clusters is important for obtaining the best classification results. If the selected number of Gaussians (the components are not limited to Gaussians) does not fit the data well, the classification accuracy can decrease. Moreover, for a small-size training set, a mixture model estimated from only the labeled samples cannot represent the distribution of the whole data.
To address the aforementioned problem, it is proposed here to first use both labeled and unlabeled samples to estimate an MoG. Then, different sets of MoGs are generated by going from few (coarse representation) to many (fine representation) clusters. For each of the estimated MoGs, the corresponding PPK kernel matrix can be computed and used as input to a standard SVM for training. Finally, the output classification result is obtained by combining, with an ensemble technique, the results produced by the individual SCMs learnt with the different sets of MoGs. The accuracy and reliability of the proposed algorithm have been evaluated on reflective optics system imaging spectrometer (ROSIS) hyperspectral remote sensing data collected over the University of Pavia, Italy. The results are promising when compared to state-of-the-art classifiers.
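The combination step above can be sketched with a simple majority vote over the per-pixel labels produced by the individual SCMs; the letter only states that an ensemble technique is used, so the voting rule, the function name, and the toy labels below are illustrative assumptions rather than the authors' exact scheme.

```python
import numpy as np


def majority_vote(predictions):
    """Combine per-pixel class labels from several classifiers
    (one row per ensemble member) by majority voting; ties are
    broken in favour of the smallest class label."""
    predictions = np.asarray(predictions)
    n_classes = predictions.max() + 1
    # For each pixel (column), count the votes each class received.
    votes = np.apply_along_axis(
        lambda col: np.bincount(col, minlength=n_classes), 0, predictions)
    # The winning class per pixel is the one with the most votes.
    return votes.argmax(axis=0)


# Three hypothetical SCMs (trained on MoGs of different sizes)
# classifying five pixels into classes {0, 1, 2}.
labels = majority_vote([[0, 1, 2, 1, 0],
                        [0, 1, 2, 2, 1],
                        [0, 2, 2, 1, 1]])
# -> [0, 1, 2, 1, 1]
```

Each ensemble member here would correspond to one SCM trained on a PPK kernel matrix built from one MoG size.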
The rest of this letter is organized as follows. The next section describes the proposed ensemble algorithm with generative/discriminative models. Section III describes the data used in the experiments and reports and discusses the results provided by the different algorithms. Finally, conclusions and a discussion are given in Section IV.
1545-598X/$26.00 © 2009 IEEE