IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 14, NO. 11, NOVEMBER 2017 2167
Weight-Based Rotation Forest for Hyperspectral
Image Classification
Wei Feng and Wenxing Bao
Abstract— In this letter, we propose a new weight-based
rotation forest (WRoF) induction algorithm for the classification
of hyperspectral image. The main idea of the new method is to
guide the growth of trees adaptively via exploring the potential
of important instances. The importance of a training instance is
reflected by a dynamic weight function. The higher the weight
of an instance, the more the next tree will have to focus on the
instance. Experimental results on two real hyperspectral data sets
show that the WRoF algorithm results in significant classification
improvement compared with random forests and rotation forest.
Index Terms— Classification, hyperspectral image, random
forests (RFs), rotation forest (RoF), weight.
I. INTRODUCTION
CLASSIFICATION is one of the major tasks in remote sensing information processing. Classification of hyperspectral data is usually more difficult than that of other remote sensing imagery due to issues such as the high ratio of features to instances and the redundant information in the feature set [1].
While most learning systems suffer from the intractability
issue known as the curse of dimensionality, studies have
demonstrated the successful application of classifier ensemble
techniques to hyperspectral classification [2]–[6].
Ensemble learning, also called committee-based learning, is an effective method to develop accurate classification systems [7]. It is appealing because it can boost weak learners, which are only slightly better than random guessing, into strong aggregated learners that make very accurate predictions. Boosting [8] and bagging (an acronym for bootstrap aggregating) [7] are the major ensemble learning methods. Diversity,
which is the difference among the individual learners, has
been recognized as a very important characteristic in classifier
combination [9]. It can be used effectively to reduce the
variance error without increasing the bias error by ensemble
methods [10]. To encourage diversity within bagging, random forests (RFs) [11] were proposed. An RF is a combination of tree predictors in which the decision trees [12] are constructed on bootstrap samples drawn with replacement; at each node, a random subset of the attributes is sampled and the best split is chosen
Manuscript received April 26, 2017; revised August 20, 2017; accepted
September 11, 2017. Date of publication October 13, 2017; date of current
version October 25, 2017. This work was supported by the National Natural
Science Foundation of China under Grant 61461003. (Corresponding author:
Wei Feng.)
W. Feng is with the Geo-Resources and Environment Laboratory,
Bordeaux INP, 33600 Pessac, France (e-mail: wei.feng@ipb.fr).
W. Bao is with the School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China (e-mail: bwx71@163.com).
Color versions of one or more of the figures in this letter are available
online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/LGRS.2017.2757043
among those variables rather than the best split among all
attributes. Important advantages, such as efficient operation on large databases, the ability to handle thousands of input variables without variable deletion, and low time cost, have made RFs attract wide interest from researchers [13], [14].
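The RF procedure just described (bootstrap resampling plus a random attribute subset at each split) can be sketched with scikit-learn; this is an illustrative assumption, since the letter does not prescribe any library, and the synthetic data stand in for a hyperspectral training set.

```python
# Minimal RF sketch (assumed scikit-learn API; not the letter's own code).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a labeled hyperspectral training set.
X, y = make_classification(n_samples=500, n_features=20,
                           n_informative=10, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(
    n_estimators=100,     # number of tree predictors in the combination
    max_features="sqrt",  # random attribute subset evaluated at each split
    bootstrap=True,       # resampling with replacement per tree
    random_state=0,
).fit(X_tr, y_tr)

acc = rf.score(X_te, y_te)
```

Restricting each split to a random feature subset is what injects the extra diversity beyond plain bagging.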
The rotation forest (RoF) [20] method draws upon the idea
of RFs, but aims at building more accurate and diversified base classifiers. It randomly splits the feature space into several
subspaces, applies principal component analysis (PCA) [15]
separately on each subspace, and repeats the aforementioned
process to generate the diversified training data sets and
base classifiers for different feature subspaces. Studies have
demonstrated the successful application of RoF to remote
sensing imagery [2], [3], [16]–[18]. Moreover, RoF was found to provide more satisfactory results than bagging, AdaBoost, and RF ensembles in the classification of hyperspectral data [2].
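One RoF iteration as described above can be sketched as follows, assuming NumPy and scikit-learn (again an implementation assumption): the feature space is split randomly into subsets, PCA is fitted separately on each subspace, and the per-subset loadings are assembled into a rotation matrix used to transform the data for one base tree.

```python
# Sketch of one rotation-forest iteration (assumed libraries, toy data).
import numpy as np
from sklearn.decomposition import PCA
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 12))            # toy data: 200 samples, 12 bands
y = (X[:, 0] + X[:, 3] > 0).astype(int)

# Randomly split the 12 features into 3 disjoint subspaces.
n_subsets = 3
perm = rng.permutation(X.shape[1])
subsets = np.array_split(perm, n_subsets)

# Block-diagonal rotation matrix R built from per-subspace PCA loadings.
R = np.zeros((X.shape[1], X.shape[1]))
for idx in subsets:
    pca = PCA().fit(X[:, idx])            # PCA on this feature subspace
    R[np.ix_(idx, idx)] = pca.components_.T

# Train one base tree on the rotated data; repeating with new random
# splits yields the diversified ensemble members.
tree = DecisionTreeClassifier(random_state=0).fit(X @ R, y)
acc = tree.score(X @ R, y)
```

The full algorithm repeats this with fresh random feature splits (and, in the original formulation, class/sample resampling before each PCA) so that every tree sees a differently rotated view of the data.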
Recently, several approaches have been proposed to improve the performance of RoF [6], [17], [18]. Lu et al. [6] increase the
classification accuracy of RoF by improving the effectiveness
of base classifiers. A cost-sensitive decision tree is recom-
mended to replace the standard decision tree [12] as a base
classifier in their experiment. Li et al. [17] construct the RoF
with improved performance by using an AdaBoost ensemble instead of a single classifier as the base classifier. Xia et al. [18]
proposed a high-performance RoF by building decision trees for each subfeature set. However, these methods treat all instances equally, and the potential of informative instances is not taken into account. Furthermore, these algorithms generate base classifiers independently of one another, and some of these base classifiers not only increase the computational complexity of the algorithm but also decrease the ensemble performance.
The major contribution of this letter is to propose a novel
weight-based RoF (WRoF) algorithm. The main idea of the
new method is to guide the growth of trees adaptively via
exploring the potential of important instances. The impor-
tance of a training instance is reflected by a dynamic weight
function. The remainder of this letter is organized as follows.
In Section II, we introduce the dynamic weight function and
the proposed method WRoF. The experimental studies are
presented in Section III. Section IV presents the conclusions and future work.
II. METHODS
Our WRoF algorithm is inspired by boosting. However,
boosting updates the weights of training samples at each
1545-598X © 2017 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.