基于BPA距离的不确定信息聚类方法

12 浏览量更新于2024-08-28 收藏 273KB PDF 举报

"基于BPA间距离的不确定信息聚类" 本文提出了一种新的证据聚类方法，专门用于处理多源信息分析时的信息聚类问题。在该方法中，引入了信念函数（Belief Functions，简称BPAs）之间的距离来构建聚类矩阵。通过将距离矩阵转换成向量，以此为基础进行聚类。这种方法与传统的聚类算法相比，通过实例验证了其有效性和优越性。在信息技术领域，尤其是在数据分析和决策支持系统中，处理不确定或模糊数据是常见的挑战。Dempster-Shafer理论（也称为证据理论或DS理论）提供了一种框架，用于处理和融合来自不同来源的不完整或冲突的信息。BPAs是DS理论中的核心概念，代表了对某个命题的不确定性信念分布。传统的聚类方法通常依赖于确定性的数据，如欧几里得距离或余弦相似度。然而，在处理不确定信息时，这些方法可能无法充分捕捉数据的复杂性和不确定性。因此，作者提出的新方法引入了BPAs之间的距离，这使得能够量化和比较不同信息源的不确定性程度，从而更准确地进行聚类。在新方法中，首先计算每对BPAs之间的距离，形成一个距离矩阵。然后，这个距离矩阵被转化为向量，这允许使用基于向量的方法来进行聚类。这种方法的一个关键优点是它能够处理不一致和矛盾的信息，这是传统方法难以处理的。文章通过几个示例集展示了新方法的性能，与现有的聚类算法进行了对比。结果显示，基于BPA距离的聚类方法在处理不确定信息时能提供更合理的聚类结果，证明了其在处理复杂信息环境下的适用性和有效性。此外，由于BPAs距离考虑了信息的不确定性和矛盾，因此这种方法特别适用于传感器网络数据融合、图像识别、社交网络分析等场景，其中数据往往带有噪声、不完全或有冲突。这篇论文为不确定信息的处理和分析提供了一个新的视角，即利用BPAs之间的距离来进行聚类，为处理复杂和模糊信息的问题提供了有力的工具。这一研究对于理解和改进多源信息融合技术，以及开发更加智能和适应性强的决策系统具有重要意义。

Uncertain Information Clustering Based on Distance Between BPAs

Ya Li

,Yajuan Zhang

, Daijun Wei

1,2

, Yong Deng

1. School of Computer and Information Science, Southwest University, Chongqing, 400715,China

E-mail: ydeng@swu.edu.cn

2. School of Science, Hubei Institute for Nationalities, Enshi, 445000, China

Abstract: It is necessary to cluster the information according to their sources when analyzing multi-source information.

In this paper, a new evidential clustering method is proposed. In the proposed method, pairwise distance between BPAs

have been introduced to form a matrix for clustering. The clustering method is based on vector which is transformed

from distance matrix. Illustrative example with several sets demonstrate the validity of the proposed method as compared

to other methods.

Key Words: Clustering, Dempster-Shafer theory, Distance between BPAs

1 INTRODUCTION

In recent years, a great deal of attention has been paid to

the analysis of imprecise or fuzzy data. Several references

may be found in the literature focusing on inferential statis-

tics, regression or classiﬁcation. Along with variety of of

sensors are been used to detect object, a great deal of infor-

mation has been got. when analyzing these kind of multi-

source information, it is necessary to cluster the informa-

tion according to their source. However, a wide variety of

information expression form makes information analyzing

more difﬁcult. Under such circumstance, all the informa-

tion can be uniﬁed into one form, namely, evidence, for

later information fusion using.

In [1], a problem of clustering multi-source information de-

noted by evidence is investigated, and an evidence cluster-

ing standard is given. In addition, an idea of transforma-

tion from the evidence interspaces to Euclidean interspace

is presented, then the HCM clustering algorithm is used to

cluster the multi-source information. We consider that the

transformation in [1] itself is not so reasonable, the reasons

are presented in the following context.

The method presented in this paper uses a different ap-

proach. Since there is no bijection between BPAs to pignis-

tic probabilities, the method in [1] might not make use of

all the information of the BPAs. It appears useful to apply

pairwise distance between BPAs for clustering rather than

transform evidence interspaces to Euclidean interspace.

The work is partially supported National Natural Science Foundation

of China, Grant No.60874105, 61174022, Program for New Century Ex-

cellent Talents in University, Grant No.NCET-08-0345, Chongqing Natu-

ral Science Foundation, Grant No. CSCT, 2010BA2003, the Fundamental

Research Funds for the Central Universities Grant. No XDJK2010C030,

Grant. No XDJK2011D002, Doctor Funding of Southwest University

Grant No. SWU110021. The ﬁrst author also greatly appreciates the sup-

port by the School of Computer and Information Sciences of Southwest

University Scientiﬁc and Technological Innovation Fund for Students.

*Corresponding author: Yong Deng, School of Computer and In-

formation Sciences, Southwest University, Chongqing, 400715, E-mail:

ydeng@swu.edu.cn.

Previous related work addressing distances between BPAs

deserves to be mentioned here. Zouhal and Denoeux

[2] introduced a distance based on the mean square er-

ror between pignistic probabilities to improve a classiﬁ-

cation algorithm based on the k-nearest neighbor rule and

Dempster-Shafer’s theory. Jousselme and Grenier [3] in-

troduced a principled distance between two basic probabil-

ity assignments(BPAs)(or two bodies of evidence) based on

quantiﬁcation of the similarity between sets. They gave a

geometrical interpretation of BPAs and shown that the pro-

posed distance satisﬁed all the requirements for a metric.

The distance function in this paper is adapted from [3].

The proposed method deal with data in a very natural way

and to gain a full use of all the information of BPAs, as will

be shown by experimental results. The rest of the paper is

organized as follows.

First, the necessary background of Dempster-Shafer theory

is recalled in Section 2. Section 3 shows what the condi-

tions of metric spaces should satisfy. And then, distance

between two BPAs is presented. Hierarchical clustering is

simply introduced in section 4. In addition, illustrative ex-

ample, with synthetic data sets, are described in this Sec-

tion. Section 5 concludes this paper.

2 Dempster-Shafer theory

The Dempster-Shafer theory, ﬁrst proposed by Dempster

[4] and then developed by Shafer [5], is often regarded

as an extension of the bayesian theory of probability. As

a theory of reasoning under the uncertain environment,

Dempster-Shafer theory has an advantage of assigning the

probability to the subsets of the set composed of 𝑁 objects,

rather than to each of the individual objects. The probabil-

ity assigned to each subset is limited by a lower bound and

an upper bound, which respectively measure the total be-

lief and the total plausibility for the objects in the subset.

Furthermore, the Dempster-Shafer theory has the ability of

combining pairs of bodies of evidence or belief functions to

derive a new evidence or belief function. At present, some

3985

978-1-4577-2074-1/12/$26.00

2012 IEEE

下载后可阅读完整内容，剩余3页未读，立即下载

weixin_38529436

粉丝: 3

基于BPA距离的不确定信息聚类方法

Robust anti-synchronization of uncertain chaotic systems based on multiple-kernel least squares support vector machine modeling

Uncertain Computation-based Decision Theory 无水印原版pdf

Chaos Synchronization for Uncertain Fractional Order Chaotic Systems based on Mittag-Leffler Fractional Sliding Mode Control

Pricing Decision Study of Ecological Industry Chain Based on Uncertain Demand

A group decision-making method based on evidence theory in uncertain dynamic environment

H infinity controller design for uncertain networked controlsystems with scheduling strategy based on predicted error

Second-order integral sliding mode control for uncertain systems with control input time delay based on singular perturbation approach

A heuristic approach to effective and efficient clustering on uncertain objects

Robust Radio Mode Selection in Wirelessly Powered Communications with Uncertain Channel Information

Synthesis of Antenna Arrays under Uncertain Conditions using a PSO Based Algorithm

最新资源