A Rough Set-Based Incremental Learning Method for Dynamic Incomplete Information Systems


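"A rough set-based incremental knowledge learning method for dynamic incomplete information systems." This paper examines how to handle the dynamic evolution of the object set of an information system as new information arrives in rapidly growing data sets, with particular attention to missing data and incomplete information. The core of the work is an incremental learning method, built on rough set theory, that is suited to such dynamic incomplete information systems.

Rough set theory is a mathematical tool for handling uncertain and incomplete information; it supports knowledge discovery and decision analysis when complete information is unavailable. The authors propose a matrix-based method that applies four extended relations (the tolerance relation, the similarity relation, the limited tolerance relation, and the characteristic relation) to incomplete information systems. The knowledge induced under these relations is expressed through three matrices, which are used to introduce and update knowledge dynamically. The support matrix counts the objects shared by each condition class and each decision class, the accuracy matrix gives the fraction of a condition class that falls into a decision class, and the coverage matrix gives the fraction of a decision class covered by a condition class. With these three matrices, the paper builds a framework in which the existing knowledge base can be updated when new information is added, so that it reflects the changes in the data set.

The paper also presents the workflow of the proposed method in detail and validates it experimentally on nine UCI data sets and on large-scale data sets containing millions of records. The reported results indicate that the method is feasible and effective for knowledge learning in dynamic incomplete information systems, including settings with large data volumes and continually changing information.

Incremental learning has a clear advantage in this setting: the system updates its model step by step as new data arrive instead of retraining from scratch each time. This matters most for large data sets, where full retraining can be slow and resource-intensive. Overall, the work offers a practical framework for knowledge discovery in dynamic information systems, drawing on rough set theory so that knowledge learning and decision making remain possible when data are incomplete and changing. The method is of both theoretical and practical interest for data mining, machine learning, and intelligent decision systems.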

1768 D. Liu et al. / International Journal of Approximate Reasoning 55 (2014) 1764–1786

Definition 6. (See [43,44,60].) Given a decision table $S = (U, C \cup D, V, f)$, $U/C = \{X_1, X_2, \cdots, X_m\}$ is a partition of objects under the condition attributes of $C$, where $X_i$ ($i = 1, 2, \cdots, m$) is an equivalence class of condition attributes; $U/D = \{D_1, D_2, \cdots, D_n\}$ is a partition of objects under the decision attributes of $D$, where $D_j$ ($j = 1, 2, \cdots, n$) is an equivalence class of decision attributes. $\forall X_i \in U/C$, $\forall D_j \in U/D$, the support, accuracy and coverage of a rule $X_i \to D_j$ are defined respectively as follows.

Support of $X_i \to D_j$: $\mathrm{Supp}(D_j|X_i) = |X_i \cap D_j|$

Accuracy of $X_i \to D_j$: $\mathrm{Acc}(D_j|X_i) = |X_i \cap D_j| / |X_i|$

Coverage of $X_i \to D_j$: $\mathrm{Cov}(D_j|X_i) = |X_i \cap D_j| / |D_j|$

where $|X_i|$ and $|D_j|$ denote the cardinalities of the sets $X_i$ and $D_j$, respectively. For a decision rule $X_i \to D_j$, $X_i$ is the predecessor of the decision rule and $D_j$ is the successor of the decision rule according to Definition 5.
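As a small worked illustration of the three measures, suppose a condition class and a decision class are given as below. These sets are made up for the example and are not taken from the paper.

$$
X_i = \{x_1, x_2, x_3\}, \qquad D_j = \{x_2, x_3, x_4, x_5\}, \qquad X_i \cap D_j = \{x_2, x_3\}
$$

$$
\mathrm{Supp}(D_j|X_i) = 2, \qquad
\mathrm{Acc}(D_j|X_i) = \frac{2}{3} \approx 0.667, \qquad
\mathrm{Cov}(D_j|X_i) = \frac{2}{4} = 0.5
$$

Two of the three objects matching the condition class carry the decision, and the rule accounts for half of the objects with that decision.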

Considering the massive data in real-life applications, we use matrices to simplify the problem. The support matrix, the accuracy matrix and the coverage matrix, as well as their propositions, are defined as follows [43,44].

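$$
\mathrm{Supp}(D|X) =
\begin{pmatrix}
\mathrm{Supp}(D_1|X_1) & \mathrm{Supp}(D_2|X_1) & \cdots & \mathrm{Supp}(D_n|X_1) \\
\mathrm{Supp}(D_1|X_2) & \mathrm{Supp}(D_2|X_2) & \cdots & \mathrm{Supp}(D_n|X_2) \\
\vdots & \vdots & \ddots & \vdots \\
\mathrm{Supp}(D_1|X_m) & \mathrm{Supp}(D_2|X_m) & \cdots & \mathrm{Supp}(D_n|X_m)
\end{pmatrix}
\tag{1}
$$

$$
\mathrm{Acc}(D|X) =
\begin{pmatrix}
\mathrm{Acc}(D_1|X_1) & \mathrm{Acc}(D_2|X_1) & \cdots & \mathrm{Acc}(D_n|X_1) \\
\mathrm{Acc}(D_1|X_2) & \mathrm{Acc}(D_2|X_2) & \cdots & \mathrm{Acc}(D_n|X_2) \\
\vdots & \vdots & \ddots & \vdots \\
\mathrm{Acc}(D_1|X_m) & \mathrm{Acc}(D_2|X_m) & \cdots & \mathrm{Acc}(D_n|X_m)
\end{pmatrix}
\tag{2}
$$

$$
\mathrm{Cov}(D|X) =
\begin{pmatrix}
\mathrm{Cov}(D_1|X_1) & \mathrm{Cov}(D_2|X_1) & \cdots & \mathrm{Cov}(D_n|X_1) \\
\mathrm{Cov}(D_1|X_2) & \mathrm{Cov}(D_2|X_2) & \cdots & \mathrm{Cov}(D_n|X_2) \\
\vdots & \vdots & \ddots & \vdots \\
\mathrm{Cov}(D_1|X_m) & \mathrm{Cov}(D_2|X_m) & \cdots & \mathrm{Cov}(D_n|X_m)
\end{pmatrix}
\tag{3}
$$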
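The three matrices can be computed directly from the set definitions. The following is a minimal sketch, assuming the two partitions are already available as lists of non-empty Python sets; the function name `rule_matrices` and the toy partitions are hypothetical and only serve the illustration.

```python
def rule_matrices(cond_classes, dec_classes):
    """Build the m x n support, accuracy and coverage matrices.

    Rows follow the condition classes X_i, columns follow the decision
    classes D_j. Every class is assumed to be a non-empty set.
    """
    # Supp(D_j|X_i) = |X_i ∩ D_j|
    supp = [[len(X & D) for D in dec_classes] for X in cond_classes]
    # Acc(D_j|X_i) = |X_i ∩ D_j| / |X_i|
    acc = [[supp[i][j] / len(X) for j in range(len(dec_classes))]
           for i, X in enumerate(cond_classes)]
    # Cov(D_j|X_i) = |X_i ∩ D_j| / |D_j|
    cov = [[supp[i][j] / len(D) for j, D in enumerate(dec_classes)]
           for i in range(len(cond_classes))]
    return supp, acc, cov


# Hypothetical toy partitions of U = {1, ..., 6} (not from the paper).
cond_classes = [{1, 2}, {3, 4, 5}, {6}]   # U/C, m = 3
dec_classes = [{1, 2, 3}, {4, 5, 6}]      # U/D, n = 2

supp, acc, cov = rule_matrices(cond_classes, dec_classes)
print(supp)  # [[2, 0], [1, 2], [0, 1]]
```

Plain nested lists keep the sketch dependency-free; for data of the size discussed in the paper, an array library would be the more natural choice.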

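Proposition 1. (See [43,44].) $\mathrm{Supp}(D_j|X_i) \geq 0$, $\forall X_i \in U/C$, $\forall D_j \in U/D$, $i = 1, 2, \cdots, m$, $j = 1, 2, \cdots, n$.

Proposition 2. (See [43,44].) $0 \leq \mathrm{Acc}(D_j|X_i) \leq 1$ and $\sum_{j=1}^{n} \mathrm{Acc}(D_j|X_i) = 1$, $\forall X_i \in U/C$, $i = 1, 2, \cdots, m$.

Proposition 3. (See [43,44].) $0 \leq \mathrm{Cov}(D_j|X_i) \leq 1$ and $\sum_{i=1}^{m} \mathrm{Cov}(D_j|X_i) = 1$, $\forall D_j \in U/D$, $j = 1, 2, \cdots, n$.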

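According to Definition 5, the relations among the three matrices in (1)–(3) can be written as:

$$
\mathrm{Acc}(D_j|X_i) = \frac{\mathrm{Supp}(D_j|X_i)}{\sum_{k=1}^{n} \mathrm{Supp}(D_k|X_i)}
\quad \text{and} \quad
\mathrm{Cov}(D_j|X_i) = \frac{\mathrm{Supp}(D_j|X_i)}{\sum_{k=1}^{m} \mathrm{Supp}(D_j|X_k)},
$$

namely, both "accuracy" and "coverage" can be expressed in terms of "support".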
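These relations say that the accuracy matrix is the support matrix with each row divided by its row sum, and the coverage matrix is the support matrix with each column divided by its column sum. A minimal sketch follows, assuming a support matrix given as a nested list with no all-zero row or column; the helper name `acc_cov_from_support` and the sample matrix are hypothetical.

```python
def acc_cov_from_support(supp):
    """Derive accuracy and coverage matrices from a support matrix.

    Assumes every row sum and every column sum is positive, which holds
    when each condition class and each decision class is non-empty.
    """
    row_sums = [sum(row) for row in supp]        # equals |X_i| for partitions
    col_sums = [sum(col) for col in zip(*supp)]  # equals |D_j| for partitions
    acc = [[s / row_sums[i] for s in row] for i, row in enumerate(supp)]
    cov = [[s / col_sums[j] for j, s in enumerate(row)] for row in supp]
    return acc, cov


# Hypothetical 3 x 2 support matrix (not from the paper).
supp = [[2, 0], [1, 2], [0, 1]]
acc, cov = acc_cov_from_support(supp)

# Propositions 2 and 3: accuracy rows and coverage columns each sum to 1.
rows_ok = all(abs(sum(row) - 1.0) < 1e-12 for row in acc)
cols_ok = all(abs(sum(col) - 1.0) < 1e-12 for col in zip(*cov))
print(rows_ok, cols_ok)  # True True
```

Under this view only the support counts need to be stored; the other two matrices can be recomputed from them whenever the counts change.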

Following Pawlak and Tsumoto's ideas [60,72,73], we choose two factors, accuracy and coverage, to describe knowledge in this paper. Knowledge is defined as follows.

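Definition 7. (See [43,44].) $\forall X_i$ ($i = 1, 2, \cdots, m$), $\forall D_j$ ($j = 1, 2, \cdots, n$), if $\mathrm{Acc}(D_j|X_i) \geq \alpha$ and $\mathrm{Cov}(D_j|X_i) \geq \beta$ hold, we call the rule $X_i \to D_j$ a kind of knowledge, where $\alpha \in (0.5, 1)$ and $\beta \in (0, 1)$.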

The values of α and β depend on the problem at hand. Generally, we choose decision rules with both high accuracy and high coverage [44].
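Selecting knowledge in the sense of Definition 7 amounts to thresholding the two matrices entry by entry. A minimal sketch, assuming the matrices are nested lists of equal shape; the function name `extract_knowledge`, the sample matrices and the threshold values are hypothetical, and indices are 0-based in the code.

```python
def extract_knowledge(acc, cov, alpha, beta):
    """Return index pairs (i, j) of rules X_i -> D_j kept as knowledge.

    A rule is kept when Acc >= alpha and Cov >= beta, with alpha in
    (0.5, 1) and beta in (0, 1) as required by Definition 7.
    """
    if not (0.5 < alpha < 1 and 0 < beta < 1):
        raise ValueError("need alpha in (0.5, 1) and beta in (0, 1)")
    return [(i, j)
            for i, row in enumerate(acc)
            for j, a in enumerate(row)
            if a >= alpha and cov[i][j] >= beta]


# Hypothetical accuracy and coverage matrices (not from the paper).
acc = [[1.0, 0.0], [1 / 3, 2 / 3], [0.0, 1.0]]
cov = [[2 / 3, 0.0], [1 / 3, 2 / 3], [0.0, 1 / 3]]

rules = extract_knowledge(acc, cov, alpha=0.6, beta=0.5)
print(rules)  # [(0, 0), (1, 1)]
```

The third condition class predicts the second decision class with accuracy 1, yet the rule is dropped because its coverage of 1/3 is below β; this is the reason both factors are used together.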

3. Incremental approaches for knowledge discovery based on extended rough sets

In this section, we focus on the knowledge updating process in an IIS when the object set evolves over time while the attribute set remains unchanged. Following the incremental approaches in [43,44], we first investigate the incremental strategies for adding and deleting objects in an IIS. To illustrate our method clearly, we divide the work into three parts. We provide several basic assumptions of the incremental approach for inducing knowledge in Section 3.1. We introduce a matrix updating strategy to achieve the incremental knowledge learning process in Section 3.2. An incremental ...
