基于粗糙集的动态约简计算优化方法探讨

120 浏览量更新于2024-08-26 收藏 257KB PDF 举报

本文主要探讨了基于粗糙集的动态约简计算分析，针对信息数据的约简与动态处理。粗糙集理论是数据挖掘领域的重要工具，它允许在不确定性环境中对复杂的数据进行简化和抽象，以便于理解和决策。首先，研究者介绍了信息系统的约简方法，强调了数据通常以二维形式或矩阵结构存在，这种结构使得粗糙集理论能够有效地处理和处理大量信息。作者Carine Pierrette Mukamakuza和Jiayang Wang来自中国中南大学的信息科学与工程学院，他们的工作关注如何通过构建可区分性矩阵来识别数据中的所有可能约简。可区分性矩阵是一种关键工具，它衡量了数据属性之间的区分度，有助于确定哪些属性是冗余的，哪些是必不可少的。通过Java编程和Weka工具，他们寻找并选择最佳（最优）约简，即频率最高、信息损失最小的约简方式。此外，文章还引入了三种动态约简计算方法。这些方法包括： 1. 新的对象约简类型：这是一种创新的方法，它考虑了数据在不同时间点的动态变化，从而能够捕捉到随着时间推移而出现的新特征或属性重要性的变化。这种动态特性使得动态约简在实时数据分析和预测模型中具有显著优势。 2. 基于阈值的动态约简：这种方法依据预设的阈值，动态地筛选出对当前数据集最有影响力的属性，适用于数据流和在线学习场景，能够实时适应环境变化。 3. 基于学习的动态约简：利用机器学习算法，该方法能够根据数据的不断学习过程动态调整约简，确保模型的适应性和准确性随着数据更新而持续优化。这篇研究论文深入分析了粗糙集理论在动态约简计算中的应用，旨在提高数据处理的效率和有效性，对于数据挖掘、知识发现以及机器学习等领域具有重要的理论和实践价值。通过结合粗糙集的理论基础和动态策略，研究人员提供了一种处理大规模、实时变化数据的有效框架。

2014 10th International Conference on Natural Computation

Dynamic Reducts Computation Analysis Based on Rough Sets

Carine Pierrette Mukamakuza

School of Information Science and Engineering

Central South University

Changsha, China

E-mail: kuzacari808@gmail.com

Jiayang Wang

School of Information Science and Engineering

Central South University

Changsha, China

E-mail: csuwjy@mail.csu.edu.cn

Li Li

Shenzhen Graduate School

Harbin Institute of Technology

Shenzhen, China

Abstract—In this paper analysis of reduction and dynamic

reducts of an information data is presented. The method of

reduction in information system is explained first, the

information was assumed to be in a two-dimension or in a

matrix form. A discernibility matrix of the data was

constructed, and then all reducts from that matrix were found.

The best (optimum) reduct was selected from all reducts; that

was achieved by considering the one with the highest level of

frequency by using Java programming and Weka tool. Three

methods of dynamic reducts computation are introduced

namely: The new type of Reduct in the object-oriented rough

set model which is called dynamic reduct , the method of

dynamic reduct calculation based on calculating of reduct

traces and the generation F-dynamic reduct using cascading

Hashes. The analysis of those three methods led to their

improvement through adding one step in each algorithm which

was the method of getting the optimum reducts from all

reducts calculated in first steps of each algorithm. As result,

the dynamic reducts were generated from optimum reducts

and not from all reducts. Thus by generating an improved

dynamic reducts, improvement of those three methods for

calculation of dynamic reducts is achieved.

Keywords-component: Information system; Optimum reduct;

Dynamic reduct ; knowlegde discovery

I. INTRODUCTION

The last few years have seen a remarkable growth in the

use of rough set theory and applications for solving various

problems in engineering fields. This is mainly because

optimum reduct and dynamic reduct algorithms have seen

tremendous improvements in the last few years, allowing

larger problem instances to be solved in different

applications domains such as computer intelligence,

mechanical, electrical, electronics, medical. Rough set theory

deals with reasoning and approximation about data. In

approximation aspect, the lower approximation and the

upper approximation are the basic concepts by

indiscernibility relations which illustrate set-theoretic

approximations of any given subset of data. The essential

part of an information system is the reduct, from which all

objects discernible are discerned in the original information

system. Another important part is the core which is a

common part of all reducts. The discernibility matrix is used

to compute the core and reducts.

The purpose of this work is to introduce three methods

of dynamic reducts computation namely: a dynamic reduct in

the object–oriented rough set models; dynamic reducts based

on calculating reduct traces; and the generation (F,ε)-

dynamic reduct using cascading Hashes. Method of

calculating the optimum reducts which improves the three

methods above is also proposed.

Rough set theory is the foundation of rough system

theory and applications. Rough set theory, introduced by

Professor Z.Pawlak in the early 1980s [1], is a new

mathematical tool to deal with imprecise, uncertain, and

vague information [2]. It has been widely applied in many

fields such as machine learning, data mining, artificial

intelligence, etc. Even though the research of knowledge

reduct in rough set is mostly based on complete information

system, it is meaningful to extend rough set theory into

incomplete information system and design an efficient

knowledge reduct algorithm.

II. P

RELIMINARY CONCEPTS OF ROUGH SET THEORY

A. Definitions

1) Information System

Rough Set approach can be presented to incomplete

information systems, i.e. to systems in which attribute values

for objects may be unknown (missing, null) and this theory’s

main concern can be devoted to finding rules from such

systems. A theory of Rough Set (RS) introduced by Z.

Pawlak defines formally an information system as a system

The project supported by National Natural Science Foundation of China,Grant No. 61173052.

The project supported by Hunan Provincial Natural Science Foundation of China,Grant No. 14JJ4007.

——————————————————————————————————————————————————————————————————————————————

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38702844

粉丝: 2
资源: 921

基于粗糙集的动态约简计算优化方法探讨

粗糙集属性约简python

基于粗糙集的属性约简算法研究

粗糙集属性约简算法源码MATLAB

基于粗糙集属性约简以及概念格的关联规则挖掘分析

基于变精度粗糙集属性约简

邻域粗糙集属性约简,粗糙集属性约简步骤,Python源码.zip

邻域粗糙集属性约简,粗糙集属性约简步骤,Python源码.rar

基于粗糙集信息系统约简的算法matlab实现.rar_matlab_粗糙集_粗糙集 matlab_粗糙集 MATLAB

邻域粗糙集属性约简_粗糙集_邻域粗糙集_邻域属性约简

基于粗糙集的分层约简算法研究

最新资源