The feature extraction problem is implemented as a search for some new features that are
more relevant for classification and are defined (in some language) by means of the existing
features. These new features can be e.g. of the form a ∈ (0.5, 1) or 2a + 3b > 0.75.
Their values on a given object are computed from given values of conditional attributes on the
object. The new features are often binary, taking value 1 on a given object iff the specified
condition is true on this object. In the case of symbolic value attributes we look for new
features like a ∈ {French, English, Polish} with value 1 iff a person speaks any of these
languages. The important issues in feature extraction are problems of discretization of real
value attributes, grouping of symbolic (nominal) value attributes, and searching for new
features defined by hyperplanes or more complex surfaces defined over existing attributes.
In Section 1.7.2 discretization based on the rough set and Boolean reasoning approach is
discussed. Some other approaches to feature extraction that are based on Boolean reasoning
are also discussed. All cases of the feature extraction problem mentioned above may be
described in terms of searching for relevant features in a particular language of features.
Boolean reasoning plays the crucial role of an inference engine for feature selection problems.
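The kinds of new features mentioned above can be illustrated by a short sketch. The
following Python fragment (not from the text; attribute names, thresholds, and the example
object are invented for illustration) builds three binary features: an interval test on a
real-valued attribute, a hyperplane (weighted-sum) test, and a symbolic-value membership
test, each taking value 1 iff its condition holds on the object.

```python
def interval_feature(obj, attr, low, high):
    """1 iff the real-valued attribute falls in the open interval (low, high)."""
    return 1 if low < obj[attr] < high else 0

def hyperplane_feature(obj, weights, threshold):
    """1 iff a weighted sum of attribute values exceeds a threshold."""
    return 1 if sum(w * obj[a] for a, w in weights.items()) > threshold else 0

def membership_feature(obj, attr, values):
    """1 iff a symbolic attribute value belongs to the given set."""
    return 1 if obj[attr] in values else 0

# An invented object described by conditional attributes.
person = {"a": 0.6, "b": 0.1, "language": "Polish"}

f1 = interval_feature(person, "a", 0.5, 1.0)             # a in (0.5, 1)
f2 = hyperplane_feature(person, {"a": 2, "b": 3}, 0.75)  # 2a + 3b > 0.75
f3 = membership_feature(person, "language",
                        {"French", "English", "Polish"})
```

Each helper returns a binary feature value computed from the conditional attributes,
exactly in the spirit of the feature languages described above.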
Feature extraction and feature selection are usually implemented in a pre-processing stage
of the whole modeling process. There are some other aspects related to this stage of modeling
such as, for instance, elimination of noise from the data or treatment of missing values. More
information related to these problems can be found in [344, 345] and in the bibliography
included in these books.
In the next stage of the synthesis of target concept approximations, descriptions of the target
concepts are constructed from the extracted relevant features (relevant primitive concepts) by
applying some operations. In the simplest case, when the Boolean connectives ∨ and ∧ are
chosen, these descriptions form the so-called decision rules. In Sect. 1.7.3 we give a short
introduction to methods for decision rule synthesis that are based on rough set methods and
Boolean reasoning. Two main cases of decision rules are discussed: exact (deterministic) and
approximate (non-deterministic) rules. More information on decision rule synthesis using the
rough set approach may be found in [344, 345] and in the bibliography included in these books.
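The distinction between exact and approximate rules can be made concrete with a toy
decision table (the table and attribute names below are invented for illustration, not taken
from the text). A rule "if conditions then d = v" is exact iff every object matching the
conditions has decision v; otherwise it is approximate, and its confidence is the fraction
of matching objects with that decision.

```python
# A toy decision table: conditional attributes a, b and decision d.
table = [
    {"a": 1, "b": 0, "d": "yes"},
    {"a": 1, "b": 0, "d": "yes"},
    {"a": 1, "b": 1, "d": "no"},
    {"a": 1, "b": 1, "d": "yes"},  # makes rules conditioned on b=1 approximate
]

def rule_support_and_confidence(table, conditions, decision):
    """Count matching objects and the fraction of them with the given decision."""
    matching = [row for row in table
                if all(row[a] == v for a, v in conditions.items())]
    hits = sum(1 for row in matching if row["d"] == decision)
    confidence = hits / len(matching) if matching else 0.0
    return len(matching), confidence

# Exact (deterministic) rule: every object with a=1, b=0 has d="yes".
support, conf = rule_support_and_confidence(table, {"a": 1, "b": 0}, "yes")
# Approximate (non-deterministic) rule: objects with b=1 split between decisions.
support2, conf2 = rule_support_and_confidence(table, {"b": 1}, "yes")
```

A rule with confidence 1.0 is deterministic on the table; any confidence below 1.0 marks
the rule as approximate.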
Finally, it is necessary to estimate the quality of the constructed approximations of target
concepts. Let us observe that the "building blocks" from which different approximations of
target concepts are constructed may be inconsistent on new, so far unseen objects (i.e. some
objects from the same class may be classified to disjoint concepts). This creates a necessity
to develop methods for resolving these inconsistencies. The quality of target concept
approximations can be considered acceptable if the inconsistencies may be resolved by using
these methods. In Sect. 1.7.4 some introductory comments on this problem are presented, and
references are given to rough set methods that resolve conflicts among different decision
rules by voting for the final decision.
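One simple form such conflict resolution can take is weighted voting, sketched below under
invented assumptions (the rules, their weights, and the test object are illustrative, not the
specific scheme of Sect. 1.7.4): each rule matching the object votes for its decision with a
weight, and the decision with the greatest total weight wins.

```python
from collections import defaultdict

def classify_by_voting(rules, obj):
    """rules: list of (conditions, decision, weight) triples.
    Every rule whose conditions match the object casts a weighted vote;
    the decision with the largest total vote is returned."""
    votes = defaultdict(float)
    for conditions, decision, weight in rules:
        if all(obj.get(a) == v for a, v in conditions.items()):
            votes[decision] += weight
    return max(votes, key=votes.get) if votes else None

# Invented conflicting rules; weights might come e.g. from rule support.
rules = [
    ({"a": 1}, "yes", 3.0),
    ({"b": 1}, "no", 1.0),
    ({"a": 1, "b": 1}, "no", 1.5),
]
decision = classify_by_voting(rules, {"a": 1, "b": 1})  # "yes": 3.0 vs "no": 2.5
```

Here two rules vote "no" with total weight 2.5 and one rule votes "yes" with weight 3.0,
so the conflict is resolved in favor of "yes".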
1.7.1 Significance of Attributes and Approximate Reducts
One of the first ideas [297] was to consider as relevant features those in the core of an
information system, i.e. features that belong to the intersection of all reducts of the
information system. It can be easily checked that several definitions of relevant features
that are used by the machine learning community [4] can be interpreted by choosing a relevant
decision system corresponding to the information system.
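The relation between reducts and the core can be illustrated by brute force on a tiny
invented decision table (the data and attribute names below are illustrative only). A subset
of condition attributes preserves discernibility if objects agreeing on it never carry
different decisions; a reduct is a minimal such subset, and the core is the intersection of
all reducts.

```python
from itertools import combinations

# An invented decision table: (condition attributes, decision).
table = [
    ({"a": 0, "b": 0, "c": 0}, "no"),
    ({"a": 0, "b": 1, "c": 1}, "yes"),
    ({"a": 1, "b": 0, "c": 0}, "yes"),
    ({"a": 1, "b": 1, "c": 1}, "no"),
]
attrs = ["a", "b", "c"]

def consistent(subset):
    """True iff objects agreeing on `subset` never have different decisions."""
    seen = {}
    for conds, dec in table:
        key = tuple(conds[a] for a in subset)
        if seen.setdefault(key, dec) != dec:
            return False
    return True

# Enumerate subsets by increasing size; keep only minimal consistent ones.
reducts = []
for r in range(1, len(attrs) + 1):
    for subset in combinations(attrs, r):
        if consistent(subset) and \
                not any(set(red) <= set(subset) for red in reducts):
            reducts.append(subset)

core = set(attrs).intersection(*map(set, reducts))
```

For this table the reducts are {a, b} and {a, c}, so the core is {a}: attribute a occurs
in every reduct and cannot be dropped without losing discernibility. This brute-force
enumeration is exponential in the number of attributes; it only illustrates the definitions.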
Another approach is related to dynamic reducts (see e.g. [19]), i.e. conditional attribute
sets appearing "sufficiently often" as reducts of samples of the original decision table. The
attributes belonging to the "majority" of dynamic reducts are defined as relevant. The value