基于粗糙集理论与遗传算法的股票价格预测混合模型

需积分: 10 165 浏览量更新于2024-07-17 收藏 727KB PDF 举报

"基于粗糙集理论与遗传算法的股票价格混合模型" 在股票市场中，技术分析是一种预测股票价格的有效方法。尽管专业的股票分析师和基金经理通常会根据客观的技术指标做出主观判断，但对非专业人士来说，由于需要考虑的复杂技术指标过多，应用这种预测技术非常困难。过去的许多预测模型存在两个主要缺点：(1)时间序列模型如自回归滑动平均模型(ARMA)和自回归条件异方差性(ARCH)需要对变量进行统计假设来构建数学方程式的预测模型，这对股票投资者来说不易理解；(2)一些人工智能(AI)算法，如神经网络(NN)，挖掘出的规则不易实现。本文提出了一种结合粗糙集理论和遗传算法的混合模型用于股票价格预测。粗糙集理论是一种处理不完全或不确定信息的工具，能够简化数据并提取关键特征，而遗传算法则是一种全局优化策略，能够搜索大量的解决方案空间以找到最优解。通过结合这两种方法，可以克服传统模型的不足，提供更易于理解和实现的预测规则。首先，粗糙集理论被用来从大量技术指标中提炼出影响股票价格的关键因素。这涉及到对原始数据的预处理，包括数据归一化和属性约简，以减少冗余和复杂性。然后，遗传算法被应用于生成一组优化的股票预测规则，这些规则基于选定的技术指标。遗传算法通过模拟自然选择和遗传过程来逐步优化规则集，确保生成的规则既具有预测准确性，又易于解释。文章进一步讨论了两种方法在模型构建中的具体应用：累积概率分布法和最小熵原理法。累积概率分布法用于评估不同技术指标的预测能力，而最小熵原理法则用于寻找最简化的规则集，以减少预测的不确定性。这些方法的结合使得模型能够在保留预测精度的同时，提高可解释性和实用性。该混合模型为股票市场的非专业投资者提供了一种实用的工具，他们可以通过理解和应用由模型生成的简单规则来进行股票价格预测。同时，这种方法也为未来的研究提供了新的思路，即如何将复杂的数据分析技术转化为更直观、用户友好的预测模型。通过不断优化和改进，这种基于粗糙集理论和遗传算法的模型有可能进一步提升股票预测的准确性和效率，对投资决策提供有力支持。

Assume that the value of the segment point is being sought for a sample in the range between x

and x

. An entropy equa-

tion is written for the regions ½x

; x and ½x; x

, and denotes the ﬁrst region p and the second region q. Entropy with each value

of x is expressed as: [14]

SðxÞ¼pðxÞS

ðxÞþqðxÞS

ðxÞ ð7Þ

where

ðxÞ¼½p

ðxÞln p

ðxÞþp

ðxÞln p

ðxÞ

ðxÞ¼½q

ðxÞln q

ðxÞþq

ðxÞln q

ðxÞ

ð8Þ

and where p

ðxÞ and q

ðxÞ are the conditional probabilities that the class k sample is in region ½x

; x

þ x and ½x

þ x; x

,

respectively, and pðxÞ and q(x) are the probabilities that all samples are in region ½x

; x

þ x and ½x

þ x; x

, respectively.

pðxÞþqðxÞ¼1 ð9Þ

The value of x that gives the minimum entropy is the optimum value of the segment point. The entropy estimates of pk(x)

and qk(x), and p(x) and q(x) are calculated, as follows: [14]

pkðxÞ¼

ðxÞþ1

nðxÞþ1

ð10Þ

qkðxÞ¼

ðxÞþ1

NðxÞþ1

ð11Þ

pðxÞ¼

nðxÞ

ð12Þ

qðxÞ¼1  pðxÞ ð13Þ

where nk(x) is the number of class k samples located in ½x1; x1 þ x, n(x) is the total number of samples located in ½x1; x1 þ x,

Nk(x) is the number of class k samples located in ½x1 þ x; x2, N(x) is the total number of samples located in ½x1 þ x; x2, and n

is the total number of samples in [x1, x2].

2.4. Rough set theory

Rough sets theory (RST) was proposed by Pawlak [33–37] in 1982. In recent years, RST has been used in economic and

ﬁnancial prediction. Many researchers have applied RST to discover trading rules [20,48]. The concept of RST is founded

on the assumption that with every associated object of the universe of discourse, some information objects characterized

by the same information are indiscernible in the view of the available information about them. Any set of all indiscernible

objects is called an elementary set and forms a basic granule of knowledge about the universe. Any union of elementary sets

is referred to as a precise set; otherwise the set is rough.

With any rough sets, a pair of precise sets, called the lower and upper approximation,

BX ¼fxj½x

# Xg and

BX ¼fxj½x

\ X – ;g of the rough sets, is associated [33]. The lower approximation consists of all objects that deﬁnitely be-

long to the set, and the upper approximation contains all objects that possibly belong to the set. The difference between the

upper and the lower approximation constitutes the boundary region, BN

ðxÞ¼BX  BX, of the rough sets. The set X is called

‘‘rough” (or ‘‘roughly deﬁnable”) with respect to the knowledge in B, if the boundary region is non-empty. The basic notions

in rough sets are shown in Fig. 1.

The RST is a series of logical reasoning procedures used for analyzing an information system. An information system can

be seen as a decision table, denoted by S ¼ðU; A; C; DÞ, where U is the universe of discourse, A is a set of primitive features,

and C; D  A are two subsets of features, assuming that A ¼ C [D and C \ D ¼;, where C is called the condition attribute and

D is the decision attribute. The measure to describe the inexactness of approximation classiﬁcations is called the quality of

Fig. 1. Basic notions of rough sets.

C.-H. Cheng et al. / Information Sciences 180 (2010) 1610–1629

1613

剩余19页未读，继续阅读

BoBoZhang

粉丝: 0

基于粗糙集理论与遗传算法的股票价格预测混合模型

最新资源