2.2 R-ELM
ELM has attracted much attention for its extremely fast training speed and good generalization performance. However, it is still based on the empirical risk minimization principle [see Eq. (6)] and tends to generate over-fitting models. Consequently, the trained ELM may behave very differently when the test data deviate only slightly from the training data, and the problem becomes more serious when the training set contains corrupted data such as outliers.
According to statistical learning theory, a model with good generalization ability should consider not only the empirical risk but also the structural risk and pursue the best tradeoff between the two. Based on this idea, a regularized ELM (R-ELM) [24, 25] was proposed to seek the $\beta$ that minimizes the following cost function:
$$J_R(\beta) = \| H\beta - T \|^2 + \lambda \| \beta \|^2, \qquad (9)$$
where $\| H\beta - T \|^2$ is the sum of squared training errors, which can be regarded as the empirical risk, $\| \beta \|^2$ is the squared norm of the network output weight vector, which represents the structural risk, and $\lambda$ is a positive real value called the regularization parameter that balances the two risks.
The cost function is minimized by differentiating (9) with respect to $\beta$ and setting the result to zero, which yields the following regularized normal equation:
$$\left( H^T H + \lambda I \right) \beta = H^T T, \qquad (10)$$
where $I$ is an identity matrix with the same dimensions as $H^T H$. The estimator of $\beta$ from Eq. (10) is given by
$$\hat{\beta} = \left( H^T H + \lambda I \right)^{-1} H^T T. \qquad (11)$$
Compared with ELM, R-ELM replaces the LS solution [Eq. (8)] with the generalized ridge regression estimator [Eq. (11)], which provides better stability and generalization ability for noisy data. Moreover, the added regularization term makes the correlation matrix $H^T H$ nonsingular, so the matrix inversion method can be applied directly. A more complete analysis of R-ELM can be found in [26], where the authors extend the study to generalized SLFNs with different feature mappings as well as kernels.
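As an illustration only, the following minimal NumPy sketch computes the ridge estimator of Eq. (11) for an SLFN with a sigmoid activation; the uniform random input weights and biases, the function names, and the default values of n_hidden and lam are assumptions for illustration, not the exact configuration used in this paper.

```python
import numpy as np

def relm_fit(X, T, n_hidden=100, lam=0.1, seed=None):
    """Minimal R-ELM sketch: random hidden layer plus the ridge solution of Eq. (11)."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    A = rng.uniform(-1.0, 1.0, size=(d, n_hidden))    # random input weights a_i
    b = rng.uniform(-1.0, 1.0, size=n_hidden)         # random hidden biases b_i
    H = 1.0 / (1.0 + np.exp(-(X @ A + b)))            # hidden layer output matrix H
    # beta_hat = (H^T H + lam * I)^{-1} H^T T   -- Eq. (11)
    beta = np.linalg.solve(H.T @ H + lam * np.eye(n_hidden), H.T @ T)
    return A, b, beta

def relm_predict(X, A, b, beta):
    """Network output for new inputs using the learned output weights."""
    H = 1.0 / (1.0 + np.exp(-(X @ A + b)))
    return H @ beta
```

Setting lam to zero recovers the ordinary LS solution of ELM, provided $H^T H$ is nonsingular.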
2.3 OSELM and R-OSELM
As a sequential version of the batch ELM algorithm, OSELM adopts a recursive way to compute the LS solution and may also encounter ill-posed problems due to the unavoidable presence of noise or outliers. Similar to R-ELM, an improved version of OSELM called regularized OSELM (R-OSELM) [27] was proposed to improve the stability of OSELM while maintaining the same sequential learning ability.
The R-OSELM algorithm uses the same cost function [Eq. (9)] as R-ELM and seeks the optimal regularized solution in a sequential learning fashion. As in OSELM, the learning process of R-OSELM consists of an initialization phase followed by a sequential learning phase, the only difference being a regularization term added to stabilize the initial output weights. The one-by-one R-OSELM algorithm is summarized below.
In the initialization phase, given an initial training set $X_{k-1} = \{ (x_j, t_j) \mid j = 1, \ldots, k-1 \}$, according to Eq. (11) the initial output weights are given by
$$\beta_{k-1} = P_{k-1} H_{k-1}^T T_{k-1}, \qquad (12)$$
where $P_{k-1} = \left( H_{k-1}^T H_{k-1} + \lambda I \right)^{-1}$, $H_{k-1} = \left[ h_1^T \; h_2^T \; \cdots \; h_{k-1}^T \right]^T$ and $T_{k-1} = \left[ t_1 \; t_2 \; \cdots \; t_{k-1} \right]^T$.
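A minimal sketch of this initialization phase is given below, assuming the initial hidden layer output matrix $H_{k-1}$ and target matrix $T_{k-1}$ have already been formed; the function name roselm_init and the default value of the regularization parameter are assumptions for illustration.

```python
import numpy as np

def roselm_init(H_init, T_init, lam=0.1):
    """Initialization phase of R-OSELM [Eq. (12)]:
    P = (H^T H + lam * I)^{-1},  beta = P H^T T."""
    n_hidden = H_init.shape[1]
    P = np.linalg.inv(H_init.T @ H_init + lam * np.eye(n_hidden))
    beta = P @ H_init.T @ T_init
    return P, beta
```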
In the sequential learning phase, the recursive least-squares algorithm is used to update the output weights continually. Suppose now that we receive another sample $(x_k, t_k)$; the corresponding partial hidden layer output matrix is calculated as $h_k = \left[ G(a_1, b_1, x_k) \; \cdots \; G(a_n, b_n, x_k) \right]$, and the output weight update equations are determined by
$$P_k = P_{k-1} - \frac{P_{k-1} h_k^T h_k P_{k-1}}{1 + h_k P_{k-1} h_k^T},$$
$$\beta_k = \beta_{k-1} + P_k h_k^T \left( t_k - h_k \beta_{k-1} \right). \qquad (13)$$
As seen from Eq. (13), the output weights are updated recursively based only on the newly arrived sample, which is discarded as soon as it has been learnt. The above one-by-one R-OSELM algorithm can easily be extended to the chunk-by-chunk case. In addition, if the regularization parameter $\lambda$ in the initial solution [Eq. (12)] equals zero, R-OSELM reduces to the original OSELM.
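The sequential update of Eq. (13) can be sketched as follows for the one-by-one case, pairing with the initialization sketch above; the row-vector conventions for $h_k$ and $t_k$ and the function name roselm_update are assumptions for illustration.

```python
import numpy as np

def roselm_update(P, beta, h_k, t_k):
    """One-by-one sequential update of R-OSELM [Eq. (13)].
    h_k: hidden layer output row vector (1 x n_hidden) for the new sample x_k,
    t_k: corresponding target row vector (1 x n_outputs)."""
    h_k = np.atleast_2d(h_k)
    t_k = np.atleast_2d(t_k)
    # P_k = P_{k-1} - P_{k-1} h_k^T h_k P_{k-1} / (1 + h_k P_{k-1} h_k^T)
    denom = 1.0 + float(h_k @ P @ h_k.T)
    P = P - (P @ h_k.T @ h_k @ P) / denom
    # beta_k = beta_{k-1} + P_k h_k^T (t_k - h_k beta_{k-1})
    beta = beta + P @ h_k.T @ (t_k - h_k @ beta)
    return P, beta
```

Calling roselm_update once per newly arrived sample, starting from the output of roselm_init, reproduces the recursion in Eq. (13) without storing past data.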
3 Proposed M-OSELM
In this section, we first present a novel M-estimator-based learning model. Next, a recursive solution to the M-estimator model is derived, and a sequential parameter estimation approach is introduced to estimate the threshold parameter of the M-estimator function for online outlier detection. Finally, a robust online sequential learning algorithm named M-OSELM is proposed.
3.1 M-estimator-based learning model
As described in Sect. 2, the learning rules of the ELM and
OSELM are based on the LS criterion, which minimizes