An Incremental Learning Classification Algorithm for eHealth Networks: The Forgetting Factor Method

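"This research paper proposes an incremental learning classification algorithm for eHealth networks based on a forgetting factor, aiming to address the challenges of large-scale real-time data processing. The algorithm improves the Support Vector Machine-Stochastic Gradient Descent (SVMSGD) algorithm by updating the training data, which raises learning efficiency and accuracy."

Driven by modern technology, electronic health (eHealth) systems have become feasible: network and mobile communication technologies make it possible to acquire physiological data and context-aware data continuously and in real time. However, data streams at this scale pose a severe challenge for real-time big data processing. Traditional machine learning methods often struggle with such fast-moving data because they typically need to load all the data at once for training, which is infeasible when the data volume is very large.

To address this problem, the researchers propose α-SVMSGD, an incremental learning algorithm. Incremental learning is an adaptive learning strategy that lets a model update step by step as new data arrives, without retraining the whole model. The core of α-SVMSGD is the introduction of a "forgetting factor", which dynamically adjusts the influence of old data to keep the model current and accurate. The forgetting factor controls how the model weighs new data against old data, preventing historical information from being forgotten too quickly while ensuring that the model adapts to new changes in time.

The support vector machine (SVM) is a widely used classification algorithm that separates data of different classes by finding the maximum-margin hyperplane. Stochastic gradient descent (SGD) is an optimization method for updating model parameters efficiently on large data sets. α-SVMSGD combines SVM with SGD: because each iteration processes only a subset of the data samples, computational complexity is reduced and the algorithm can run efficiently in a data stream setting.

In their experiments, the authors compare α-SVMSGD with other learning algorithms and show its advantages in processing eHealth data streams, especially in terms of real-time performance and accuracy. By continually adjusting the forgetting factor, the algorithm can handle large volumes of continuously arriving health data while maintaining model performance.

The study offers an effective solution for real-time big data analysis in eHealth, with significance for medical monitoring, disease prediction, and health management. Future research may further explore how to optimize the choice of the forgetting factor and how to extend the algorithm to more complex multi-task learning and deep learning frameworks.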

An Incremental Learning Classification Algorithm based on Forgetting Factor for eHealth Networks

Li Yang, Kun Wang, Chenhan Xu, Chunsheng Zhu, Yanfei Sun

Key Lab of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, Nanjing University of Posts and Telecommunications, China.

Emails: islyang@foxmail.com, kwang@njupt.edu.cn, xchank@outlook.com, njsyf@vip.163.com

Department of Electrical and Computer Engineering, The University of British Columbia, Canada.

Email: cszhu@ece.ubc.ca

Abstract—Advances in network technology and mobile communication technology are making eHealth possible. In eHealth systems, physiological data and relevant context-aware data are acquired continuously and in real time. At the same time, because eHealth data arrives as a data stream, such large-scale data poses major challenges for real-time big data processing. We therefore propose a novel incremental learning algorithm, α-SVMSGD, which improves the SVMSGD (Support Vector Machine-Stochastic Gradient Descent) algorithm by updating the training data with the continuous data stream. In addition, α-SVMSGD may handle the problem that the original SVMSGD cannot further mine the useful information in unclassified data. In α-SVMSGD, the training data is updated through a forgetting mechanism, in which a forgetting factor α is introduced to weed out useless training data. α-SVMSGD is applied to ambient assisted living communications and further incorporated into the data filtering layer of a local data processing architecture (LDPA) to reduce data redundancy. Simulation results confirm that the proposed algorithm is a promising solution to data redundancy that classifies real-time data streams without loss of accuracy.

Keywords—eHealth, Incremental learning, Support vector machine, Stochastic gradient descent, Forgetting factor

I. INTRODUCTION

Because hospital capacity and medical staff are limited relative to the increasing number of treatment requests, traditional health care services can hardly satisfy the needs of a growing population. Against this background, and benefiting from big data techniques, a new kind of eHealth service that monitors people's lives with intelligent devices is developing rapidly. In the current era of big data [1], the Internet transmits a great deal of data, which is then stored and processed by servers or clouds. In addition, mobile networks collect large amounts of data from all aspects of people's lives. The term big data is often used to describe such complex or large data sets in the network, which exceed the limits of traditional data processing methods.

Specifically, with the development of hardware, the data collected by mobile eHealth networks is growing in volume and becoming more pervasive. Meanwhile, collecting nodes are required to gather more data [2], and their number in networks tends to increase. All of these factors increase the scale of the network and the volume of data transferred in the eHealth network.

Since the energy and functionality of mobile nodes are limited, data has to be aggregated and processed in a central server. However, as the network grows and more powerful functions emerge, the central server may be unable to analyze all the data due to various factors (e.g., routing blocking resulting from malicious nodes or network congestion). For this reason, how to process these data efficiently is a very important problem. In our previous work, we proposed the framework of a local data processing architecture (LDPA), which provides the idea of quantifying the results of data analysis in ambient assisted living (AAL) communications [3]. However, a data redundancy issue still exists in the data filtering layer (DFL) of LDPA. To deal with this problem, this paper utilizes an improved SVMSGD (Support Vector Machine-Stochastic Gradient Descent) [4] algorithm that introduces a forgetting factor α to update the training data. Moreover, in the original SVMSGD, some past information in the classifier may belong to new classifications, and SVMSGD cannot further process this kind of data. We therefore propose an incremental learning algorithm, α-SVMSGD, and incorporate it into the DFL of LDPA to mine useful information from unclassified data.
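
As a rough illustration of the SVMSGD idea that this paragraph builds on, the following Python sketch trains a linear SVM with stochastic sub-gradient steps on the hinge loss, one streamed sample at a time. It is a minimal sketch, not the algorithm of [4] or of this paper: the function names, the learning rate `lr`, the regularization constant `lam`, and the synthetic two-class stream are all assumptions made for the example.

```python
import random


def sgd_svm_step(w, b, x, y, lr=0.1, lam=0.01):
    """One stochastic sub-gradient step on the L2-regularized hinge loss.

    w, b: current weights and bias; x: feature list; y: label in {-1, +1}.
    lr and lam are illustrative values, not taken from the paper.
    """
    margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
    # The regularization term shrinks the weights on every step.
    w = [wi * (1.0 - lr * lam) for wi in w]
    if margin < 1.0:
        # The sample violates the margin: move the hyperplane toward it.
        w = [wi + lr * y * xi for wi, xi in zip(w, x)]
        b = b + lr * y
    return w, b


def predict(w, b, x):
    """Return the predicted label (+1 or -1) for feature list x."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1


def make_stream(n, seed=0):
    """Synthetic stand-in for a sensor stream: two Gaussian clusters."""
    rng = random.Random(seed)
    stream = []
    for _ in range(n):
        y = rng.choice([-1, 1])
        x = [2.0 * y + rng.gauss(0, 0.5), 2.0 * y + rng.gauss(0, 0.5)]
        stream.append((x, y))
    return stream


def train_on_stream(stream, dim=2):
    """Update the model once per arriving sample, without revisiting old data."""
    w, b = [0.0] * dim, 0.0
    for x, y in stream:
        w, b = sgd_svm_step(w, b, x, y)
    return w, b


w, b = train_on_stream(make_stream(500))
print(predict(w, b, [2.0, 2.0]), predict(w, b, [-2.0, -2.0]))
```

Each sample is consumed once and then discarded, which is what makes this style of update suitable for a data stream. It does not, by itself, decide which past samples are worth keeping.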

To this end, we mainly focus on processing data efficiently and mining useful data classifications from both the original data and the incoming data, in order to save storage space and reduce the probability of data loss due to network congestion. The contributions of our work are summarized as follows:

• A forgetting factor method is proposed to mine useful information from the unclassified data. It can mine new useful information from such useless or incremental data, and it reduces the storage of history samples and the scale of the training sample set.

• In α-SVMSGD, we adopt an adaptive method to adjust the value of α. We first set a threshold for α, and then calculate the error between the initial threshold and the α value of a sample that has been trained several times. Finally, we choose the α with the maximum error weight as the new threshold and begin the next round of training on new data samples.
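
The excerpt does not give the exact update rule for α, so the following Python sketch only illustrates the general idea of a forgetting mechanism that shrinks the stored training set. Everything in it is hypothetical: the function name `update_training_set`, the per-sample weight, the rule that samples on or inside the margin keep full weight, and the values of `alpha` and `threshold` are assumptions for the example, not the authors' method.

```python
def update_training_set(buffer, new_samples, w, b, alpha=0.5, threshold=0.2):
    """Decay the weights of stored samples and drop those that fall too low.

    buffer: list of (x, y, weight) tuples kept from earlier rounds.
    new_samples: list of (x, y) pairs that just arrived.
    w, b: weights and bias of the current linear classifier.

    Hypothetical rule (not from the paper): a stored sample that still lies
    on or inside the margin is treated as informative and its weight is
    reset to 1.0; every other stored sample has its weight multiplied by
    the forgetting factor alpha. Samples whose weight drops below
    threshold are removed from the training set.
    """
    kept = []
    for x, y, weight in buffer:
        margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
        weight = 1.0 if margin <= 1.0 else weight * alpha
        if weight >= threshold:
            kept.append((x, y, weight))
    # Newly arrived samples enter the training set with full weight.
    kept.extend((x, y, 1.0) for x, y in new_samples)
    return kept


# Example with a fixed classifier w = [1, 0], b = 0.
buffer = [
    ([0.5, 0.0], 1, 1.0),  # inside the margin: kept at full weight
    ([3.0, 0.0], 1, 1.0),  # far from the boundary: weight decays
    ([3.0, 0.0], 1, 0.3),  # already decayed: falls below the threshold
]
buffer = update_training_set(buffer, [([-2.0, 0.0], -1)], [1.0, 0.0], 0.0)
print(len(buffer))
```

Under these assumptions, a smaller `alpha` forgets old samples faster and keeps the stored set smaller, while a larger `alpha` retains more history at a higher storage cost.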

The rest of the paper is organized as follows. Section II reviews related work. Section III presents the overview of LDPA.
