Research on a Neural Network Algorithm Based on Factor Analysis and Cluster Analysis


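"This research paper presents a back-propagation (BP) neural network algorithm based on factor analysis (FA) and cluster analysis (CA), aimed at large samples with high-dimensional features. By combining the principles of FA and CA, it optimizes the structure of the BP neural network: FA reduces the feature dimensionality of the initial data and simplifies the network architecture, and CA divides the samples into sub-categories to improve the adaptability of the network. Experiments show that the new algorithm significantly improves prediction precision, and its validity is verified by comparison with BP algorithms based on FA and CA."

The paper focuses on improving the efficiency and accuracy of the traditional BP neural network when it handles many samples with high-dimensional features. On high-dimensional data, a traditional BP network may overfit, train slowly, and generalize poorly. The authors therefore propose combining factor analysis and cluster analysis to improve the BP neural network.

Factor analysis is a statistical method that expresses the correlations in a high-dimensional data set through a small number of factors, which lowers the dimensionality of the data and the complexity of the network. This reduces the amount of computation while retaining the main information in the data, so the network is easier to train and converges more readily.

Cluster analysis divides the samples into subsets with similar attributes, that is, sub-categories. In a neural network setting, this allows a dedicated network to be built for each category, which improves adaptability and predictive ability for each category's data. Training a separate network on each sub-category captures the characteristics within that category better and so improves overall prediction performance.

In the experimental section, the authors compare the new algorithm with BP neural network algorithms based on FA alone or CA alone. The results show that the algorithm combining FA and CA significantly improves prediction precision, which verifies the effectiveness of the method. It offers a useful tool for prediction tasks on large-scale, high-dimensional data and has theoretical and practical value in areas such as data analysis and pattern recognition.

In short, the paper shows how fusing factor analysis and cluster analysis can enhance the performance of a BP neural network and offers a new approach to high-dimensional data problems. The combined method reduces the complexity of data processing and improves the prediction accuracy and generalization ability of the model, which should benefit future research and applications in the IT field.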

ORIGINAL ARTICLE

Research of neural network algorithm based on factor analysis and cluster analysis

Shifei Ding · Weikuan Jia · Chunyang Su · Liwen Zhang · Lili Liu

Received: 16 September 2009 / Accepted: 23 June 2010 / Published online: 7 July 2010

© Springer-Verlag London Limited 2010

Abstract For large samples with high feature dimensionality, this paper proposes a back-propagation (BP) neural network algorithm based on factor analysis (FA) and cluster analysis (CA), which combines the principles of FA and CA with the architecture of the BP neural network. The new algorithm first reduces the feature dimensionality of the initial data through FA to simplify the network architecture, then divides the samples into different sub-categories through CA and trains a network on each, so as to improve the adaptability of the network. In application, a new sample is first classified and then predicted with the corresponding network. In an experiment, the new algorithm shows significantly improved prediction precision. To test and verify the validity of the new algorithm, we compare it with BP algorithms based on FA and CA.

Keywords Artificial neural network (ANN) · Factor analysis (FA) · Cluster analysis (CA) · FA-CA-BP network
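The workflow described in the abstract (cluster the reduced samples, train one network per sub-category, then route each new sample to the network of its cluster) can be sketched roughly as follows. This is a minimal, standard-library Python illustration under stated assumptions, not the authors' implementation: the inputs are assumed to be factor scores already produced by FA, plain k-means stands in for the CA method (which this excerpt does not specify), and the names `kmeans`, `nearest`, `TinyBP`, `mse`, and `predict`, the toy data, and all hyperparameters are hypothetical.

```python
import math
import random

def nearest(centres, p):
    """Index of the centre closest to p (squared Euclidean distance)."""
    return min(range(len(centres)),
               key=lambda i: sum((a - b) ** 2 for a, b in zip(centres[i], p)))

def kmeans(points, k, iters=20):
    """Plain k-means; centres start at the first k points (a simplification)."""
    centres = [list(p) for p in points[:k]]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(centres, p)].append(p)
        # Move each centre to the mean of its group; keep it if the group is empty.
        centres = [[sum(c) / len(g) for c in zip(*g)] if g else centres[i]
                   for i, g in enumerate(groups)]
    return centres

class TinyBP:
    """One tanh hidden layer, linear output, per-sample back-propagation."""
    def __init__(self, n_in, n_hidden=3, seed=0):
        rng = random.Random(seed)
        # The last weight in every row is the bias term.
        self.w1 = [[rng.uniform(-0.1, 0.1) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [rng.uniform(-0.1, 0.1) for _ in range(n_hidden + 1)]

    def _hidden(self, x):
        return [math.tanh(sum(w * v for w, v in zip(row, x + [1.0])))
                for row in self.w1]

    def predict(self, x):
        h = self._hidden(list(x)) + [1.0]
        return sum(w * v for w, v in zip(self.w2, h))

    def train(self, xs, ys, lr=0.02, epochs=500):
        for _ in range(epochs):
            for x, y in zip(xs, ys):
                x = list(x)
                h = self._hidden(x)
                err = sum(w * v for w, v in zip(self.w2, h + [1.0])) - y
                for j, row in enumerate(self.w1):
                    # Back-propagate through tanh: d tanh(a)/da = 1 - h^2.
                    delta = err * self.w2[j] * (1.0 - h[j] ** 2)
                    for i, v in enumerate(x + [1.0]):
                        row[i] -= lr * delta * v
                for j, v in enumerate(h + [1.0]):
                    self.w2[j] -= lr * err * v

def mse(net, xs, ys):
    return sum((net.predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Toy data: the inputs stand in for factor scores already produced by FA.
X = [[0.0, 0.1], [2.0, 2.1], [0.2, 0.0], [2.2, 1.9],
     [0.1, 0.3], [1.9, 2.3], [0.3, 0.2], [2.3, 2.2]]
# Each sub-category follows a different rule, so one network per cluster helps.
Y = [2 + a + b if a < 1 else a - b - 2 for a, b in X]

centres = kmeans(X, k=2)
labels = [nearest(centres, x) for x in X]
nets, before, after = [], [], []
for c in range(2):
    xs = [x for x, l in zip(X, labels) if l == c]
    ys = [y for y, l in zip(Y, labels) if l == c]
    net = TinyBP(n_in=2, seed=c)
    before.append(mse(net, xs, ys))   # error of the untrained network
    net.train(xs, ys)
    after.append(mse(net, xs, ys))    # error after training on this cluster
    nets.append(net)

def predict(x):
    """Classify the new sample first, then use that cluster's network."""
    return nets[nearest(centres, x)].predict(x)
```

Calling `predict([0.1, 0.2])` routes the sample to the network trained on the first cluster, while `predict([2.1, 2.0])` uses the second. Keeping a separate small network per cluster is the design choice the abstract motivates: each network only has to fit the relationship within its own sub-category.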

1 Introduction

Artificial neural network (ANN) research is an interdisciplinary field that draws on brain science, neuroscience, cognitive science, psychology, computer science, and mathematics [1]. It has many important applications in the natural sciences, such as earth science [2], environmental science [3], and physical science [4]. An artificial neural network is a computing model that simulates the structure and some of the working mechanisms of the neural networks in the human brain. It is self-adaptive, self-organizing, and capable of real-time learning, and it is powerful in handling non-linear problems and large-scale computation. Neural networks have been studied for more than 60 years. During this time, hundreds of network models have been proposed [5], and the back-propagation (BP) neural network is one of the most mature and most widely used algorithms [6].

Artificial neural networks make many problems easier to solve, but their performance depends on the features of the input samples and on the properties of the network structure. For example, a large number of original samples provides more usable information but also makes the data harder for the network to handle, because the sample features contain related or even repeated information. Taking all of the data as network input complicates the design of the network and occupies a lot of storage space and computing time. Too many input features and repeated training samples make training time-consuming, hinder the convergence of the network, and ultimately degrade its recognition precision. It is therefore necessary to pre-process the original data: to analyze it and extract useful features from a large amount of data while excluding the influence of related or duplicate factors. It is also important to reduce the feature dimensionality as far as possible without affecting the solution of the problem, and then to group similar samples in order to simplify the network structure.
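As a rough illustration of how related or repeated features can be collapsed into fewer factors, the sketch below computes a correlation matrix and extracts its dominant factor by power iteration (the principal-component method of factor extraction). This excerpt does not specify the authors' FA procedure, so the extraction method, the function names `correlation_matrix` and `dominant_factor`, and the toy data are all assumptions made for illustration.

```python
import math

def correlation_matrix(rows):
    """Pearson correlation matrix of the columns of `rows` (one row per sample)."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    norms = [math.sqrt(sum(c[j] ** 2 for c in centered)) for j in range(p)]
    return [[sum(c[i] * c[j] for c in centered) / (norms[i] * norms[j])
             for j in range(p)] for i in range(p)]

def dominant_factor(R, iters=200):
    """Power iteration: largest eigenvalue of R and the matching factor loadings."""
    p = len(R)
    v = [1.0] * p
    for _ in range(iters):
        w = [sum(R[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient of the unit vector v gives the eigenvalue estimate.
    eigval = sum(v[i] * sum(R[i][j] * v[j] for j in range(p)) for i in range(p))
    # Loadings are the eigenvector scaled by the square root of the eigenvalue.
    loadings = [math.sqrt(eigval) * x for x in v]
    return eigval, loadings

# Toy data: feature 2 duplicates feature 1 (scaled by 2); feature 3 is unrelated.
samples = [[1, 2, 1], [2, 4, -1], [3, 6, 0], [4, 8, -1], [5, 10, 1]]
R = correlation_matrix(samples)
eigval, loadings = dominant_factor(R)
explained = eigval / len(R)   # share of total variance carried by this factor
```

Here the second feature is an exact multiple of the first, so both load on the same factor with loadings close to 1, while the third, uncorrelated feature has a loading close to 0. The first factor accounts for about two thirds of the total variance, so the two redundant inputs can be represented by a single factor score.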

S. Ding (✉) · W. Jia · C. Su · L. Zhang · L. Liu
School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221008, China
e-mail: dingsf@cumt.edu.cn

S. Ding
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China


Neural Comput & Applic (2011) 20:297–302

DOI 10.1007/s00521-010-0416-2
