Neurocomputing 261 (2017) 164–170
Distributed extreme learning machine with alternating direction method of multipliers
Minnan Luo a,∗, Lingling Zhang a, Jun Liu a, Jun Guo b, Qinghua Zheng a
a SPKLSTN Lab, Department of Computer Science, Xi'an Jiaotong University, Xi'an 710049, China
b Hardware Department, School of Computer Science and Technology, Northwest University, Xi'an 710127, China
Article info
Article history:
Received 26 September 2015
Revised 16 March 2016
Accepted 22 March 2016
Available online 14 February 2017
Keywords:
Extreme learning machine
Neural network
Alternating direction method of multipliers
Abstract
The extreme learning machine, a generalized single-hidden-layer feedforward network, has attracted considerable attention for its extremely fast learning speed and good generalization performance. However, big data poses a challenge to large-scale learning with the extreme learning machine, both because of the memory limitation of a single machine and because large-scale data are often stored in a distributed manner. To relieve this memory limitation, in this paper we develop a novel distributed model, the distributed extreme learning machine (DELM), which implements the extreme learning machine algorithm in parallel on large-scale data sets. The corresponding algorithm is built on the alternating direction method of multipliers, which has proven effective in distributed convex optimization. Finally, extensive experiments on several benchmark data sets illustrate the effectiveness and superiority of the proposed DELM method, with an analysis of its speedup, scaleup, and sizeup performance.
©2017 Elsevier B.V. All rights reserved.
1. Introduction
The extreme learning machine is a generalized single-hidden-layer feedforward network in which the parameters of the hidden-layer feature mapping are generated randomly according to any continuous probability distribution [1] instead of being tuned by gradient-descent-based algorithms. As a result, the extreme learning machine achieves extremely fast learning speed and good generalization performance. The ELM technique performs effectively and has been applied in many areas of machine learning, such as classification [2,3], clustering [4] and regression [5]. C. W. Deng and G. B. Huang highlighted the new trends of multi-layer learning with the extreme learning machine [6]. The extreme learning machine is also used in many real-life applications: for example, S. Shahaboddin et al. used it to estimate the wind speed distribution [7], and Deng et al. proposed an efficient image super-resolution approach based on the extreme learning machine to reconstruct the high-frequency components containing details [8].
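The random-feature construction described above admits a compact sketch. The following Python snippet is purely illustrative (the function names, the tanh activation and the defaults are our own choices, not code from this paper): the hidden-layer weights are drawn at random and never tuned, and only the output weights are fit by least squares via the Moore-Penrose pseudoinverse.

```python
import numpy as np

def elm_train(X, T, n_hidden=50, seed=0):
    """Basic ELM sketch: random hidden layer, least-squares output weights."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    # Hidden-layer weights and biases are random and remain fixed.
    W = rng.standard_normal((d, n_hidden))
    b = rng.standard_normal(n_hidden)
    H = np.tanh(X @ W + b)          # hidden-layer feature mapping
    # Output weights solve min ||H beta - T||^2 via the pseudoinverse.
    beta = np.linalg.pinv(H) @ T
    return (W, b, beta)

def elm_predict(model, X):
    W, b, beta = model
    return np.tanh(X @ W + b) @ beta
```

Because only a single linear least-squares problem is solved, training reduces to one matrix factorization, which is the source of ELM's speed.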
It is noteworthy that the traditional extreme learning machine is often implemented on a single machine and therefore inevitably suffers from memory limitations on large-scale data sets.
∗ Corresponding author.
E-mail addresses: minnluo@mail.xjtu.edu.cn (M. Luo), zhanglingling@stu.xjtu.edu.cn (L. Zhang), liukeen@mail.xjtu.edu.cn (J. Liu), guojun@nwu.edu.cn (J. Guo), qhzheng@mail.xjtu.edu.cn (Q. Zheng).
Especially in the era of big data, data sets are usually extremely large, and the data are often high-dimensional so as to capture detailed information [9,10]. On the other hand, it is often necessary to process a data set across different machines for two reasons: (1) the data are collected and stored in a distributed manner because of the large scale of the application; (2) it may be impossible to gather all of the data together for reasons of confidentiality, so each part of the data set can be accessed only on its own machine. Given the analysis above, how to implement the extreme learning machine on data sets located on different machines becomes a key problem.
In previous work, several parallel or distributed extreme learning machines have been implemented to meet the challenge of large-scale data sets [11,12]. For example, Q. He et al. took advantage of the distributed environment provided by MapReduce [13] and proposed a parallel extreme learning machine based on MapReduce by designing the proper key-value pairs [14]. X. Wang et al. focused on the issue of parallel ELM and proposed the M³ extreme learning machine based on the min-max modular network [15]. This approach decomposes the classification problem into several small subproblems and trains an individual ELM for each subproblem; in the end, the M³ network is adopted to ensemble the individual classifiers. Additionally, A. Akusok et al. developed a complete high-performance extreme learning machine toolbox for big data [16].
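To make concrete the consensus idea behind such distributed formulations, the sketch below (our own illustrative code under assumed names, not the paper's DELM implementation) solves the ELM output-weight least-squares problem over data shards with consensus ADMM: each worker updates a local weight matrix against its own shard, a global averaging step enforces agreement, and dual variables accumulate the residual disagreement.

```python
import numpy as np

def distributed_output_weights(H_parts, T_parts, rho=10.0, n_iter=300):
    """Consensus-ADMM sketch for ELM output weights split across workers.

    Worker i holds a shard (H_i, T_i) and a local weight matrix B_i;
    all B_i are driven toward a common consensus variable z.
    """
    L = H_parts[0].shape[1]          # number of hidden units
    m = T_parts[0].shape[1]          # number of outputs
    N = len(H_parts)                 # number of workers
    z = np.zeros((L, m))
    U = [np.zeros((L, m)) for _ in range(N)]   # scaled dual variables
    B = [np.zeros((L, m)) for _ in range(N)]   # local primal variables
    # Pre-factor each worker's regularized normal equations once.
    lhs = [np.linalg.inv(H.T @ H + rho * np.eye(L)) for H in H_parts]
    for _ in range(n_iter):
        # Local updates (would run in parallel, one per machine).
        for i in range(N):
            B[i] = lhs[i] @ (H_parts[i].T @ T_parts[i] + rho * (z - U[i]))
        # Consensus update: average of local variables plus duals.
        z = sum(B[i] + U[i] for i in range(N)) / N
        # Dual updates accumulate the remaining disagreement.
        for i in range(N):
            U[i] += B[i] - z
    return z
```

Only the L-by-m local weights and duals are communicated each round, never the raw shards, which is why this pattern suits both the memory and the confidentiality constraints discussed above.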
http://dx.doi.org/10.1016/j.neucom.2016.03.112