深度学习模型：自适应学习率在故障诊断中的应用

29 浏览量更新于2024-08-03 收藏 2.44MB PDF 举报

"这篇论文提出了一种用于故障诊断的深度学习模型，该模型采用了自适应学习率策略，旨在解决深度学习模型训练时间过长的问题，同时确保故障诊断的准确性和及时性。" 在深度学习领域，随着设备故障诊断数据量的不断增长，深度学习在故障诊断过程中的应用越来越重要。这一过程对时效性的要求很高，需要快速准确地获取故障诊断结果。然而，随着网络层数的增加，深度学习模型的训练时间也会相应延长。学习率是深度学习模型训练过程中的关键因素，一个设计良好的学习率调整策略能够有效地减少训练时间，满足故障诊断的需求。目前，许多深度学习模型通常采用全局统一的学习率策略，但这种策略对于不同的参数并不合理。论文作者Xiaodong Zhai和Fei Qiao来自同济大学电子与信息工程学院，他们针对这一问题提出了一个新的方法。该方法为深度学习模型中的权重和偏置参数分别设计了自适应学习率策略。通过这种方式，模型能够根据每个参数的特性动态调整学习率，使得训练过程更加高效，同时优化模型性能。自适应学习率策略的核心在于理解和利用参数更新的差异性。不同参数在训练过程中的变化速度可能不同，因此，给予每个参数一个适合其自身特点的学习率可以加速收敛，避免在训练过程中陷入局部最优或者过拟合。这样的策略可以更有效地挖掘数据中的模式，提升模型在故障诊断任务上的表现。论文中可能详细探讨了以下几个方面： 1. 学习率策略的数学原理：如何根据参数的梯度信息来动态调整学习率。 2. 实现方法：可能介绍了一种或多种算法实现自适应学习率，例如Adagrad、RMSprop、Adam等。 3. 模型结构：论文可能会提及所使用的具体深度学习架构，如卷积神经网络（CNN）、循环神经网络（RNN）或长短期记忆网络（LSTM），以及它们如何与自适应学习率策略结合。 4. 实验验证：通过对比实验，展示提出的自适应学习率策略相比于传统全局统一学习率策略在训练时间和诊断准确性上的优势。 5. 应用场景：可能会讨论该模型在实际设备故障诊断中的潜在应用，以及可能遇到的挑战和解决方案。这篇论文为深度学习在故障诊断领域的应用提供了一个创新的方法，通过自适应学习率策略提高了模型训练效率和诊断性能，对于深度学习在工业界的实际应用具有重要的参考价值。

A Deep Learning Model with Adaptive Learning Rate for Fault Diagnosis

Xiaodong Zhai

, Fei Qiao

1. School of Electronics and Information Engineering, Tongji University, Shanghai 201804

E-mail: 1710332@tongji.edu.cn, fqiao@tongji.edu.cn

Abstract: With the increasing amount of data in the field of equipment fault diagnosis, deep learning is playing an increasingly

important role in the process of fault diagnosis, during which the timeliness requirement is high and the fault diagnosis results

need to be obtained accurately and timely. However, with the increase of network layers, the training time of deep learning model

becomes longer. Learning rate in the deep learning model plays an important role in the process of model training, and a

well-designed learning rate adjustment strategy can effectively reduce the training time and satisfy the requirements of fault

diagnosis. At present, some deep learning models usually adopt a globally uniform learning rate strategy, which is unreasonable

for different parameters. This paper has designed an adaptive learning rate strategy for the parameters of weight and bias

respectively in deep learning model. Specifically, the strategy contains a learning rate strategy based on stochastic gradient

descent method for weight, and a power exponential learning rate strategy for bias. Experiments are carried out to validate the

effectiveness of proposed learning rate strategy. Results suggest that the strategy can reduce the training time and reconstruction

error rate of deep learning model, and improve the classification accuracy of fault diagnosis.

Key Words: Deep learning, Learning rate, Adaptive, Fault diagnosis



1 Introduction

With the development of modern industrial technology,

the safety, stability, reliability and operation efficiency of

equipment have become the core competitiveness of

manufacturing enterprises [1], and equipment management

has become an important field in enterprise management. In

the process of production, the performance of equipment

deteriorates with the increase of service time, and various

faults will occur in the process of equipment operation.

When the equipment fails, the production efficiency will be

reduced. More seriously, the equipment will be shut down,

and malignant accidents such as machine damage and

human death will occur. Therefore, it is particularly

important to find and identify the types and locations of

faults in time. With the development of computer

technology, many artificial intelligence algorithms have

been applied in the field of equipment fault diagnosis. It is

predicted that the growing Internet of Things will connect

30 billion devices by 2020 [2], and the huge amount of data

will also promote the innovation of the monitoring process

of the physical network system of industrial 4.0. With the

increasing amount of data, the advantages of deep learning

using in dealing with large-scale data are highlighted.

The motivation of deep learning is to build and simulate

the neural network of human brain for analysis and learning.

It imitates the mechanism of human brain to interpret data,

such as images, sounds and texts [3-5]. Deep learning is a

multi-layer neural network model essentially. By combining

low-level features, we can get a higher-level and more

abstract feature representation to discover the distributed

feature representation of data. At the same time, it weakens

the adverse effects of unrelated factors and improves the

accuracy of classification and prediction [6]. Meanwhile, the

excellent performance of deep learning is mainly based on a

This work is supported by National Natural Science Foundation,

China(No. 71690234, 61873191), National Science and Technology Major

Project (2017-V-0011-0063) and the National Key R&D Program, China

(No. 2017YFE0101400).

large number of training data and deep-level network

structure, as a result, the training time of deep learning

model is longer than other machine learning algorithms

generally [7]. Therefore, how to speed up the training time

of deep learning model is a problem which is worth of

intensive study, especially when it is applied in engineering

practice.

2 Related Work

Traditional fault diagnosis methods include model driven

methods, knowledge driven methods, and data driven

methods. However, the first two methods are often limited

by professional technology, expert experience and other

knowledge. In addition, with the continuous development of

equipment status monitoring technology, more and more

equipment status data can be utilized. As a result, data

driven methods based on machine learning and artificial

intelligence have attracted people's attention in recent years

[8-9]. Data driven methods can discover the intrinsic law of

equipment status trends and estimate the fault types of

equipment by advanced methods based on equipment status

data. With the increasing amount of equipment status data,

more and more attention has been paid to the deep learning

method in machine learning.

There are two significant parameters in deep learning

model, which are weight and bias. However, traditional

deep learning models often use a global uniform constant

parameter for these two parameters, and the setting of this

constant parameter requires previous experience.

Meanwhile, it should be noted that there are a large number

of weight and bias parameters in deep learning model, and

they are two different types of parameters. Different

parameters play different roles. With this in mind, it is

unreasonable to provide the same learning rate strategy for

different parameters. A global uniform learning rate is not

necessarily suitable for all parameters, and it will reduce the

iteration efficiency and increase the model training time of

deep learning model.

At present, there have been some studies on the

adjustment strategies of the learning rate in the deep

2020 IEEE 9th Data Driven Control and Learning Systems Conference

November 20-22, 2020, Liuzhou, China

DDCLS'20

668

Authorized licensed use limited to: Carleton University. Downloaded on June 20,2021 at 01:24:34 UTC from IEEE Xplore. Restrictions apply.

下载后可阅读完整内容，剩余5页未读，立即下载

花于陌上开

粉丝: 43
资源: 16

深度学习模型：自适应学习率在故障诊断中的应用

deep learning(Adaptive Computation and Machine Learning series)

Barzilai–Borwein-based adaptive learning rate for deep learning

A niching evolutionary algorithm with adaptive negative correlation learning for neural network ensemble

Deep Learning (Adaptive Computation and Machine Learning series)

Deep Learning (Adaptive Computation and Machine Learning series) 英文版

A Self-Adaptive Deep Learning-Based System for Anomaly Detection

Deep Learning (Adaptive Computation and Machine Learning series) 最新中文版

Deep learning_ adaptive computation and machine learning-The MIT Press (2016)

Deep learning_ adaptive computation and machine learning（深度学习经典专著 英文原版）.pdf

adaptive_deeplearning_matlab_

最新资源

Deep learning_ adaptive computation and machine learning（深度学习经典专著英文原版）.pdf