HMM-Driven Robust Probabilistic Principal
Component Analyzer for Dynamic
Process Fault Classification
Jinlin Zhu, Zhiqiang Ge, Member, IEEE, and Zhihuan Song
Abstract—In this paper, a novel hidden Markov model
(HMM)-driven robust latent variable model (LVM) is pro-
posed for fault classification in dynamic industrial pro-
cesses. A robust probabilistic model with Student’s t
mixture output is designed for tolerating outliers. Based
on the robust LVM, the probabilistic structure is further
developed into a classifier form so as to incorporate vari-
ous types of process information during model acquisition.
After that, the robust probabilistic classifier is extended
within the HMM framework so as to characterize the time-
domain stochastic uncertainties. The model parameters are
derived through the expectation–maximization algorithm.
For performance validation, the developed model is tested
on the Tennessee Eastman benchmark process.
Index Terms—Expectation–maximization (EM), hidden
Markov model (HMM), mixture model, outliers, robust prob-
abilistic principal component analyzers, robust sequential
data modeling.
I. INTRODUCTION
PROCESS monitoring is of great importance for industrial
plants due to the complex manufacturing process and
costly equipment [1]. The early detection of process faults
can not only prevent potential serious damage to instruments
or the environment but also keep operators safe and help to
reduce economic loss. Traditional process monitoring methods,
represented by the model-based approach, rely on in-depth
analytical expressions of the monitored process. In most cases,
however, the detailed kinetic properties can hardly be obtained
completely [2]. As an alternative, data-based process
monitoring methods require little in terms of rigorous system
models, and the underlying key information can be effectively
extracted from the readily available historical data [3]. For
this reason, data-based monitoring techniques have been widely
researched over the past few decades [4], [5].
Principal component analysis (PCA) and partial least squares
are probably the most popular data-based monitoring methods
[6]. Both methods are built upon information from steady-state
operating scenarios and characterize the internal variance,
which is assumed to differ from that of other working
conditions. However, a common issue with these methods is
that uncertainties and noise are neglected during the modeling
phase. Recently, a probabilistic modification of PCA, namely
probabilistic PCA (PPCA), has been presented and, together
with its mixture formulation [mixture PPCA (MPPCA)], applied
successfully in nonlinear/non-Gaussian monitoring areas [7].
Compared with the original PCA, probabilistic latent variable
models (LVMs) show more desirable performance,
and more importantly, the Bayesian inference method pro-
vides a unified framework for comprehensive modeling and
monitoring.
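For reference, the PPCA model referred to above follows the standard generative formulation (due to Tipping and Bishop), sketched below in generic notation that is illustrative rather than taken from this paper: an observation x is generated from a low-dimensional latent score t through a loading matrix W, a mean vector μ, and isotropic Gaussian noise e.

% Standard PPCA generative model; generic notation.
\[
\mathbf{x} = \mathbf{W}\mathbf{t} + \boldsymbol{\mu} + \mathbf{e},
\qquad
\mathbf{t} \sim \mathcal{N}(\mathbf{0}, \mathbf{I}),
\qquad
\mathbf{e} \sim \mathcal{N}(\mathbf{0}, \sigma^{2}\mathbf{I})
\]
\[
\Rightarrow\quad
\mathbf{x} \sim \mathcal{N}\bigl(\boldsymbol{\mu},\,
\mathbf{W}\mathbf{W}^{\mathsf{T}} + \sigma^{2}\mathbf{I}\bigr).
\]

The mixture formulation (MPPCA) simply combines several such local models under a set of mixing weights, which is what enables the non-Gaussian monitoring applications cited above.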
In essence, LVMs, such as PCA and PPCA, are all con-
structed in a static manner on the basis of normal operating
conditions. Thus, information from other operating conditions
is totally ignored. Meanwhile, one can resort to elaborately
designed indices such as the T^2 and Q statistics for further fault
detection and contribution plots for diagnosis. To improve the
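As a point of reference, the T^2 and Q indices mentioned above can be obtained from a fitted PCA model as in the following sketch; this is the standard textbook construction written in generic numpy code, with illustrative array names and component counts, and is not the classifier developed in this paper.

    # Generic sketch of the standard PCA T^2 and Q (squared prediction
    # error) monitoring indices; names and settings are illustrative.
    import numpy as np

    def fit_pca(X_train, n_pc=2):
        """Fit a PCA model on normal operating data."""
        mu = X_train.mean(axis=0)
        cov = np.cov(X_train - mu, rowvar=False)
        eigval, eigvec = np.linalg.eigh(cov)
        order = np.argsort(eigval)[::-1]     # descending variance
        P = eigvec[:, order[:n_pc]]          # loading matrix
        lam = eigval[order[:n_pc]]           # variances of retained PCs
        return mu, P, lam

    def t2_q(x, mu, P, lam):
        """Compute the two monitoring indices for a single sample x."""
        xc = x - mu
        t = P.T @ xc                         # scores in the PC subspace
        t2 = float(np.sum(t ** 2 / lam))     # Hotelling's T^2
        r = xc - P @ t                       # residual outside the subspace
        q = float(r @ r)                     # Q statistic (SPE)
        return t2, q

In a monitoring setting, both indices would be compared against control limits estimated from the normal training data; samples exceeding the limits are flagged as faulty.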
monitoring efficiency, some supervised modeling techniques
have been reported, such as neural networks (NNs), sup-
port vector machines (SVMs), and Gaussian mixture models
(GMMs) [8], [9]. The supervised monitoring methods can be
constructed from the whole sample space, and the mechanism
is to view the detection and diagnosis as a single classification
task. For example, NNs and SVMs are black box models that
map the original data space into sophisticated but well-
organized high-dimensional spaces in which the discriminant
analysis is carried out. Despite their feasibility, such models
offer little ability to explain the data. In contrast, GMMs conduct
the modeling directly on the sampled data set, and the global
data distribution can be elegantly approximated by a
finite set of local distributions. Compared with NNs and SVMs,
GMMs can be more flexible and intuitive.
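To make the GMM-based supervised viewpoint concrete, the following minimal sketch fits one Gaussian mixture per labeled operating condition and assigns a new sample to the condition with the highest log-likelihood. It assumes scikit-learn is available, and the class labels, component counts, and equal-prior assumption are illustrative choices rather than the robust HMM-driven classifier proposed in this paper.

    # Minimal sketch of GMM-based fault classification: one mixture per
    # labeled condition, classification by maximum log-likelihood under
    # equal class priors. Names and settings are illustrative.
    import numpy as np
    from sklearn.mixture import GaussianMixture

    def fit_class_models(train_sets, n_components=2, seed=0):
        """train_sets maps each condition label to its training samples."""
        return {label: GaussianMixture(n_components=n_components,
                                       covariance_type="full",
                                       random_state=seed).fit(X)
                for label, X in train_sets.items()}

    def classify(models, X):
        """Assign each row of X to the condition with the largest log-likelihood."""
        labels = list(models)
        loglik = np.column_stack([models[c].score_samples(X) for c in labels])
        return [labels[i] for i in loglik.argmax(axis=1)]

Because every local Gaussian component carries an explicit mean and covariance in the original variable space, the fitted mixtures remain directly interpretable, which is the flexibility and intuitiveness referred to above.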
Despite the appealing benefits and successful applications,
a main drawback for all the aforementioned methods is the
lack of a proper mechanism to deal with outliers, which can
be frequently encountered in practical industrial processes [10].
Generally speaking, outliers can be regarded as irregular data.
Irregular data arise from several sources, such as sensor failures,
network transmission errors, computer malfunctions, errors in
database software, and data recording errors [11]. In many
irregular data-related studies, outliers are usually detected and
smoothed by averaging [12]. A previous study showed that the