Automatica 50 (2014) 2777–2786
Detecting abnormal situations using the Kullback–Leibler divergence
Jiusun Zeng a,d, Uwe Kruger b,1, Jaap Geluk c, Xun Wang c, Lei Xie d,2
a College of Metrology and Measurement Engineering, China Jiliang University, Hangzhou 310018, PR China
b Department of Mechanical & Industrial Engineering, Sultan Qaboos University, P.O. Box 33, Al Khod, Oman
c Department of Mathematics, The Petroleum Institute, P.O. Box 2533, Abu Dhabi, United Arab Emirates
d State Key Laboratory of Industrial Control Technology, Zhejiang University, Hangzhou 310027, PR China
Article info
Article history:
Received 9 November 2013
Received in revised form 11 June 2014
Accepted 17 June 2014
Available online 3 October 2014
Keywords:
Kullback–Leibler divergence
Multivariate probability density function
Incipient fault condition
Fault detection
Increased sensitivity
Abstract
This article develops statistics based on the Kullback–Leibler (KL) divergence to monitor large-scale
technical systems. These statistics detect anomalous system behavior by comparing estimated density
functions for the current process behavior with reference density functions. For Gaussian distributed
process variables, the paper proves that the difference between density functions, as measured by the KL divergence, is a more sensitive indicator of abnormal behavior than the statistics used in existing multivariate monitoring work. To cater for a wide range of potential application areas, the paper develops monitoring concepts for linear static systems that can produce Gaussian as well as non-Gaussian distributed process variables. Using recorded data from a glass melter, the article demonstrates the increased sensitivity of the KL-based statistics by comparing them with competing ones.
© 2014 Elsevier Ltd. All rights reserved.
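The sensitivity claim above rests on the fact that the KL divergence between two multivariate Gaussian densities is available in closed form, KL = ½[tr(Σ₁⁻¹Σ₀) + (μ₁−μ₀)ᵀΣ₁⁻¹(μ₁−μ₀) − d + ln(det Σ₁/det Σ₀)]. The following is a minimal sketch of that formula, not the paper's implementation; the function name and test values are illustrative:

```python
import numpy as np

def kl_gaussian(mu0, S0, mu1, S1):
    """Closed-form KL divergence KL( N(mu0, S0) || N(mu1, S1) )."""
    d = len(mu0)
    S1_inv = np.linalg.inv(S1)
    diff = mu1 - mu0
    term_trace = np.trace(S1_inv @ S0)          # tr(S1^-1 S0)
    term_mahal = diff @ S1_inv @ diff           # Mahalanobis term for the mean shift
    term_logdet = np.log(np.linalg.det(S1) / np.linalg.det(S0))
    return 0.5 * (term_trace + term_mahal - d + term_logdet)

mu, S = np.zeros(2), np.eye(2)
print(kl_gaussian(mu, S, mu, S))            # identical densities -> 0
print(kl_gaussian(mu, S, mu + 0.1, S))      # small mean shift -> strictly positive
```

With identity covariances, the divergence reduces to half the squared norm of the mean shift, so even an incipient shift that barely moves a variance-based statistic produces a nonzero divergence.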
1. Introduction
Detecting abnormal operating conditions is of fundamental im-
portance to ensure the safe, reliable and economic operation of
technical systems. Related research can be broadly divided into
model-based, signal-based, rule-based and knowledge-based techniques, and their applications span a wide range, including general manufacturing, automotive and aircraft systems, as well as civil engineering and chemical systems (Kruger & Xie, 2012). Owing to the availability of large, routinely updated data records, the application of multivariate statistics has also gained significant attention over the past decades (Dunia, Qin, Edgar, & McAvoy, 1996;
Feital et al., 2010; Ge, Xie, Kruger, & Song, 2012; Kano, Hasebe,
Hashimoto, & Ohno, 2004; Kourti, 2005; Lee, Qin, & Lee, 2006; Liu,
Xie, Kruger, Littler, & Wang, 2008; Miletic, Quinn, Dudzic, Vaculik,
& Champagne, 2004; Venkatasubramanian, Rengaswamy, Kavuri, & Yin, 2003), largely owing to their conceptual simplicity (Kruger & Xie, 2012).

✩ This work was supported by the Petroleum Institute, internal grant RIFP-14301, and the Natural Science Foundation of China, grant numbers 61203088, 61320106009, 61374121. The material in this paper was not presented at any conference. This paper was recommended for publication in revised form by Associate Editor Juergen Hahn under the direction of Editor Frank Allgöwer.
E-mail addresses: jszeng@cjlu.edu.cn (J. Zeng), uwe.kruger@gmail.com (U. Kruger), jgeluk@pi.ac.ae (J. Geluk), xwang@pi.ac.ae (X. Wang), leix@csc.zju.edu.cn (L. Xie).
1 Tel.: +968 2414 2549; fax: +968 2414 1316.
2 Tel.: +86 571 87952233; fax: +86 571 87951200.
Multivariate statistical approaches rely, predominantly, on
non-causal data structures identified using principal component
analysis (PCA) (AlGhazzawi & Lennox, 2008; Dunia et al., 1996;
Feital et al., 2010), for Gaussian distributed source signals, and
independent component analysis (Ge et al., 2012; Kano et al., 2004;
Lee et al., 2006) as a non-Gaussian extension. The independent
components are embedded within the PCA components (Liu et al.,
2008), which follows from the data structure:
y(k) = Cs(k) + e(k). (1)
Here, y ∈ R^{d_y} is a measured data vector, s ∈ R^{d_s}, d_s < d_y, is a vector of source variables that has the density function f_s and the covariance matrix Σ_s, C ∈ R^{d_y × d_s} is a parameter matrix, e ∈ R^{d_y} is an error vector that has a Gaussian distribution, e ∼ N{0, Σ_e}, and k is a sample index. A practically reasonable assumption is that the non-diagonal elements of Σ_e are zero. If Σ_e = σ_e^2 I, an eigendecomposition of the covariance matrix of y, Σ_y = C Σ_s C^T + Σ_e, yields σ_e^2 and the column space of C (Kruger & Xie, 2012). Conversely, if Σ_e ≠ σ_e^2 I, the application of maximum likelihood PCA allows the estimation of the diagonal elements of Σ_e and the column space of C (Feital et al., 2010; Kruger & Xie, 2012; Liu et al., 2008; Narasimhan & Shah, 2008).
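The identifiability argument above can be illustrated numerically. The sketch below uses illustrative dimensions and parameter values, and assumes unit-variance Gaussian sources and Σ_e = σ_e^2 I; it simulates the data structure in Eq. (1) and recovers σ_e^2 and the column space of C from an eigendecomposition of the sample covariance of y:

```python
import numpy as np

rng = np.random.default_rng(0)
d_y, d_s, n = 5, 2, 200_000                # illustrative dimensions and sample size

C = rng.normal(size=(d_y, d_s))            # parameter matrix (illustrative values)
sigma_e = 0.1                              # noise std. dev., so Sigma_e = sigma_e^2 I
s = rng.normal(size=(n, d_s))              # Gaussian sources with Sigma_s = I
e = sigma_e * rng.normal(size=(n, d_y))    # isotropic Gaussian error vector
y = s @ C.T + e                            # data structure y(k) = C s(k) + e(k)

# Eigendecomposition of the sample covariance Sigma_y = C Sigma_s C^T + sigma_e^2 I;
# eigh returns eigenvalues in ascending order.
Sigma_y = np.cov(y, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(Sigma_y)

# The d_y - d_s smallest eigenvalues estimate sigma_e^2 ...
sigma2_est = eigvals[: d_y - d_s].mean()
# ... and the d_s leading eigenvectors span the column space of C.
U = eigvecs[:, -d_s:]
residual = C - U @ (U.T @ C)               # projecting C onto span(U) should reproduce C

print(sigma2_est)                          # close to sigma_e**2 = 0.01
print(np.linalg.norm(residual))            # close to 0
```

The d_y − d_s trailing eigenvalues cluster at the noise floor σ_e^2, which is precisely what breaks down when Σ_e is not isotropic and maximum likelihood PCA is needed instead.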
Using a moving window approach, Kruger, Kumar, and Littler
(2007) showed that evaluating changes in the underlying geometry
http://dx.doi.org/10.1016/j.automatica.2014.09.005