Figure 2: Architecture of VAE. The prior of z is regarded as part of the generative model (solid lines), thus the whole generative model is denoted as $p_\theta(x, z) = p_\theta(x|z)\,p_\theta(z)$. The approximate posterior (dashed lines) is denoted as $q_\phi(z|x)$.
2.4 Background of Variational Auto-Encoder
Deep Bayesian networks use neural networks to express the relationships between variables, such that they are no longer restricted to simple distribution families and can thus be easily applied to complicated data. Variational inference techniques [12] are often adopted in training and prediction; they are efficient methods for solving the posteriors of the distributions derived by neural networks.
VAE is a deep Bayesian network. It models the relationship between two random variables, the latent variable $z$ and the visible variable $x$. A prior is chosen for $z$, usually a multivariate unit Gaussian $\mathcal{N}(\mathbf{0}, \mathbf{I})$. After that, $x$ is sampled from $p_\theta(x|z)$, which is derived from a neural network with parameter $\theta$. The exact form of $p_\theta(x|z)$ is chosen according to the demands of the task. The true posterior $p_\theta(z|x)$ is intractable by analytic methods, but it is necessary for training and often useful in prediction, thus variational inference techniques are used to fit another neural network as the approximate posterior $q_\phi(z|x)$. This posterior is usually assumed to be $\mathcal{N}(\mu_\phi(x), \sigma^2_\phi(x))$, where $\mu_\phi(x)$ and $\sigma_\phi(x)$ are derived by neural networks. The architecture of VAE is shown in Fig 2.
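As a minimal illustration (not the paper's implementation), both conditionals can be realized by small fully-connected networks that output the mean and standard deviation of a diagonal Gaussian. The hidden size, ReLU activation, soft-plus output, and dimensions used in the sketch below are common choices and should be read as assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GaussianConditional(nn.Module):
    """A conditional distribution N(mu(v), sigma(v)^2 I) whose parameters are
    derived from the input v by a neural network; it can serve either as
    q_phi(z|x) or as p_theta(x|z) when a Gaussian form is chosen for x."""
    def __init__(self, in_dim, out_dim, hidden_dim=100):    # sizes are illustrative
        super().__init__()
        self.hidden = nn.Linear(in_dim, hidden_dim)
        self.mu = nn.Linear(hidden_dim, out_dim)
        self.sigma_raw = nn.Linear(hidden_dim, out_dim)

    def forward(self, v):
        h = torch.relu(self.hidden(v))                       # hidden features
        mu = self.mu(h)                                      # mean vector
        sigma = F.softplus(self.sigma_raw(h)) + 1e-4         # positive std
        return torch.distributions.Normal(mu, sigma)         # diagonal Gaussian

x_dim, z_dim = 120, 3                                        # illustrative dimensions
q_net = GaussianConditional(x_dim, z_dim)                    # q_phi(z|x)
p_net = GaussianConditional(z_dim, x_dim)                    # p_theta(x|z)
prior = torch.distributions.Normal(torch.zeros(z_dim), torch.ones(z_dim))  # p(z) = N(0, I)
```

For KPI windows, Donut indeed chooses a diagonal Gaussian form for both distributions, as described in § 3.1.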
SGVB [16, 32] is a variational inference algorithm that is often used along with VAE, where the approximate posterior and the generative model are jointly trained by maximizing the evidence lower bound (ELBO, Eqn (1)). We did not adopt more advanced variational inference algorithms, since SGVB already works.
$$
\begin{aligned}
\log p_\theta(x) \ge \log p_\theta(x) - \operatorname{KL}\!\left[\, q_\phi(z|x) \,\big\|\, p_\theta(z|x) \,\right] &= \mathcal{L}(x) \\
&= \mathbb{E}_{q_\phi(z|x)}\!\left[ \log p_\theta(x) + \log p_\theta(z|x) - \log q_\phi(z|x) \right] \\
&= \mathbb{E}_{q_\phi(z|x)}\!\left[ \log p_\theta(x, z) - \log q_\phi(z|x) \right] \\
&= \mathbb{E}_{q_\phi(z|x)}\!\left[ \log p_\theta(x|z) + \log p_\theta(z) - \log q_\phi(z|x) \right]
\end{aligned}
\tag{1}
$$
Monte Carlo integration [10] is often adopted to approximate the expectation in Eqn (1), as in Eqn (2), where $z^{(l)}, l = 1, \dots, L$ are samples from $q_\phi(z|x)$. We stick to this method throughout this paper.
$$
\mathbb{E}_{q_\phi(z|x)}\!\left[ f(z) \right] \approx \frac{1}{L} \sum_{l=1}^{L} f\!\left(z^{(l)}\right)
\tag{2}
$$
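To make Eqns (1) and (2) concrete, the sketch below (our illustration, not the paper's code) computes the Monte Carlo ELBO estimate. Here `q_zx` is assumed to be a `torch.distributions.Normal` representing $q_\phi(z|x)$, `p_xz_fn` a function mapping $z$ to the $p_\theta(x|z)$ distribution (e.g. the `q_net` / `p_net` modules of the earlier sketch), and `prior` the unit Gaussian $p_\theta(z)$.

```python
import torch

def elbo_monte_carlo(x, q_zx, p_xz_fn, prior, L=1):
    """Monte Carlo estimate of the ELBO in Eqn (1), via Eqn (2):
    L(x) ~= (1/L) * sum_l [log p(x|z^(l)) + log p(z^(l)) - log q(z^(l)|x)],
    with z^(l) drawn from q_phi(z|x)."""
    total = 0.0
    for _ in range(L):
        z = q_zx.rsample()                        # z^(l) ~ q_phi(z|x), reparameterized
        log_p_x_z = p_xz_fn(z).log_prob(x).sum()  # log p_theta(x|z^(l))
        log_p_z = prior.log_prob(z).sum()         # log p_theta(z^(l))
        log_q_z_x = q_zx.log_prob(z).sum()        # log q_phi(z^(l)|x)
        total = total + log_p_x_z + log_p_z - log_q_z_x
    return total / L

# SGVB trains theta and phi jointly by gradient ascent on this estimate,
# e.g. loss = -elbo_monte_carlo(x, q_net(x), p_net, prior); loss.backward().
```

The `rsample` call draws reparameterized samples, which is what allows SGVB to back-propagate gradients of the ELBO estimate to both $\theta$ and $\phi$.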
Figure 3: Overall architecture of Donut.
Figure 4: Network structure of Donut: (a) variational net $q_\phi(z|x)$, (b) generative net $p_\theta(x|z)$. Gray nodes are random variables, and white nodes are layers. The double lines highlight our special designs upon a general VAE.
3 ARCHITECTURE
The overall architecture of our algorithm Donut is illustrated in Fig 3. The three key techniques are Modified ELBO and Missing Data Injection during training, and MCMC Imputation in detection.
3.1 Network Structure
As aforementioned in § 2.1, the KPIs studied in this paper are assumed to be time sequences with Gaussian noises. However, VAE is not a sequential model, thus we apply sliding windows [34] of length $W$ over the KPIs: for each point $x_t$, we use $x_{t-W+1}, \dots, x_t$ as the $x$ vector of VAE. This sliding window was first adopted because of its simplicity, but it turns out to bring an important and beneficial consequence, which will be discussed in § 5.1.
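For concreteness, the window preparation can be sketched as follows (our illustration; the window length and the toy series are only examples, not the paper's settings).

```python
import numpy as np

def sliding_windows(kpi, W):
    """For each point x_t (t >= W-1), build the vector (x_{t-W+1}, ..., x_t)
    that serves as one input x of the VAE."""
    kpi = np.asarray(kpi, dtype=np.float32)
    windows = np.stack([kpi[t - W + 1: t + 1] for t in range(W - 1, len(kpi))])
    return windows  # shape: (len(kpi) - W + 1, W)

# Example: a series of 1440 points with W = 120 yields 1321 windows of dimension 120.
kpi = np.random.randn(1440)
x = sliding_windows(kpi, W=120)
assert x.shape == (1321, 120)
```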
The overall network structure of Donut is illustrated in Fig 4, where the components with double-lined outlines (e.g., Sliding Window x, W Dimensional at the bottom left) are our new designs and the remaining components are from standard VAEs. The prior $p_\theta(z)$ is chosen to be $\mathcal{N}(\mathbf{0}, \mathbf{I})$. Both the $x$ and $z$ posteriors are chosen to be diagonal Gaussian: $p_\theta(x|z) = \mathcal{N}(\mu_x, \sigma_x^2 \mathbf{I})$ and $q_\phi(z|x) = \mathcal{N}(\mu_z, \sigma_z^2 \mathbf{I})$, where $\mu_x$, $\mu_z$ and $\sigma_x$, $\sigma_z$ are the means and standard deviations of each independent Gaussian component.
$z$ is chosen to be $K$-dimensional. Hidden features are extracted from $x$ and $z$ by separate hidden layers $f_\phi(x)$ and $f_\theta(z)$. Gaussian parameters of $x$ and $z$ are then derived from the hidden features. The means are derived from linear layers: $\mu_x = W^\top_{\mu_x} f_\theta(z) + b_{\mu_x}$ and $\mu_z = W^\top_{\mu_z} f_\phi(x) + b_{\mu_z}$. The standard deviations are derived from soft-plus layers, plus a non-negative small number $\epsilon$: $\sigma_x = \operatorname{SoftPlus}[W^\top_{\sigma_x} f_\theta(z) + b_{\sigma_x}] + \epsilon$ and $\sigma_z = \operatorname{SoftPlus}[W^\top_{\sigma_z} f_\phi(x) + b_{\sigma_z}] + \epsilon$, where $\operatorname{SoftPlus}[a] =$