GO-PEAS: An Efficient and Accurate Grid-Based Outlier Detection Method Based on Pruning Search


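This paper presents GO-PEAS (Grid-based Outlier detection with Pruning Searching techniques), an algorithm for detecting outliers in large data sources. The authors target the speed bottleneck of traditional grid-based outlier detection methods and propose novel pruning search techniques that make the algorithm considerably more scalable without reducing its accuracy.

The core idea of GO-PEAS is to partition the dataset into a grid structure, with the points inside each grid cell initially regarded as potentially normal. Traditional grid-based methods may still perform intensive computation over large volumes of data, which makes them inefficient. To address this, the authors introduce "pruning search", a data-filtering strategy: the points in the grid are evaluated in advance, and the parts that cannot contain outliers are discarded, which avoids unnecessary computation. Pruning the data space in this way substantially increases execution speed and allows the algorithm to handle very large datasets.

The authors also stress that these optimizations do not sacrifice detection accuracy. The accuracy of the enhanced algorithm is guaranteed to be consistent with that of its baseline version, which does not use the enhancement techniques. This matters because outlier detection has to be accurate as well as fast.

To validate the improvements, the authors carried out an experimental evaluation, comparing GO-PEAS with traditional methods on several datasets in terms of processing time, memory usage and detection accuracy. The results show that GO-PEAS runs markedly faster while keeping its accuracy high, which makes it competitive in practical applications.

In summary, GO-PEAS is a scalable and accurate grid-based outlier detection method. Its main contribution is the use of pruning search techniques to improve efficiency while preserving detection accuracy, showing how efficiency and accuracy can be balanced when detecting outliers in large datasets.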
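The pruning idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the GO-PEAS algorithm itself, whose outlier measure and pruning rules are not given in this excerpt. The sketch swaps in a simple distance-threshold definition (a point is an outlier if fewer than `min_neighbors` other points lie within `radius` of it), and the names `baseline_outliers` and `grid_pruned_outliers` are hypothetical. It shows how a grid can rule out whole cells without any distance computation while returning the same result as the unpruned baseline.

```python
import math
from collections import defaultdict


def baseline_outliers(points, radius, min_neighbors):
    """Brute force: a point is an outlier if fewer than `min_neighbors`
    other points lie within `radius` of it (assumed definition)."""
    outliers = []
    for i, p in enumerate(points):
        count = sum(
            1 for j, q in enumerate(points)
            if j != i and math.dist(p, q) <= radius
        )
        if count < min_neighbors:
            outliers.append(i)
    return outliers


def grid_pruned_outliers(points, radius, min_neighbors):
    """Same definition, but skips every point that sits in a dense cell."""
    dim = len(points[0])
    # Cell side chosen so that the cell diagonal equals `radius`: any two
    # points sharing a cell are then within `radius` of each other
    # (up to floating-point rounding).
    side = radius / math.sqrt(dim)
    cells = defaultdict(list)
    for i, p in enumerate(points):
        key = tuple(math.floor(x / side) for x in p)
        cells[key].append(i)

    outliers = []
    for members in cells.values():
        # Pruning rule: each member already has len(members) - 1 neighbours
        # inside its own cell, so none of them can be an outlier.
        if len(members) - 1 >= min_neighbors:
            continue
        # Only points in sparse cells need the full neighbour count.
        for i in members:
            count = sum(
                1 for j, q in enumerate(points)
                if j != i and math.dist(points[i], q) <= radius
            )
            if count < min_neighbors:
                outliers.append(i)
    return sorted(outliers)
```

For a tight cluster plus a few isolated points, the dense cell holding the cluster is skipped entirely, and only the isolated points are examined. Both functions return the same indices, which mirrors the claim that pruning leaves the baseline's detection result unchanged.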

GO-PEAS: A Scalable Yet Accurate Grid-Based Outlier Detection Method Using Novel Pruning Searching Techniques

Hongzhou Li, Ji Zhang, Yonglong Luo, Fulong Chen and Liang Chang

Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, China
homzh@163.com, changl@guet.edu.cn

University of Southern Queensland, Toowoomba, Australia
ji.zhang@usq.edu.au

Anhui Normal University, Wuhu, China
ylluo@ustc.edu.cn, long005@mail.ahnu.edu.cn

Abstract. In this paper, we propose a scalable yet accurate grid-based outlier detection method called GO-PEAS (stands for Grid-based Outlier detection with Pruning Searching techniques). Innovative techniques are incorporated into GO-PEAS to greatly improve its speed performance, making it more scalable for large data sources. These techniques offer efficient pruning of unnecessary data space to substantially enhance the detection speed performance of GO-PEAS. Furthermore, the detection accuracy of GO-PEAS is guaranteed to be consistent with its baseline version that does not use the enhancement techniques. Experimental evaluation results have demonstrated the improved scalability and good effectiveness of GO-PEAS.

1 Introduction

Outlier detection is an important data analytic/mining problem that aims to find objects and/or patterns that are considerably dissimilar, exceptional and inconsistent with respect to the majority of the data in an input database. Outlier detection has become one of the key enabling technologies for a wide range of applications in industry, business, security, engineering and other areas, where outliers represent abnormal patterns that are critical for domain-specific decision-making and actions.

Due to its inherent importance in various areas, considerable research effort has been devoted to outlier detection, and a number of outlier detection techniques have been proposed that leverage different detection mechanisms and algorithms. Most of them deal with traditional relational datasets and can be generally classified into the distribution-based methods [2], the distance-based methods [4,10], the density-based methods [8,11,13,16–18] and the clustering-based methods [6,9], which feature different levels of performance in terms of detection accuracy and efficiency. The research on outlier detection has also been carried out for other types of datasets such as temporal data

© Springer International Publishing Switzerland 2016
T. Ray et al. (Eds.): ACALCI 2016, LNAI 9592, pp. 125–133, 2016.
DOI: 10.1007/978-3-319-28270-1
