基于集成学习的物联网安全：Fog-to-Things环境下的入侵检测系统

需积分: 0 35 浏览量更新于2024-08-05 收藏 923KB PDF 举报

随着物联网（IoT）应用的日益增长，其安全威胁也相应增加。传统的基于签名的入侵检测系统（IDS）已经不足以应对这种复杂的安全环境，因此，基于机器学习的入侵检测系统（Machine Learning-based IDS）成为了一种有前景的解决方案。本文提出了一种新颖的Ensemble Learning方法应用于Fog-to-Things环境中的安全防护，旨在提高对未知攻击的检测能力和减少误报。 Fog-to-Things架构将云计算、边缘计算和物联网技术融合，为设备间的通信和数据处理提供了更高效的方式。然而，这种分布式环境增加了潜在的攻击面，包括中间人攻击、数据泄露等。作者Illy、Kaddoum、Miranda Moreira、Kuljeet Kaur和Sahil Garg在蒙特利尔École de Technologie Supérieure的电气工程系，针对这一挑战，设计了一种新型的IDS，它利用Ensemble Learning策略，通过整合多个模型的优势，提高了整体的预测性能和泛化能力。 Ensemble Learning是一种集成学习方法，它通过组合多个基础模型（如决策树、支持向量机或神经网络）的预测结果，来增强整体系统的性能。这种方法能够减少单个模型的过拟合风险，同时提高模型的鲁棒性，有助于在实际场景中更好地识别和抵御各种恶意行为。为了确保模型的有效性和实用性，研究者强调了训练和评估过程中的关键要素，即模型必须在真实世界的数据集上进行训练，以确保其在实时部署中的表现和有效性。然而，许多现有文献中的解决方案虽然在实验室环境下展示了高精度，但在实际应用中往往由于数据的非代表性而表现不佳。因此，该研究团队特别关注了数据的代表性和迁移学习的能力，以确保模型能够适应不断变化的威胁模式，并能够在Fog-to-Things环境中无缝运行。本文的核心贡献在于提出了一种基于Ensemble Learning的入侵检测系统，用于增强Fog-to-Things环境的安全保障。通过优化模型的训练策略和真实数据集的选择，作者们旨在解决实际应用中的挑战，提供一个更为可靠且有效的安全解决方案。这不仅有助于提升物联网的安全水平，也为未来智能环境下的安全防范研究提供了有价值的参考框架。

Securing Fog-to-Things Environment Using

Intrusion Detection System Based On Ensemble

Learning

Poulmanogo Illy

∗

, Georges Kaddoum

∗

, Christian Miranda Moreira

∗

, Kuljeet Kaur

∗

, and Sahil Garg

∗

Electrical Engineering Department,

Ecole de Technologie Sup

erieure, Montr

eal, Canada.

Email: poulmanogo.illy.1@ens.etsmtl.ca

Abstract—The growing interest in the Internet of Things (IoT)

applications is associated with an augmented volume of security

threats. In this vein, the Intrusion detection systems (IDS) have

emerged as a viable solution for the detection and prevention

of malicious activities. Unlike the signature-based detection

approaches, machine learning-based solutions are a promising

means for detecting unknown attacks. However, the machine

learning models need to be accurate enough to reduce the number

of false alarms. More importantly, they need to be trained and

evaluated on realistic datasets such that their efﬁcacy can be

validated on real-time deployments. Many solutions proposed

in the literature are reported to have high accuracy but are

ineffective in real applications due to the non-representativity of

the dataset used for training and evaluation of the underlying

models. On the other hand, some of the existing solutions

overcome these challenges but yield low accuracy which hampers

their implementation for commercial tools. These solutions are

majorly based on single learners and are therefore directly

affected by the intrinsic limitations of each learning algorithm.

The novelty of this paper is to use the most realistic dataset

available for intrusion detection called NSL-KDD, and combine

multiple learners to build ensemble learners that increase the

accuracy of the detection. Furthermore, a deployment architec-

ture in a fog-to-things environment that employs two levels of

classiﬁcations is proposed. In such architecture, the ﬁrst level

performs an anomaly detection which reduces the latency of the

classiﬁcation substantially, while the second level, executes attack

classiﬁcations, enabling precise prevention measures. Finally,

the experimental results demonstrate the effectiveness of the

proposed IDS in comparison with the other state-of-the-arts on

the NSL-KDD dataset.

Index Terms—Intrusion detection system, Machine learning,

Ensemble learner, NSL-KDD, Fog-to-Things.

I. INTRODUCTION

The Internet of Things (IoT) paradigm offers prodigious

opportunities to the industries [1]. This technology is expected

to be further active with the imminent Fifth-Generation (5G)

mobile communications system [2]. However, the massive de-

ployment of IoT networks and their usage in critical domains

such as smart housing, smart transportation, and e-health,

results in the generation of abundant sensitive data on real-time

basis. Due to this reason, these networks are deemed to be one

of the most vulnerable sites for different security attacks and

risks. To tackle this issue, many research studies have been

This research was supported by the NSERC B-CITI CRDPJ 501617-16

grant.

focused on the ﬁrst security layer, i.e., the prevention layer.

Thus, stronger authentication, authorization, and cryptography

techniques have been proposed in the literature. However,

despite the deployment of such strong security measures, a

system can still be compromised by an enduring adversary

using advanced techniques or high computational resources.

Therefore, under any prevention layer, there must be an

intrusion detection layer. This is the motivation for the devel-

opment of intrusion detection systems (IDS). Majority of in-

trusion detection solutions deployed commercially implement

signature-based approaches. Unlike the signature-based IDS,

the machine learning-based IDS are capable of detecting even

unknown attacks. Nevertheless, the fundamental challenge in

this direction involves the designing of an efﬁcient machine

learning based IDS that performs well on real-time data.

The majority of machine learning-based IDS proposed in

the literature have been built on KDDCUP'99 dataset [3]. The

corresponding evaluations results indicate impressive perfor-

mances in terms of high accuracy (99%) and negligible false

positive rate (1%) [4]–[6]. Despite their good performances,

the existing solutions are still not employed widely in com-

mercial tools, relatively to the signature-based approaches. To

understand this situation, the work in [7] conducted a statistical

analysis on KDDCUP'99 dataset and found some important

issues, mainly induced by a huge number of redundant records.

To address these problems, the authors provided a new dataset

named NSL-KDD (comprising of KDDTrain+, KDDTest+,

and KDDTest-21) that is more realistic and challenging

enough to compare different solutions. Based on these reﬁne-

ments, many machine learning methods have been proposed

and compared in the literature. In [7], the authors implemented

ﬁve different methods, namely Naive Bayes/Decision-Tree,

Random Tree, Decision Tree J48, Random Forest, and Multi-

Layer Perceptron on the reﬁned datasets that led to overall

accuracy of 82.02% on KDDTest+ and 66.16% on KDDTest-

21 datasets respectively. To improve the detection, the work

in [8] employed different feature selection metrics at the

pre-processing phase for dimensionality reduction on NSL-

KDD dataset. Overall, accuracy of 82.32% and 66.77% were

achieved on KDDTest+ and KDDTest-21 respectively, which

is quite a small performance improvement. Ibrahim et al.

in [9] employed a Self-Organization Map (SOM) Artiﬁcial

Neural Network (ANN) which is an unsupervised learning

2019 IEEE Wireless Communications and Networking Conference (WCNC)

下载后可阅读完整内容，剩余6页未读，立即下载

张盛锋

粉丝: 30
资源: 297

基于集成学习的物联网安全：Fog-to-Things环境下的入侵检测系统

Securing-Optimizing-Linux-The-Hacking-Solution-v3.0.pdf

改进lbp代码matlab-securing-online-transaction-using-face-recognition:使用面部识别

Securing Android-Powered Mobile Devices Using SELinux

securing-files-using-nodejs-crypto-yt

Securing Scriba-开源

securing-debian-howto.en

Securing acoustics-based short-range communication systems: an overview

Networkers2009：BRKSEC-3003 - Advanced IPv6 Security: Securing Link-Operations at First Hop

securing-debian-howto.en,refcard.en,debian-faq.en,install

securing-portlets-with-spring-security

最新资源