DGA Botnet Detection: AutoEncoded Domains with Mean Activation


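This article examines a novel method for detecting DGA (Domain Generation Algorithm) botnets: AutoEncoded Domains with Mean Activation. The authors, Binay Dahal and Yoohwan Kim of the Department of Computer Science at the University of Nevada, Las Vegas, address the challenges that DGA botnets pose in modern network environments.

DGA botnets use a generation algorithm to produce large numbers of unpredictable domain names, through which bots reach the command and control (C&C) server. This avoids the weakness of a static IP address, which can be identified and blocked, and it defeats traditional detection methods based on specific IP addresses or static features. The authors note that earlier countermeasures mainly extracted features from the alphanumeric distribution of DGA-generated domains and applied classification techniques to identify malicious domains. Other studies analyzed network logs or used time series analysis to find anomalous behavior. These methods tend to rely on predefined features, however, and may fall short as DGA techniques evolve.

AutoEncoded Domains with Mean Activation uses an autoencoder, a deep learning model, to learn a more abstract representation that better captures the latent patterns of DGA-generated domains. An autoencoder compresses its input and tries to reconstruct it, which lets it extract the intrinsic structure of the data, so that classification can remain effective even when the generation rules change. Introducing mean activation into the method may further strengthen the model's ability to discriminate anomalous domains and reduce false positives and false negatives. The article contributes a new perspective, namely using deep learning to mine the intrinsic patterns of dynamically generated domains for DGA botnet detection, and offers ideas for future malware analysis and defense strategies.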

AutoEncoded Domains with Mean Activation for DGA Botnet Detection

Binay Dahal

Department of Computer Science

University of Nevada, Las Vegas

Las Vegas, Nevada

Email: binay.dahal@unlv.edu

Yoohwan Kim

Department of Computer Science

University of Nevada, Las Vegas

Las Vegas, Nevada

Email: yoohwan.kim@unlv.edu

Abstract—Botnets are a powerful and effective way of performing malicious activities over the internet. Over the years, they have evolved into many forms. Earlier bots used a static IP address to communicate with their command and control (C&C) server. This method stopped working as soon as that specific IP was identified and blocked. These days, domain fluxing botnets are the most common in practice. The idea is to use a Domain Generation Algorithm (DGA) to generate domains and use them to connect with the C&C server. Numerous studies have been done on detecting DGA botnets. These include deriving features based on the alphanumeric distribution of DGA domains and performing classification on them. Other studies include network log analysis, time series analysis, etc. Most of these domain classification works rely upon the features developed and may not work well if the botmaster decides to generate domains with completely new features. We are concerned with developing an algorithm that is resilient to feature change and that also works well for domains generated by a completely new algorithm that was not seen before. We generated a 16-bit representation of domains using an autoencoder and classified it as benign or DGA-generated using supervised learning (with a neural net and an SVM). To make it work with previously unseen algorithms, we tweaked our method with the mean activation of the 16-bit domain representation. This helped improve classification accuracy for a completely new set of domain generation algorithms by up to 16%.

Keywords—DGA Botnets, AutoEncoder, Malicious Domain Detection.

I. INTRODUCTION

The most prevalent malware these days is the botnet. A botnet is a network of infected computers, called "bots", which communicate with a single Command & Control server (C&C server) to perform malicious tasks such as DDoS attacks, email spamming, click fraud, etc. For the botnet to be effective, it must connect with a single C&C server, which provides the instructions to perform a specific nefarious activity. Using a static IP for the C&C server might not be a good idea, as that specific IP can be discovered and blacklisted. Hence, these days botnets have a new way of communicating with the C&C server. This new method employs "domain fluxing", which is changing the domain names of the server to avoid getting blacklisted.

The bots use some sort of Domain Generation Algorithm (DGA) to generate a large number of domains and try to connect with each of them. As the botmaster already knows the set of domains that are generated by the bots, they can register some of those domains and point them to the C&C server. Once a request sent from a bot to any domain is resolved, the bot connects to the server pointed to by that domain name and starts getting instructions. The domain name of the server is periodically changed to avoid detection. These DGA-based botnets are resilient to IP blacklisting or sinkholing and hence pose a serious challenge to network administrators. But as mentioned above, each of these bots sends hundreds to thousands of requests to the domains it has generated. Only a few (one or two) domains are actually registered, so there are a lot of unresolved DNS responses with NXDOMAIN. This trend encouraged network security researchers to look into network data to find out whether a host is infected with a bot.
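The rendezvous mechanism described above can be illustrated with a toy generator. This is a minimal sketch and not any real botnet's algorithm: the function name `toy_dga`, the seed string, the hash-based construction, and the `.com` suffix are all assumptions made for illustration. It shows only that a bot and a botmaster who share a seed and a date can derive the same candidate list independently, and that most candidates are never registered.

```python
import hashlib
from datetime import date
from typing import List


def toy_dga(seed: str, day: date, count: int = 10, tld: str = ".com") -> List[str]:
    """Derive `count` pseudo-random domains from a shared seed and a date.

    Hypothetical construction for illustration only.
    """
    domains = []
    for i in range(count):
        material = f"{seed}|{day.isoformat()}|{i}".encode()
        digest = hashlib.sha256(material).hexdigest()
        # Map the first 12 hex digits to the letters a-p to form the label.
        label = "".join(chr(ord("a") + int(c, 16)) for c in digest[:12])
        domains.append(label + tld)
    return domains


# The bot and the botmaster run the same function with the same inputs,
# so they agree on the candidate list without ever exchanging it.
today = date(2024, 1, 1)
bot_candidates = toy_dga("hypothetical-seed", today)
master_candidates = toy_dga("hypothetical-seed", today)

# The botmaster registers only a few candidates; every other lookup made
# by the bot would come back as NXDOMAIN.
registered = set(master_candidates[:2])
unresolved = [d for d in bot_candidates if d not in registered]
```

Because the date is part of the hashed input, the candidate list changes every day, which is what makes blacklisting individual domains ineffective.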

Various attempts have been made to detect DGA-based botnets based on network activities. This involves finding some sort of anomaly in a host's network behavior to conclude that it is infected. For example, the DNS records of an infected host contain a lot of unresolved queries. This can be a symptom that it is part of a botnet. Research has been done to analyze the time sequence of a host's activity. If there exists a specific pattern, like sending DNS queries at fixed time intervals, this can also signal that it is part of a botnet. Although these methods of identifying a botnet have yielded pretty good results, they involve either analyzing the complete DNS records, which may not always be available, or manually designing features. If the botmaster decides to alter the botnet in some form, previously devised methods won't work well. Hence, we use a new way of devising features which is resilient to change in the form of botnets. We propose to use deep learning to detect whether an individual domain is DGA-generated or not. Deep neural networks are renowned for automatically engineering features based on a large number of training examples. The deep network developed in this way will be able to detect whether a domain is malicious for new classes of DGAs as well.
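The unresolved-query symptom mentioned above can be sketched as a simple per-host heuristic. This illustrates the earlier log-based approach, not the method proposed in this paper. The record layout `(host, domain, rcode)`, the function names, and the thresholds `min_queries=5` and `threshold=0.8` are hypothetical choices, not values taken from any cited work.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical record layout: (host, queried_domain, response_code).
Record = Tuple[str, str, str]


def summarize(records: Iterable[Record]) -> Dict[str, List[int]]:
    """Count, per host, total DNS responses and NXDOMAIN responses."""
    stats: Dict[str, List[int]] = defaultdict(lambda: [0, 0])
    for host, _domain, rcode in records:
        stats[host][0] += 1
        if rcode == "NXDOMAIN":
            stats[host][1] += 1
    return stats


def suspicious_hosts(records: Iterable[Record],
                     min_queries: int = 5,
                     threshold: float = 0.8) -> List[str]:
    """Flag hosts whose share of NXDOMAIN responses reaches `threshold`."""
    flagged = []
    for host, (total, failed) in summarize(records).items():
        if total >= min_queries and failed / total >= threshold:
            flagged.append(host)
    return sorted(flagged)


# One host produces mostly failed lookups, the other resolves normally.
sample_log = [("10.0.0.5", f"random{i}.com", "NXDOMAIN") for i in range(9)]
sample_log.append(("10.0.0.5", "registered.com", "NOERROR"))
sample_log += [("10.0.0.7", "example.org", "NOERROR")] * 6

flagged = suspicious_hosts(sample_log)  # ["10.0.0.5"]
```

A heuristic like this needs the complete DNS log of each host, which is the limitation the paragraph above raises and the reason for classifying individual domains instead.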

II. RELATED WORKS

From the perspective of the intelligence of the algorithm used, botnet detection in general can be studied under two broad categories: approaches that don't use deep learning and approaches that employ certain deep learning architectures. Here, we review some of the recent trends in botnets and various works in both of those categories that have been proposed to tackle such botnets.

Since static IP-based bots can become ineffective once the network administrator identifies the IP they are trying to
