Android Malware Detection: Applying Keyword Vectors and SVM


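This research paper examines malware detection on Android smartphones using keyword vectors and a support vector machine (SVM). The authors are Sun Junmei, Yan Kai, Liu Xuejiao, Yang Chunlei, and Fu Yaoyin of the Hangzhou Institute of Service Engineering, Hangzhou Normal University. The paper proposes a new method based on feature extraction from Java source code to address the effective detection of new malware and its variants.

As smartphones have developed, and the Android platform in particular has become widespread, the amount of malware on mobile devices keeps growing and poses a serious threat to the security of users' information. Traditional malware detection methods rely mainly on features extracted from binary programs, but they perform poorly on new malware and variants. The paper therefore proposes a method that extracts features from the Java source code of Android malware.

The key innovation is the keyword correlation distance. The method considers the associations among API calls, Android permissions, common parameters, and specific keywords found in malicious code. By computing the correlation between these key elements, the researchers build a more complete feature representation that helps identify patterns characteristic of malware.

The study then applies the Support Vector Machine (SVM) algorithm. SVM is a supervised learning model used for classification and regression analysis, and it often performs well on small datasets. Here, an SVM is trained so that the system can adapt to newly appearing malware samples. By learning the key features of known malware, the SVM classifies new, unknown samples as malicious or benign, which allows new malware to be detected quickly.

The robustness and generalization ability of SVM also make it well suited to high-dimensional data such as keyword vectors. It searches for an optimal decision boundary in the data, which reduces the likelihood of false positives and false negatives. A potential advantage of the method is that it can still judge a previously unseen malware sample by relying on feature patterns it has already learned.

In summary, the paper proposes a malware detection framework based on Java source code feature extraction and SVM classification, with the aim of improving the efficiency and accuracy of detecting new malware and variants. The approach is relevant to strengthening the security of Android devices and offers ideas and techniques for future research in mobile security.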

Malware Detection on Android Smartphones using Keywords Vector and SVM

Junmei Sun*, Kai Yan, Xuejiao Liu, Chunlei Yang, Yaoyin Fu

Hangzhou Institute of Service Engineering

Hangzhou Normal University

Hangzhou, China

junmeisun@hznu.edu.cn

2015112011003@stu.hznu.cn

liuxuejiao@hznu.edu.cn

1027721710@qq.com

2015112011014@stu.hznu.cn

Abstract—With the development of smartphones, more and more mobile malware has appeared on the market, especially on popular platforms such as Android, and it can harm users' information. Effectively detecting new malware and malware variants remains a difficult problem. In contrast to the traditional feature extraction methods based on binary programs, this paper presents a method for feature extraction from Java source code. The method uses the Keywords Correlation Distance to compute the correlation between key code elements such as API calls, Android permissions, common parameters, and common keywords in Android malware source code. An SVM is then applied so that the system can adapt to new malicious software samples and thus detect both new and existing malware. This method differs from conventional methods, which are based on the context of the text: it combines the characteristics of the malware categories and the operating environment to record the behavior of the malicious software. Experiments show that the method is efficient and effective in detecting malware on the Android platform.

Keywords—Android; Malware; Keywords Correlation Distance; SVM

I. INTRODUCTION

With the advent of the Internet era, smartphones are becoming more and more popular around the world, especially those running the Android operating system, with its excellent performance. However, Android malware has increased significantly in recent years. It has been highlighted [4] that “among all mobile malware, the share of Android based malware is higher than 46% and still growing rapidly.” Given the rampant growth of Android malware, there is a pressing need to effectively mitigate or defend against it [5]. Unfortunately, most malware detection methods are based on traditional content signatures: they maintain a list of malware signature definitions and compare each application against the database of known malware signatures. The disadvantage of this detection method is that users are protected only from malware detected by the most recently updated signatures, not from new malware [6]. Some research proposes detecting malware based on statically requested permissions. This detection method is not reliable, mainly because developers can freely request any permission they want, so they can mimic the requested permissions of benign applications. Other research runs the app dynamically in a sandbox to capture its runtime activities. But analyzing an app's dynamic runtime behavior requires sophisticated skills and platforms, which makes it a time-consuming process with high cost overhead.
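The limitation of signature matching noted above can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that a signature is the SHA-256 digest of the application bytes; the byte strings and the `KNOWN_SIGNATURES` set are hypothetical and do not come from the paper or from any real scanner.

```python
import hashlib

def signature(app_bytes: bytes) -> str:
    """Assumed signature scheme: SHA-256 digest of the raw application bytes."""
    return hashlib.sha256(app_bytes).hexdigest()

# Hypothetical payloads standing in for application contents.
known_malware = b"sendTextMessage('12345', 'SUBSCRIBE')"
variant = b"sendTextMessage('12345', 'SUBSCRIBE ')"  # differs by one byte

# Hypothetical signature database containing only the known sample.
KNOWN_SIGNATURES = {signature(known_malware)}

def is_flagged(app_bytes: bytes) -> bool:
    """Flag an app only if its digest is already in the signature database."""
    return signature(app_bytes) in KNOWN_SIGNATURES

flagged_known = is_flagged(known_malware)
flagged_variant = is_flagged(variant)
```

Under this assumed scheme the known sample is flagged while the one-byte variant is not, which is the gap that behavior-oriented features aim to close.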

Motivated by the above observations, we propose a feature extraction method based on the keywords vector. Each keywords vector is a set of keywords that can jointly complete a malicious attack. A single request on its own may do no harm to users; harm is usually done by a series of malicious operations.
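The idea that a keywords vector groups keywords which only together amount to an attack can be sketched in Python as follows. This is a simplified illustration, not the paper's extraction procedure: the group names, the keywords in `KEYWORD_VECTORS`, and the all-keywords-present rule are assumptions, and the Keywords Correlation Distance defined later in the paper is not modeled here.

```python
import re

# Hypothetical keyword groups. Each group lists keywords that, taken together,
# could carry out one malicious behavior. The contents are illustrative only.
KEYWORD_VECTORS = {
    "sms_fraud": ["SEND_SMS", "SmsManager", "sendTextMessage"],
    "contact_leak": ["READ_CONTACTS", "ContactsContract", "HttpURLConnection"],
}

def tokenize(source: str) -> set:
    """Return the set of identifier-like tokens found in the source text."""
    return set(re.findall(r"[A-Za-z_][A-Za-z0-9_]*", source))

def extract_features(source: str) -> list:
    """Emit 1 for each keyword group whose keywords all occur, else 0."""
    tokens = tokenize(source)
    return [int(all(k in tokens for k in group))
            for group in KEYWORD_VECTORS.values()]

SAMPLE_SOURCE = """
// requests android.permission.SEND_SMS in the manifest
SmsManager sms = SmsManager.getDefault();
sms.sendTextMessage(number, null, body, null, null);
"""

features = extract_features(SAMPLE_SOURCE)  # -> [1, 0]
```

A source file that only obtains an `SmsManager` without the permission or the send call yields `[0, 0]` under this rule, reflecting the observation that a single request alone may be harmless.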

The contributions of this paper are summarized as follows.

First, we propose a feature extraction method based on the keywords correlation distance, which differs from the traditional methods based on binary programs.

Second, we use a feature vector to describe malicious software features, including not only permissions and APIs but also common parameters, common packages, and so on.

Third, we give a malware detection method that applies SVM to the feature vector set and can detect new malware and malware variants.
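To make the third contribution concrete, the following Python sketch trains a linear SVM by subgradient descent on the regularized hinge loss. This section of the paper does not state the kernel, solver, or hyperparameters used, so the training routine, the learning rate `lr`, the regularization strength `lam`, and the toy binary feature vectors are all assumptions chosen for illustration.

```python
def train_linear_svm(X, y, epochs=500, lr=0.1, lam=0.01):
    """Subgradient descent on lam/2 * ||w||^2 + hinge loss (assumed setup)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            # The L2 regularizer shrinks the weights on every step.
            w = [wj - lr * lam * wj for wj in w]
            if margin < 1:
                # The sample violates the margin, so step along y * x.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b

def predict(w, b, x):
    """Return +1 (malicious) or -1 (benign) from the sign of the score."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Toy data: each column marks whether a hypothetical keyword group matched.
X = [[1, 0, 0], [0, 1, 0], [1, 1, 0], [0, 0, 0], [0, 0, 1]]
y = [1, 1, 1, -1, -1]  # +1 = malicious, -1 = benign

w, b = train_linear_svm(X, y)
train_predictions = [predict(w, b, x) for x in X]
```

A linear model is used here only because it keeps the sketch short; a kernel SVM from an established library would be the more usual choice in practice.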

The rest of this paper is organized as follows. Section II presents the system framework. Section III gives the definition of the keywords correlation distance. Section IV proposes the feature extraction method, and Section V shows the detection method using SVM. Section VI gives the experimental results. Finally, we present related work in Section VII and summarize the paper in Section VIII.

II. SYSTEM FRAMEWORK

In this section, we introduce the overall framework of the proposed malware detection scheme. The system framework is shown in Figure 1. Our system is mainly divided into two

This research is supported in part by the following funds: National Natural Science Foundation of China under grant number 61502134, Zhejiang Provincial Science and Technology Innovation Program under grant number 2013TD03, and Hangzhou Science and Technology Development Plan (Grant No. 20170533B04).

*Corresponding author: junmeisun@hznu.edu.cn.

ICIS 2017, May 24-26, 2017, Wuhan, China
