检测与衡量搜索中毒：SURF系统分析

需积分: 10 14 浏览量更新于2024-09-16 收藏 628KB PDF 举报

"SURF: Detecting and Measuring Search Poisoning - Long Lu, Roberto Perdisci, Wenke Lee" 本文探讨了一种新兴且极具攻击性的黑帽SEO（Search Engine Optimization）策略，即“搜索中毒”（Search Poisoning）。黑帽SEO通常被用来提升网站在搜索结果中的排名，而搜索中毒则更为恶劣，它不考虑关键词的相关性，而是针对热门搜索词进行恶意篡改，目的是将大量用户引导至短期流量大的恶意网站。搜索中毒的威胁在于它能迅速地将无辜的搜索者重定向到可能存在恶意活动的网站，例如诈骗、安装恶意软件或盗取个人信息。这种行为不仅破坏了搜索引擎的正常功能，还对用户的在线安全构成严重威胁。为了准确检测搜索中毒现象，作者提出了一个名为SURF（Search Result Fidelity）的新检测系统。SURF作为一个浏览器组件运行，能提取一系列稳健的特征，这些特征可以用来识别搜索结果中的异常行为。系统通过对搜索结果的监控和分析，能够识别出与搜索关键词无关或不相关的链接，以及那些试图通过欺骗手段提高排名的网站。 SURF的工作原理主要包括以下几个步骤： 1. **数据收集**：系统首先收集用户的搜索请求和对应的搜索结果，包括排名靠前的网页URL。 2. **特征提取**：然后，SURF分析这些网页的元数据、内容和链接结构，提取关键特征，如关键词密度、页面质量指标、外部链接数量等。 3. **异常检测**：通过机器学习算法，如支持向量机（SVM）或深度学习模型，建立正常的搜索结果模式，并识别出与正常模式显著偏离的搜索结果。 4. **风险评估**：对于被标记为异常的搜索结果，系统会进一步评估其潜在的风险，如是否指向已知恶意域名、是否包含恶意代码等。 5. **反馈机制**：最后，SURF将检测结果反馈给用户，提供安全警告或直接阻止访问可能有害的网站。 SURF的出现，为防止搜索中毒提供了有效的技术手段，有助于保护用户免受网络欺诈和恶意攻击。然而，随着攻击手段的不断演变，检测系统也需要持续更新和优化，以应对日益复杂的网络安全挑战。搜索中毒是当前互联网安全领域的一大问题，而SURF作为检测和防御工具，为解决这个问题提供了重要的研究和实践价值。在未来，类似的研究将进一步推动搜索引擎安全性和用户体验的提升。

SURF: Detecting and Measuring Search Poisoning

Long Lu

College of Computing

Georgia Inst. of Technology

long@cc.gatech.edu

Roberto Perdisci

Dept. of Computer Science

University of Georgia

perdisci@cs.uga.edu

Wenke Lee

College of Computing

Georgia Inst. of Technology

wenke@cc.gatech.edu

ABSTRACT

Search engine optimization (SEO) techniques are often abused to

promote websites among search results. This is a practice known

as blackhat SEO. In this paper we tackle a newly emerging and

especially aggressive class of blackhat SEO, namely search poi-

soning. Unlike other blackhat SEO techniques, which typically at-

tempt to promote a website’s ranking only under a limited set of

search keywords relevant to the website’s content, search poison-

ing techniques disregard any term relevance constraint and are em-

ployed to poison popular search keywords with the sole purpose of

diverting large numbers of users to short-lived trafﬁc-hungry web-

sites for malicious purposes.

To accurately detect search poisoning cases, we designed a novel

detection system called SURF. SURF runs as a browser component

to extract a number of robust (i.e., difﬁcult to evade) detection fea-

tures from search-then-visit browsing sessions, and is able to ac-

curately classify malicious search user redirections resulted from

user clicking on poisoned search results. Our evaluation on real-

world search poisoning instances shows that SURF can achieve a

detection rate of 99.1% at a false positive rate of 0.9%. Further-

more, we applied SURF to analyze a large dataset of search-related

browsing sessions collected over a period of seven months starting

in September 2010. Through this long-term measurement study we

were able to reveal new trends and interesting patterns related to a

great variety of poisoning cases, thus contributing to a better un-

derstanding of the prevalence and gravity of the search poisoning

problem.

Categories and Subject Descriptors

H.3.3 [INFORMATION STORAGE AND RETRIEVAL]: Infor-

mation Search and Retrieval—Relevance feedback

General Terms

Security

Keywords

Search engine poisoning, Malicious search engine redirection,

Detection, Measurement

Permission to make digital or hard copies of all or part of this work for

personal or classroom use is granted without fee provided that copies are

not made or distributed for proﬁt or commercial advantage and that copies

bear this notice and the full citation on the ﬁrst page. To copy otherwise, to

republish, to post on servers or to redistribute to lists, requires prior speciﬁc

permission and/or a fee.

CCS’11, October 17–21, 2011, Chicago, Illinois, USA.

1. INTRODUCTION

Search engines, capable of digging out the most relevant from

oceans of information, have become web surfers’ ﬁrst choice when

seeking information on the web. In fact, for most websites more

than 70% of their visitors reach their pages through search en-

gines [6]. Therefore, website owners always strive to attract more

visits by optimizing their exposure in relevant search results. To

fulﬁll this need, web developers use a number of search engine op-

timization (SEO) techniques, which can improve the visibility of a

website to the search crawlers, highlight its relevance under certain

search terms, and promote its raking in the search results.

Legitimate uses of SEO techniques are accepted and even en-

couraged by search engines [1]. However, dishonest web devel-

opers may choose to abuse these techniques in various ways to

gain (or cheat) a favorable ranking in the search results, a prac-

tice known as blackhat SEO. In this case, search crawlers are pre-

sented with deceptive views of a website, which consist of spe-

cially crafted webpages with inﬂated relevance to a set of target

search terms. Attempts to counter blackhat SEO have been pro-

posed mainly in the information retrieval community [18, 24], but

with very limited success against the recent surge of blackhat SEO

adopters [11]. In the meantime, blackhat SEO has not captured

sufﬁcient attention from the security community, perhaps because

such techniques have been historically employed by non-harmful

websites, including some high proﬁle ones [10], that execute overly

aggressive marketing strategies to win search users from their com-

petitors.

This paper tackles a newly emerging class of blackhat SEO tech-

niques developed by Internet miscreants to lure search users into

visiting malicious websites [7]. We refer to this new class of black-

hat SEO as search poisoning. Unlike other blackhat SEO tech-

niques, which typically attempt to promote a website’s ranking

only under a limited set of search keywords relevant to the web-

site’s content, search poisoning techniques disregard any term rel-

evance constraint. In practice, search poisoning techniques target

any search term that can maximize the number of incoming search

users (e.g., popular keywords). This is in contrast with SEO or

other blackhat SEO techniques adopted by regular websites, be-

cause if search poisoning were to be used to promote a regular web-

site, users landing on the website via completely unrelated search

terms may get annoyed and the website’s reputation may be ir-

reparably damaged. Therefore, we posit that search poisoning can-

not be used for legitimate purposes and is only useful to short-lived

trafﬁc-hungry websites that aim to attract search users for malicious

purposes.

We approach the search poisoning problem from a new angle,

compared to previous work on blackhat SEO. We focus on detect-

ing malicious search user redirections, an essential component of

下载后可阅读完整内容，剩余9页未读，立即下载

saga111

粉丝: 0
资源: 1

检测与衡量搜索中毒：SURF系统分析

SuRF: Practical Range Query Filtering with Fast Succinct Tries原文

SURF: Speeded Up Robust Features

SURF::create

SURF::create参数

opencv SURF::create 参数

使用cv::Ptr<cv::xfeatures2d::SURF> surf = cv::xfeatures2d::SURF::create(); stitcher->setFeaturesFinder(surf);报错LNK2019和LNK1120应该怎么解决

在OpenCV4.6版本下，C++编写的程序中使用了cv::Ptrcv::xfeatures2d::SURF surf = cv::xfeatures2d::SURF::create(); stitcher->setFeaturesFinder(makePtr<SurfFeatureDetector>());提示错误"cv:.xfeatures2d:SURF”:无法实例化抽象类，应该怎么解决

SURF:SURF Project Trinity College都柏林

exwm-surf:exwm下的Surf接口

surf:冲浪和冲浪-PI

最新资源