提升白盒模糊测试：基于语法规则的方法

需积分: 10 126 浏览量更新于2024-09-10 收藏 169KB PDF 举报

"Grammar-based Whitebox Fuzzing - Enhancing Security Testing for Complex Structured-Input Applications" 在软件安全领域，白盒模糊测试（whitebox fuzzing）是一种自动化动态测试生成技术，它依赖于符号执行和约束求解来检测大型应用程序的安全漏洞。然而，当前的白盒模糊测试在面对输入高度结构化的应用程序，如编译器和解释器时，其效果受到限制。这些应用通常会通过多个阶段处理输入，如词法分析、解析和评估。由于早期处理阶段的控制路径数量庞大，白盒模糊测试往往难以深入到应用的更深层次。 Patrice Godefroid、Adam Kiezun 和 Michael Y. Levin 在他们的研究中提出了一个创新方法，即基于语法规则的白盒模糊测试（Grammar-based Whitebox Fuzzing），旨在增强对复杂结构输入应用的安全测试。他们认为，通过为应用的有效输入提供语法规范，可以更有效地指导模糊测试，从而突破早期处理阶段的局限，深入到应用的更深层部分。传统的模糊测试通常依赖于随机变异策略，这种方法对于无结构或低结构输入可能有效，但对于编译器和解释器等应用，它们需要遵循特定的语法规则。基于语法规则的模糊测试引入了一种新的动态策略，允许测试生成器根据应用的输入语法规则构造有效的输入，这样可以增加测试覆盖率，发现更多潜在的漏洞。该方法的工作原理包括以下步骤： 1. 语法定义：首先，为应用的输入格式定义一个形式化的上下文无关文法（CFG）或其他类型的语法规则，这可以确保生成的测试用例是有效的。 2. 动态引导：在测试过程中，利用符号执行跟踪程序执行路径，并结合约束求解，动态地调整输入生成，以探索新的控制路径。 3. 路径强化：通过优先考虑可能导致新路径的输入变异，避免陷入重复的路径循环，从而更有效地覆盖代码。 4. 反馈循环：测试生成器不断学习并优化输入生成过程，以更好地探索未被覆盖的代码区域。这种方法的优势在于，它能够有效地生成符合应用输入格式的测试用例，这比传统模糊测试更有可能触发深藏在应用内部的错误和安全漏洞。通过这种方式，开发者可以针对那些通常难以触及的部分进行更深入的安全测试，从而提高整个软件系统的安全性。总而言之，基于语法规则的白盒模糊测试是一种强大的工具，它改进了现有的模糊测试技术，尤其对于处理复杂结构输入的应用程序而言。通过精确地构造输入以匹配应用的语法规则，该方法能够更高效地发现潜在的安全问题，从而提升软件的安全性。这对于维护和开发关键系统，如编译器和解释器，具有重大的实际意义。

Grammar-based Whitebox Fuzzing

Patrice Godefroid

Microsoft Research

Redmond, WA, USA

pg@microsoft.com

Adam Kie

zun

Massachusetts Institute of

Technology

Computer Science and Artiﬁcial

Intelligence Laboratory

Cambridge, MA, USA

akiezun@mit.edu

Michael Y. L evin

Microsoft Center for Software

Excellence

Redmond, WA, USA

mlevin@microsoft.com

Abstract

Whitebox fuzzing i s a form of automatic dynamic test gen-

eration, based on symbolic execution and constraint solving,

designed for security testing of large applications. Unfortu-

nately, the current effectiveness of whitebox fuzzing is lim-

ited when testing applications with highly-structured inputs,

such as compilers and interpreters. These applications pro-

cess their inputs in stages, such as lexing, parsing and evalu-

ation. Due to the enormous number of control paths in e arly

processing stages, whitebox fuzzing rarely reaches parts of

the application beyond those ﬁrst stages.

In this paper, we study how to enhance whitebox fuzzing

of comp lex structured-input applications with a grammar-

based speciﬁcation of their valid inputs. We present a novel

dynamic test g eneration algorithm where symbolic execu-

tion d irectly generates grammar-based constraints whose

satisﬁability is checked using a custom grammar-based con-

straint solver. We have impl emented this algorithm and eval-

uated it on a large security-critical application, the JavaScript

interpreter of Internet Explorer 7 (IE7). Results of our ex-

periments show that grammar-based whitebox fuzzing ex-

plores deeper program paths and avoids dead-ends due to

non-parsable inputs. Compared to regular whitebox fuzzing,

grammar-based whitebox fuzzing increased coverage of the

code generation module of the IE7 JavaScript interpreter

from 53% to 81% while us ing three times fewer tests.

Categories and Subject Descriptors D.2.4 [Software Engi-

neering]: Software/Program Ve riﬁcation; D.2.5 [Software En-

gineering]: Testing and Debugging; F.3.1 [Logics and Mean-

ings of Programs]: Specifying and Verifying and Reasoning

about Programs

General Terms Veri ﬁcation, Algorithms, Reliability

Keywords Software Testing, Automatic Test Generation,

Grammars, Program Veriﬁcation

Permission to make digital or hard copies of all or part of this work for

personal or classroom use is granted without fee provided that copies are

not made or distributed for proﬁt or commercial advantage and that copies

bear this notice and the full citation on the ﬁrst page. To copy otherwise, to

republish, to post on servers or to redistribute to lists, requires prior speciﬁc

permission and/or a fee.

PLDI’08,

June 7–13, 2008, Tucson, Arizona, USA.

 2008 ACM 978-1-59593-860-2/08/06. . . $5.00

1. Introduction

Blackbox fuzzing is a form of testing, heavily used for ﬁnding

security vulnerabilities in software. It simply consists in ran-

domly modifying well-formed inputs and testing the result-

ing variants [3, 12]. Blackbox fuzzing sometimes uses gram-

mars to generate the well-formed inputs, as well as to encode

application-speciﬁc knowledge and test heuristics for guid-

ing the generation of input variants [1, 37].

A recently propos ed alternative, whitebox fuzzing [ 16],

combines fuzz testing with dynamic test generation [6, 14].

Whitebox fuzzing executes the program under test with an

initial, well-formed input, both concretely and symbolically.

During the execution of conditional statements, symbolic

execution creates constraints on program inputs. Those con-

straints capture how the program uses its inputs, and satis-

fying assignments for the negation of each constraint deﬁne

new inputs that exercise different control paths. Whitebox

fuzzing repeats this process for the newly created inputs,

with the goal of exercising many different control paths of

the program under test and ﬁnding bugs as fast as possi-

ble us ing various search heuristics. In practice, the search is

usually incomplete because the number of f easible control

paths may be astronomical (even inﬁnite) and because the

precision of symbolic execution, constraint generation and

solving is inherently limited. Nevertheless, whitebox fuzzing

has been shown to be very effective in ﬁnding new s ecurity

vulnerabilities in several applications.

Unfortunately, the current effectiveness of whitebox

fuzzing is limited when tes ting applications with highly-

structured inputs. Examples of such applications are com-

pilers and interpreters. These applications process their in-

puts in stages, such as lexing, parsing and evaluation. Due

to the enormous number of control paths in early process-

ing stages, whitebox fuzzing rarely reaches parts of the

application beyo nd these ﬁrst stages. For instance, there

are many possible sequences of blank-spaces/tabs/carriage-

returns/etc. separating tokens in most structured languages,

each corresponding to a different control path in the lexer. In

addition to path explosion, symbolic execution itself may be

defeated already in the ﬁrst processing stages. For instance,

lexers often detect language keywords by comparing their

pre-computed, hard-coded hash values with the hash values

of strings read from the input; this effectively prevents sym-

bolic execution and constraint solving from ever generating

input strings that match those keywords since hash functions

cannot be inversed (i.e., given a constraint x == hash(y)

and a value f or x, one cannot compute a value for y that sat-

isﬁes this constraint).

下载后可阅读完整内容，剩余9页未读，立即下载

chen_zhong_mis

粉丝: 0
资源: 2

提升白盒模糊测试：基于语法规则的方法

Grammatical theory From transformational grammar to constraint-based approaches

Atom-atom-grammar-live-reload,语言语法实时重新加载原子（异步）。对ldez/atom的贡献.zip

grammar-based-suggestion-engine

捕鱼java源码-Awesome-Grammar-Fuzzing:基于语法的模糊研究论文、代码、教程的精选列表

grammar-based-sentence-generator:根据定义的语法和词汇创建随机句子

Grammar-multi-开源

grammar-fixes

grammar-models

grammar-proofreading

Grammar-Checker

最新资源