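TABLE I. COMPARISON AMONG BINARY CODE SIMILARITY APPROACHES. FOR BOOLEAN COLUMNS, ✓ MEANS SUPPORTED AND ✗ UNSUPPORTED.
INPUT COMPARISON CAN BE ONE-TO-ONE (OO), ONE-TO-MANY (OM), OR MANY-TO-MANY (MM). INPUT GRANULARITY AND APPROACH
GRANULARITIES CAN BE INSTRUCTION (I), BASIC BLOCK (B), FUNCTION (F), OR PROGRAM (P). APPROACH COMPARISON CAN BE SIMILAR (S),
IDENTICAL (I), OR EQUIVALENT (E). STRUCTURAL SIMILARITY CAN USE CFG (C), ICFG (I), CALLGRAPH (G), AND OTHER CUSTOM GRAPHS (O).
MACHINE LEARNING CAN BE SUPERVISED (S) OR UNSUPERVISED (U). IN NORMALIZATION, ✗ MEANS NO NORMALIZATION, OPERAND REMOVAL,
• OPERAND NORMALIZATION, ◦ MNEMONIC NORMALIZATION, AND ⋆ CODE ELIMINATION.

| Approach | Year | Venue | Input Comparison | Approach Comparison | Input Granularity | Approach Granularities | Syntactical similarity | Semantic similarity | Structural similarity | Feature-based | Machine learning | Locality sensitive hashing | Cross-architecture | Static analysis | Dynamic analysis | Dataflow analysis | Normalization |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EXEDIFF [25] | 1999 | WCSSS | OO | I | P | I | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| BMAT [32] | 1999 | FDO2 | OO | S,I | P | F,B | ✓ | ✗ | C | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | •◦ |
| F2004 [26] | 2004 | DIMVA | OO | S | P | F | ✗ | ✗ | C,G | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| DR2005 [27] | 2005 | SSTIC | OO | S,I | P | F,B,I | ✓ | ✗ | C,G | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| KKMRV2005 [19] | 2005 | RAID | MM | S | P | B* | ✗ | ✓ | I | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| BMM2006 [20] | 2006 | DIMVA | OO | S | P | B* | ✗ | ✓ | I | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | • |
| BINHUNT [28] | 2008 | ICISC | OO | S,E | P | F,B | ✗ | ✓ | C,G | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
| SWPQS2006 [56] | 2009 | ISSTA | MM | S,I | I* | I* | ✓ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | • |
| SMIT [16] | 2009 | CCS | OM | S,I | P | F | ✓ | ✗ | G | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| IDEA [57] | 2010 | ESSoS | MM | S | P | I* | ✓ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| MBC [58] | 2012 | RACS | MM | S | P | B | ✓ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| IBINHUNT [59] | 2012 | ICISC | OO | S,E | P | B | ✗ | ✓ | I | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ | ✗ |
| BEAGLE [22] | 2012 | ACSAC | MM | S | P | B* | ✗ | ✓ | C | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | • |
| BINHASH [60] | 2012 | ICMLA | MM | E | F | B | ✗ | ✓ | ✗ | ✓ | U | ✓ | ✗ | ✓ | ✗ | ✓ | • |
| BINJUICE [42] | 2013 | PPREW | OO | S,E | P | F,B | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
| BINSLAYER [61] | 2013 | PPREW | OO | S | P | F,B | ✗ | ✗ | C,G | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| RENDEZVOUS [62] | 2013 | MSR | OM | S | F | F | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| MUTANTX-S [17] | 2013 | Usenix ATC | MM | S | P | I* | ✓ | ✗ | ✗ | ✓ | U | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| EXPOSÉ [63] | 2013 | COMPSAC | OM | S,E | P | F,I* | ✓ | ✓ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | • |
| ILINE [23] | 2013 | USENIX Sec | MM | S | P | B,I* | ✓ | ✗ | ✗ | ✓ | U | ✗ | ✗ | ✓ | ✓ | ✗ | •◦⋆ |
| LKI2013 [64] | 2013 | RACS | OO | S | P | F,I* | ✗ | ✗ | C,G | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| TRACY [1] | 2014 | PLDI | OM | S,E | F | I* | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ⋆ |
| BINCLONE [65] | 2014 | SERE | MM | S,I | I* | I* | ✓ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| RMKNHLLP2014 [66] | 2014 | DIMVA | MM | S | F* | F | ✗ | ✗ | ✗ | ✓ | U | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
| CXZ2014 [21] | 2014 | TDSC | OM | S | P | F | ✗ | ✗ | C | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| BLEX [67] | 2014 | USENIX Sec | OO | S | F | F | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ |
| COP [33], [68] | 2014 | ESEC/FSE | OO | S,E | P | F,B | ✗ | ✓ | C | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
| TEDEM [2] | 2014 | ACSAC | OM | S | B* | B | ✗ | ✓ | C | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| SIGMA [69] | 2015 | DFRWS | OO | S | F | F | ✗ | ✗ | O | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| MXW2015 [24] | 2015 | IFIP SEC | OO | E | P | B | ✗ | ✓ | I | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ | •⋆ |
| MULTI-MH [3] | 2015 | S&P | OM | S | B* | B | ✗ | ✓ | C | ✗ | ✗ | ✓ | ✓ | ✓ | ✗ | ✓ | ✗ |
| QSM2015 [70] | 2015 | SANER | OO | I | F | I* | ✗ | ✗ | O | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | •⋆ |
| DISCOVRE [4] | 2016 | NDSS | OM | S | F | B | ✗ | ✗ | C | ✓ | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ |
| MOCKINGBIRD [29] | 2016 | SANER | OM | S | F | F | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ |
| ESH [5] | 2016 | PLDI | OM | E | F | I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ |
| TPM [71] | 2016 | TrustCom | OO | S | P | F | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| BINDNN [72] | 2016 | SecureComm | OM | S | F | F | ✗ | ✗ | ✗ | ✗ | S | ✗ | ✓ | ✓ | ✗ | ✗ | • |
| GENIUS [6] | 2016 | CCS | OM | S | F | B | ✗ | ✗ | C | ✓ | U | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ |
| BINGO [7] | 2016 | FSE | OM | S | F | B*,I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✓ | ⋆ |
| KLKI2016 [18] | 2016 | JSCOMPUT | OO | S | P | F | ✗ | ✗ | G | ✓ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ |
| KAM1N0 [73] | 2016 | SIGKDD | OM | S | B* | B | ✓ | ✗ | C | ✓ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | • |
| BINSEQUENCE [8] | 2017 | ASIACCS | OM | S | F | B,I | ✓ | ✗ | C | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | • |
| XMATCH [9] | 2017 | ASIACCS | OM | S | F | I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✓ | ✗ |
| CACOMPARE [74] | 2017 | ICPC | OM | S | F | F | ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ |
| SPAIN [30] | 2017 | ICSE | OO | S,I | P | F,B | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | • |
| BINSIGN [75] | 2017 | IFIP SEC | OM | S | F | F | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | • |
| GITZ [10] | 2017 | PLDI | OM | E | F | I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ |
| BINSHAPE [76] | 2017 | DIMVA | OM | S | F | F | ✗ | ✗ | ✗ | ✓ | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | • |
| BINSIM [77] | 2017 | USENIX Sec | OO | S | T | I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ |
| KS2017 [31] | 2017 | ASE | OM | S | T | I* | ✗ | ✓ | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ |
| IMF-SIM [78] | 2017 | ASE | OO | S | F | F | ✗ | ✓ | ✗ | ✓ | S | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ |
| GEMINI [12] | 2017 | CCS | OM | S | F | F | ✗ | ✗ | C | ✓ | S | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ |
| FOSSIL [79] | 2018 | TOPS | OM | S | F | F,B* | ✗ | ✓ | C | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| FIRMUP [13] | 2018 | ASPLOS | OM | E | F | I* | ✗ | ✓ | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ | ✗ | ✗ | • |
| BINARM [14] | 2018 | DIMVA | OM | S | F | F | ✗ | ✗ | C | ✓ | ✗ | ✗ | ✗ | ✓ | ✗ | ✗ | • |
| αDIFF [15] | 2018 | ASE | OO | S | P | F | ✗ | ✗ | ✗ | ✗ | S | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ |
| VULSEEKER [11] | 2018 | ASE | OM | S | F | F | ✗ | ✗ | C | ✓ | S | ✗ | ✓ | ✓ | ✗ | ✓ | ✗ |
| RLZ2019 [80] | 2019 | BAR | OM | S | B | B | ✗ | ✗ | ✗ | ✗ | S | ✗ | ✓ | ✓ | ✗ | ✗ | • |
| INNEREYE [81] | 2019 | NDSS | OM | S | B* | B | ✗ | ✗ | ✗ | ✗ | S | ✓ | ✓ | ✓ | ✗ | ✗ | • |
| ASM2VEC [82] | 2019 | S&P | OM | S | F | I* | ✗ | ✗ | ✗ | ✗ | S | ✗ | ✗ | ✓ | ✗ | ✗ | ✗ |
| SAFE [83] | 2019 | DIMVA | OM | S | F | F | ✗ | ✗ | ✗ | ✗ | S | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ |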