SemDiff: Finding Semtic Differences in Binary Programs based on Angr

Shi-Chao Wang,Yao Li,Wei-Yang Xu,Chu-Lei Liu,L Long,X Li,Y Dai,H Yang,Y Li

doi:10.1051/itmconf/20171203029

Shi-Chao Wang, Yao Li + Show 7 more

Open Access

https://doi.org/10.1051/itmconf/20171203029

Copy DOI

Abstract

We introduce SemDiff, a novel technology for finding semantic differences between two binary files. Now, the vendor will release the information to patch the previous version which has vulnerability. Then, we can compare the differences and similarities between the two versions to get the unpublished details of the 1day vulnerabilities. Tools, such as BinDiff, BinHunt and iBinHunt, have worked on this project before, however, there are some weaknesses on them. Just like BinDiff, a comparison method based on structure, can not be effective for judging the semantic differences. Though the other two tools(BindHunt and iBinHunt) can recognize the differences we focus on, they can not effectively verify the functional inlining and spend a pretty long time to finish the process because the use of graph-based isomorphism algorithm. In the paper, we first propose SemDiff, which uses the existing tool(angr) to generate the intermediate language(VEX). Then, because of the nature of program, the data read from and written to the memories, we record these information to implement the comparison. Last, an improved BinDiff algorithm is used to match the basic blocks. In this paper, we take some real vulnerabilities as examples, such as CVE-2010-3974-Microsoft Windows to test our tool, reaching a good goal, matching more blocks than BinDiff and taking less time than BinHunt and iBinHunt.

Highlights

For the purpose to protect the source code, many software vendors make the source code of their programs unavailable and when the vulnerabilities occurs, the patch is released in binary mode, rather than the source code
We propose a new method called SemDiff to find the semantic differences between the two programs
Our method is based on the control flow on basic blocks, symbolic execution[8] and the theorem prover

Summary

Introduction

For the purpose to protect the source code, many software vendors make the source code of their programs unavailable and when the vulnerabilities occurs, the patch is released in binary mode, rather than the source code. As Microsoft and other companies, when they publish a patch, no details are showed [1] This situation increases the difficulty in analyzing the potential vulnerabilities to protect us from those threats, which may hijack our data, steal our privacy and so on. Our method is based on the control flow on basic blocks, symbolic execution[8] and the theorem prover. We record the data written to and read from memories and registers, and we put them into theorem prover to judge the similarity. Last, we use these information and the SemDiff to match the blocks.

System Architecture

SemDiff Algorithm

Symbolic Execution and Theorem Proving

Experiment

Summary

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ITM Web of Conferences	Publication Date: Jan 1, 2017
Citations: 6	License type: cc-by

R Discovery Prime

R Discovery Prime

SemDiff: Finding Semtic Differences in Binary Programs based on Angr

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences

Lead the way for us

Similar Papers

Understand Code Style: Efficient CNN-Based Compiler Optimization Recognition System
Shouguo Yang ... Limin Sun
-
Shouguo Yang, et. al.Shouguo Yang ... Limin Sun
01 May 2019
01 May 2019

BinHunt: Automatically Finding Semantic Differences in Binary Programs
Debin Gao ... Michael K Reiter
-
Debin Gao, et. al.Debin Gao ... Michael K Reiter
01 Jan 2008
01 Jan 2008

Using Binary Code Instrumentation in Computer Security
Marius Popa ... Sergiu Marin Capisizu
Informatica Economica | VOL. 17
Marius Popa, et. al.Marius Popa ... Sergiu Marin Capisizu
30 Dec 2014
Informatica Economica | VOL. 17

File Content-based Malware Classification
Mahendra Deore ... Chhaya S Gosavi
-
Mahendra Deore, et. al.Mahendra Deore ... Chhaya S Gosavi
08 May 2024
08 May 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SemDiff: Finding Semtic Differences in Binary Programs based on Angr

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: ITM Web of Conferences