Taint Analysis Research Articles

Previous work has shown that taint analyses are only useful if correctly customized to the context in which they are used. Existing domain-specific languages (DSLs) allow such customization through the definition of deny-listing data-flow rules that describe potentially vulnerable or malicious taint-flows. These languages, however, are designed primarily for security experts who are expected to be knowledgeable in taint analysis. Software developers, however, consider these languages to be complex. This paper thus presents fluent TQL, a query specification language particularly for taint-flows. fluentTQL is internal Java DSL and uses a fluent-interface design. fluentTQL queries can express various taint-style vulnerability types, e.g. injections, cross-site scripting or path traversal. This paper describes fluentTQL’s abstract and concrete syntax and defines its runtime semantics. The semantics are independent of any underlying analysis and allows evaluation of fluent TQL queries by a variety of taint analyses. Instantiations of fluentTQL, on top of two taint analysis solvers, Boomerang and FlowDroid, show and validate fluent TQL expressiveness. Based on existing examples from the literature, we have used fluentTQL to implement queries for 11 popular security vulnerability types in Java. Using our SQL injection specification, the Boomerang-based taint analysis found all 17 known taint-flows in the OWASP WebGoat application, whereas with FlowDroid 13 taint-flows were found. Similarly, in a vulnerable version of the Java Spring PetClinic application, the Boomerang-based taint analysis found all seven expected taint-flows. In seven real-world Android apps with 25 expected malicious taint-flows, 18 taint-flows were detected. In a user study with 26 software developers, fluentTQL reached a high usability score. In comparison to CodeQL, the state-of-the-art DSL by Semmle/GitHub, participants found fluentTQL more usable and with it they were able to specify taint analysis queries in shorter time.

Read full abstract

The popularization of the Android platform and the growing number of Android applications (apps) that manage sensitive data turned the Android ecosystem into an attractive target for malicious software. For this reason, researchers and practitioners have investigated new approaches to address Android’s security issues, including techniques that leverage dynamic analysis to mine Android sandboxes. The mining sandbox approach consists in running dynamic analysis tools on a benign version of an Android app. This exploratory phase records all calls to sensitive APIs. Later, we can use this information to (a) prevent calls to other sensitive APIs (those not recorded in the exploratory phase) or (b) run the dynamic analysis tools again in a different version of the app. During this second execution of the fuzzing tools, a warning of possible malicious behavior is raised whenever the new version of the app calls a sensitive API not recorded in the exploratory phase.The use of a mining sandbox approach is an effective technique for Android malware analysis, as previous research works revealed. Particularly, existing reports present an accuracy of almost 70% in the identification of malicious behavior using dynamic analysis tools to mine android sandboxes. However, although the use of dynamic analysis for mining Android sandboxes has been investigated before, little is known about the potential benefits of combining static analysis with a mining sandbox approach for identifying malicious behavior. Accordingly, in this paper we present the results of two studies that investigate the impact of using static analysis to complement the performance of existing dynamic analysis tools tailored for mining Android sandboxes, in the task of identifying malicious behavior.In the first study we conduct a non-exact replication of a previous study (hereafter BLL-Study) that compares the performance of test case generation tools for mining Android sandboxes. Differently from the original work, here we isolate the effect of an independent static analysis component (DroidFax) they used to instrument the Android apps in their experiments. This decision was motivated by the fact that DroidFax could have influenced the efficacy of the dynamic analyses tools positively—through the execution of specific static analysis algorithms DroidFax also implements. In our second study, we carried out a new experiment to investigate the efficacy of taint analysis algorithms to complement the mining sandbox approach previously used to identify malicious behavior. To this end, we executed the FlowDroid tool to mine the source–sink flows from benign/malign pairs of Android apps used in a previous research work.Our study brings several findings. For instance, the first study reveals that DroidFax alone (static analysis) can detect 43.75% of the malwares in the BLL-Study dataset, contributing substantially in the performance of the dynamic analysis tools in the BLL-Study. The results of the second study show that taint analysis is also practical to complement the mining sandboxes approach, with a performance similar to that reached by dynamic analysis tools.

Read full abstract

Taint Analysis Research Articles

Related Topics

Articles published on Taint Analysis

Quantifying Information Leakage for Security Verification of Compiler Optimizations

Context matters: Methods for Bitcoin tracking

Analyzing Android Taint Analysis Tools: FlowDroid, Amandroid, and DroidSafe

Smart Mobile Information Systems on the Key Systems of Blockchain Privacy Protection

Fluently specifying taint-flow queries with fluentTQL

Fast Graph Simplification for Interleaved-Dyck Reachability

CSChecker : A binary taint-based vulnerability detection method based on static taint analysis

Cefuzz: An Directed Fuzzing Framework for PHP RCE Vulnerability

Automatic protocol reverse engineering for industrial control systems with dynamic taint analysis

Only pay for what you need: Detecting and removing unnecessary TEE-based code

Explaining Static Analysis With Rule Graphs

Ethereum Smart Contract Analysis Tools: A Systematic Review

Hybrid Static-Dynamic Analysis of Data Races Caused by Inconsistent Locking Discipline in Device Drivers

ISmart: Protecting Smart Contract Against Integer Bugs

Irbis: статический анализатор помеченных данных для поиска уязвимостей в программах на C/C++

A mutation framework for evaluating security analysis tools in IoT applications

SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks.

TaintBench: Automatic real-world malware benchmarking of Android taint analyses

Towards Automatic Detection of Nonfunctional Sensitive Transmissions in Mobile Applications

Exploring the use of static and dynamic analysis to improve the performance of the mining sandbox approach for android malware identification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Taint Analysis Research Articles

Related Topics

Articles published on Taint Analysis

Quantifying Information Leakage for Security Verification of Compiler Optimizations

Context matters: Methods for Bitcoin tracking

Analyzing Android Taint Analysis Tools: FlowDroid, Amandroid, and DroidSafe

Smart Mobile Information Systems on the Key Systems of Blockchain Privacy Protection

Fluently specifying taint-flow queries with fluentTQL

Fast Graph Simplification for Interleaved-Dyck Reachability

CSChecker : A binary taint-based vulnerability detection method based on static taint analysis

Cefuzz: An Directed Fuzzing Framework for PHP RCE Vulnerability

Automatic protocol reverse engineering for industrial control systems with dynamic taint analysis

Only pay for what you need: Detecting and removing unnecessary TEE-based code

Explaining Static Analysis With Rule Graphs

Ethereum Smart Contract Analysis Tools: A Systematic Review

Hybrid Static-Dynamic Analysis of Data Races Caused by Inconsistent Locking Discipline in Device Drivers

ISmart: Protecting Smart Contract Against Integer Bugs

Irbis: статический анализатор помеченных данных для поиска уязвимостей в программах на C/C++

A mutation framework for evaluating security analysis tools in IoT applications

SAMLDroid: A Static Taint Analysis and Machine Learning Combined High-Accuracy Method for Identifying Android Apps with Location Privacy Leakage Risks.

TaintBench: Automatic real-world malware benchmarking of Android taint analyses

Towards Automatic Detection of Nonfunctional Sensitive Transmissions in Mobile Applications

Exploring the use of static and dynamic analysis to improve the performance of the mining sandbox approach for android malware identification