Guided pattern mining for API misuse detection by change-based code analysis

Sebastian Nielebock,Frank Ortmeier,Robert Heumüller,Kevin Michael Schott

doi:10.1007/s10515-021-00294-x

Sebastian Nielebock, Frank Ortmeier + Show 2 more

Open Access

https://doi.org/10.1007/s10515-021-00294-x

Copy DOI

Abstract

Lack of experience, inadequate documentation, and sub-optimal API design frequently cause developers to make mistakes when re-using third-party implementations. Such API misuses can result in unintended behavior, performance losses, or software crashes. Therefore, current research aims to automatically detect such misuses by comparing the way a developer used an API to previously inferred patterns of the correct API usage. While research has made significant progress, these techniques have not yet been adopted in practice. In part, this is due to the lack of a process capable of seamlessly integrating with software development processes. Particularly, existing approaches do not consider how to collect relevant source code samples from which to infer patterns. In fact, an inadequate collection can cause API usage pattern miners to infer irrelevant patterns which leads to false alarms instead of finding true API misuses. In this paper, we target this problem (a) by providing a method that increases the likelihood of finding relevant and true-positive patterns concerning a given set of code changes and agnostic to a concrete static, intra-procedural mining technique and (b) by introducing a concept for just-in-time API misuse detection which analyzes changes at the time of commit. Particularly, we introduce different, lightweight code search and filtering strategies and evaluate them on two real-world API misuse datasets to determine their usefulness in finding relevant intra-procedural API usage patterns. Our main results are (1) commit-based search with subsequent filtering effectively decreases the amount of code to be analyzed, (2) in particular method-level filtering is superior to file-level filtering, (3) project-internal and project-external code search find solutions for different types of misuses and thus are complementary, (4) incorporating prior knowledge of the misused API into the search has a negligible effect.

Highlights

We focus on Application Programming Interfaces (APIs) usage patterns that are inferred from existing source code through data mining
It contains the number of all methods in the project (Column A), the number of methods changed in the misuse introducing commit (Column C), and the subset of those methods that were part of an external API (Column E)
Recent research came up with a variety of automatic API misuse detectors that rely on the idea of inferring correct API usages from existing code samples

Summary

Objectives

Since we aim to detect API misuses at the time of the commit, this data is usually not available. Our goal was to find similar code examples without requiring human interaction

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Automated Software Engineering	Publication Date: Aug 17, 2021
Citations: 11	License type: open-access

R Discovery Prime

R Discovery Prime

Guided pattern mining for API misuse detection by change-based code analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Automated Software Engineering

Lead the way for us

Similar Papers

API-misuse detection driven by fine-grained API-constraint knowledge graph
Xiaoxue Ren ... Xinyuan Ye
-
Xiaoxue Ren, et. al.Xiaoxue Ren ... Xinyuan Ye
21 Dec 2020
21 Dec 2020

MisuseHint: A Service for API Misuse Detection Based on Building Knowledge Graph from Documentation and Codebase
Qingmi Liang ... Li Kuang
-
Qingmi Liang, et. al.Qingmi Liang ... Li Kuang
01 Jul 2022
01 Jul 2022

Commits as a basis for API misuse detection
Sebastian Nielebock ... Robert Heumüller
-
Sebastian Nielebock, et. al.Sebastian Nielebock ... Robert Heumüller
03 Sep 2018
03 Sep 2018

Python API Misuse Mining and Classification Based on Hybrid Analysis and Attention Mechanism
Xincheng He ... Lei Xu
International Journal of Software Engineering and Knowledge Engineering | VOL. 33
Xincheng He, et. al.Xincheng He ... Lei Xu
07 Aug 2023
International Journal of Software Engineering and Knowledge Engineering | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Guided pattern mining for API misuse detection by change-based code analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Automated Software Engineering