Abstract

Context: Automated classifiers, often based on machine learning (ML), are increasingly used in software engineering (SE) for labelling previously unseen SE data. Researchers have proposed automated classifiers that predict whether a code chunk is a clone, whether a requirement is functional or non-functional, whether the outcome of a test case is non-deterministic, etc.

Objective: The lack of guidelines for applying and reporting classification techniques in SE research leads to studies in which important research steps may be skipped, key findings may go unidentified and unshared, and readers may encounter reported results (e.g., precision or recall above 90%) that do not credibly represent performance in operational contexts. The goal of this paper is to advance ML4SE research by proposing rigorous ways of conducting and reporting classification research.

Results: We introduce the ECSER (Evaluating Classifiers in Software Engineering Research) pipeline, which comprises a series of steps for conducting and evaluating automated classification research in SE. We then conduct two replication studies in which we apply ECSER to recent research in requirements engineering and in software testing.

Conclusions: In addition to demonstrating the applicability of the pipeline, the replication studies demonstrate ECSER’s usefulness: not only do we confirm and strengthen some findings identified by the original authors, but we also discover additional ones. Some of these findings contradict the original ones.
