Abstract
In the past, several <i>software reliability growth models</i> (SRGMs) have been proposed to predict various aspects of software reliability, and non-homogeneous Poisson process (NHPP)-based SRGMs have drawn much attention. However, it should be noted that, in practice, blindly applying SRGMs might not lead to meaningful results when the characteristics of the data (i.e., data type) do not fit the characteristics of selected SRGMs. Moreover, there currently exist no guidelines for project managers or developers to follow in selecting the most appropriate SRGMs for a particular failure dataset. On the other hand, waiting to collect a substantial amount of failure data before fitting SRGMs might be unfeasible in many cases. In this paper, we propose useing statistical data analysis methods to investigate the characteristics of software failure data and to improve the accuracy of software reliability growth modeling. Here, SRGMs will be broadly partitioned into two categories: “NHPP models” and “non-NHPP models.” Similarly, software failure data can be classified into two categories: “NHPP-type data” and “non-NHPP-type data.” We propose a two-phase method to verify the hypothesis that failure data follow an NHPP. Through the proposed two-phase and Laplace trend tests, we will be able to infer whether a dataset is the NHPP type and whether it is properly used by NHPP-based SRGMs. In this case, the final choice for the most suitable NHPP model for the software failure process would be made, and the software reliability prediction result can be deemed trustworthy and helpful after correctly judging whether the dataset can be classified or not as being NHPP based. Experiments based on 25 real software failure data were presented and analyzed. Using our proposed method, project managers and/or developers will be able to analyze and use collected failure data to help improve both planning accuracy and software quality.
Highlights
In the modern society, software plays a very important role in many of society's institutions and organizations
In the first phase that tests whole data seen as one interval, 4% (1/25) of the datasets failed the both two tests, 8% (2/25) of the datasets failed the test for Poisson distribution but passed the test for serial independence, 60% (15/25) of the datasets failed the test for serial independence but passed the test for Poisson distribution, and 28% (7/25) of the datasets passed the test
For the 28% of the datasets that passed the test in the first phase, we can say that these datasets have satisfied the characteristics of NHPP, because we serve the whole datasets as an interval, we cannot quickly conclude that the rest of the datasets do not follow the NHPP properties
Summary
Software plays a very important role in many of society's institutions and organizations. We would like to deeply investigate the possibility that before we estimate the reliability by using SRGM(s), we are willing to accept falsely saying that the collected failure data set is the NHPP-type when it is not. Through the proposed two-phase procedure and the Laplace trend test, we will be able to infer whether a dataset is the NHPP-type and whether it is properly used by the NHPPbased SRGMs. That is, the final decision of choosing the most suitable NHPP model(s) to model the software failure process would be made after we can correctly judge whether the selected dataset is said to be absolutely NHPP-type or not.
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have