Abstract

Although contemporary research relies to a large extent on data, data quality in Information Systems research is a subject that has not received much attention until now. In this paper, a framework is presented for the measurement of scientific data quality using the principles of rule-based measurement. The proposed framework is capable of handling data quality problems due to both incorrect execution and incorrect description of data collection and validation processes. It is then argued that uncertainty can arise during the measurement, which complicates data quality assessment. The framework is therefore extended to handle uncertainty about the truth value of predicates. Instead of a numerical quality level, data quality is then expressed as either a probability distribution or a possibility distribution over the ordinal quality scale. Finally, it is also shown how quality thresholds can be formulated based on the results of the quality measurement. The usefulness of the proposed framework is illustrated throughout the paper with an example of the construction of a possible survey data quality measurement system and, subsequently, the application of that system on a realistic example.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.