Abstract

Data cleaning is an essential part of any research work without which the validity and reliability of the data could come under the spotlight. Aim: to document common errors found during the cleaning of datasets and suggests ways of minimizing errors during data entry process, reducing human errors throughout data cleaning. 
 
 Design and Setting: a case study based on the national Reproductive Health Assessment (RHA) data conducted in Solomon Islands in 2013. 
 
 Objective: The main objective of the Solomon Islands RHA was to establish the health status of reproductive aged women between the ages of 15 – 49 for the Solomon Islands. 
 
 Method: data was collected using questionnaires and entered on to the SPSS database in the country by the local Solomon Islands research assistants who were trained by the Pacific Sexual and Reproductive Health Research Center (PSRHRC). The data was brought back to Fiji where the cleaning process took place. 
 
 Results: findings of this case study showed that there were issues with the standardization of databases, database familiarization and data merging. 
 
 Conclusion: more training is needed for researchers who are involved in data collection, data entry and data cleaning to minimize such errors which could give results which may not be a true representation of the indented study.

Highlights

  • Data collection and data analysis have been carried out with obligatory importance for most research work

  • More training is needed for researchers who are involved in data collection, data entry and data cleaning to minimize such errors which could give results which may not be a true representation of the indented study

  • This paper aims to document errors found during the cleaning of datasets and suggests ways of minimizing errors during data entry process, reducing human errors throughout data cleaning

Read more

Summary

Objective

The main objective of the Solomon Islands RHA was to establish the health status of reproductive aged women between the ages of 15 – 49 for the Solomon Islands. Method: Data was collected using questionnaires and entered on to the SPSS database in the country by the local Solomon Islands research assistants who were trained by the Pacific Sexual and Reproductive Health Research Center (PSRHRC). The data was brought back to Fiji where the cleaning process took place

Conclusion
Introduction
A Brief Introduction of the SIRHA Study
The Need for Data Cleaning
Standardization of Data
Standardization of Database Design
Standardization in Data Entry
Database Familiarization
Merging Data Sets

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.