Abstract

Citizen Science can fulfill the quest for high-quality and sufficient environmental data, such as rainfall. However, the factors affecting the quality of rainfall data collected by the citizen scientists are not well understood. In this study, we examined the effect of citizen scientists’ attributes on the quality of rainfall data. For this purpose, Principal Component Analysis (PCA), stepwise regression and Multiple Linear Regressions (MLR) were used. A quality control procedure was developed and applied for daily observed rainfall data collected in the summer rainy season of 2020. Attributes of the citizen scientists' were gathered for those who collected rainfall data in the urban and peri-urban Akaki catchment which is located in the Upper Awash sub-basin, Ethiopia. We found that easy-to-detect errors, which were identified during the initial stage of quality control, formed most of the errors in the rainfall data. The PCA and the stepwise regression results revealed that four dominant attributes (education level, gauge relative location, use of smartphone app, and supervisor’s travel distance) highly affected the rainfall data quality. The MLR model using these four prominent dominant variables performed very well with R2 value of 0.98. The k-fold cross validation result showed that the developed model can be used to predict the relationships between data quality and attributes of citizen scientists with high accuracy. Hence, the PCA technique, stepwise regression and MLR model can provide useful information regarding the influence of citizen scientists’ attributes on rainfall data quality. Therefore, future studies should carefully consider citizen scientists' attributes when engaging and supervising citizen scientists, with a comprehensive data quality control while monitoring rainfall.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call