Abstract

Meteorological records, including precipitation, commonly have missing values. Accurate imputation of missing precipitation values is challenging, however, because precipitation exhibits a high degree of spatial and temporal variability. Data-driven spatial interpolation of meteorological records is an increasingly popular approach in which missing values at a target station are imputed using synchronous data from reference stations. The success of spatial interpolation depends on whether precipitation records at the target station are strongly correlated with precipitation records at reference stations. However, the need for reference stations to have complete datasets implies that stations with incomplete records, even though strongly correlated with the target station, are excluded. To address this limitation, we develop a new sequential imputation algorithm for imputing missing values in spatio-temporal daily precipitation records. We demonstrate the benefits of sequential imputation by incorporating it within a spatial interpolation based on a Random Forest technique. Results show that for reliable imputation, having a few strongly correlated references is more effective than having a larger number of weakly correlated references. Further, we observe that as the proportion of stations with incomplete records increases, there is a higher percentage of stations that benefit from sequential imputation. Overall, we present a new approach for imputing missing precipitation data which may also apply to other meteorological variables.

Highlights

  • Precipitation is an important component of the ecohydrological cycle and plays a crucial role in driving the Earth’s climate

  • We proposed a new algorithm, called the sequential imputation algorithm, for imputing missing time-series precipitation data

  • We hypothesized that stations with incomplete records contain information that can be used toward improving spatial interpolation

Read more

Summary

Introduction

Precipitation is an important component of the ecohydrological cycle and plays a crucial role in driving the Earth’s climate. It serves as an input for various ecohydrological models to determine snowpack, infiltration, surface-water flow, groundwater recharge, and transport of chemicals, sediments, nutrients, and pesticides (Devi et al, 2015). Numerical modeling of surface flow typically requires a complete time series of precipitation along with other meteorological records (e.g., temperature, relative humidity, solar radiation) as inputs for simulations (Dwivedi et al, 2017, 2018; Hubbard et al, 2018, 2020; Zachara et al, 2020). Reconstructing an incomplete daily precipitation time series is especially difficult since it exhibits a high degree of spatial and temporal variability (Simolo et al, 2010)

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call