Abstract

One of the greatest challenges in hydrology is in the accurate prediction of streamflow in ungauged basins. In recent years, machine learning models have made great strides in the ability to accurately predict these flows, but these models are extremely data intensive. However, resources for streamflow monitoring are often quite limited so it is important to gather data as efficiently as possible. In this study, we establish a relationship between the amount of data used in training and the quality of predictions in ungauged basins for Long-Short Term Memory-based neural network models. First, using the CAMELS dataset, we create numerous training sets with a different number of basins in each and corresponding testing sets that use basins outside of the training set to simulate ungauged basins. We then quantify how changing the size of the training sets and the length of the training data affects the quality of predictions in the testing sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.