Abstract. Our understanding and predictive capability of streamflow processes largely rely on high-quality datasets that depict a river's upstream basin characteristics. Recent proliferation of large sample hydrology (LSH) datasets has promoted model parameter estimation and data-driven analyses of hydrological processes worldwide, yet existing LSH is still insufficient in terms of sample coverage, uncertainty estimates, and dynamic descriptions of anthropogenic activities. To bridge the gap, we contribute the synthesis of Global Streamflow characteristics, Hydrometeorology, and catchment Attributes (GSHA) to complement existing LSH datasets, which covers 21 568 watersheds from 13 agencies for as long as 43 years based on discharge observations scraped from the internet. In addition to annual and monthly streamflow indices, each basin's daily meteorological variables (i.e., precipitation, 2 m air temperature, longwave/shortwave radiation, wind speed, actual and potential evapotranspiration), daily–weekly water storage terms (i.e., snow water equivalence, soil moisture, groundwater percentage), and yearly dynamic descriptors of the land surface characteristics (i.e., urban/cropland/forest fractions, leaf area index, reservoir storage and degree of regulation) are also provided by combining openly available remote sensing and reanalysis datasets. The uncertainties in all meteorological variables are estimated with independent data sources. Our analyses reveal the following insights: (i) the meteorological data uncertainties vary across variables and geographical regions, and the revealed pattern should be accounted for by LSH users; (ii) ∼6 % watersheds shifted between human-managed and natural states during 2001–2015, e.g., basins with environmental recovery projects in northeast China, which may be useful for hydrologic analysis that takes the changing land surface characteristics into account; and (iii) GSHA watersheds showed a more widespread declining trend in runoff coefficient than an increasing trend, pointing towards critical water availability issues. Overall, GSHA is expected to serve hydrological model parameter estimation and data-driven analyses as it continues to improve. GSHA v1.1 can be accessed at https://doi.org/10.5281/zenodo.8090704 and https://doi.org/10.5281/zenodo.10433905 (Yin et al., 2023a, b).
Read full abstract