Abstract. We introduce CAMELS-IND (Catchment Attributes and MEteorology for Large-sample Studies – India), a dataset containing hydrometeorological time series and catchment attributes for 472 catchments in Peninsular India, of which 228 catchments have observed streamflow data available for over 30 % of the period between 1980 and 2020. Peninsular India covers 15 interstate river basins defined by the Central Water Commission (CWC), where river flow and water level datasets are available for several gauge stations through the open-source India Water Resources Information System (India-WRIS). However, many of these gauge stations lack reliable metadata, and the data are not in an analysis-ready format for large-sample hydrological studies. Therefore, we utilized 472 gauge stations and their catchment boundaries, characterized as stations with reliable metadata, from the Geospatial dataset for hydrologic analyses in India (GHI) (Goteti, 2023). For each of these catchments, CAMELS-IND provides catchment-mean time series of meteorological forcings for 41 years (1980–2020) and 211 catchment attributes representing hydroclimatic and land cover characteristics extracted from multiple data sources (including ground-based observations, remote sensing-based products, and reanalysis datasets). CAMELS-IND follows the same standards as the previously developed CAMELS datasets for the USA, Chile, Brazil, Great Britain, Australia, Switzerland, and Germany to facilitate comparisons with catchments of those countries and inclusion in global hydrological studies. 
Notably, CAMELS-IND includes available observed streamflow and catchment-mean time series of 19 meteorological forcings, including precipitation; maximum, minimum, and average temperature; long-wave and short-wave radiation flux; U and V components of wind; relative humidity; evaporation rates from the canopy and soil surface; actual and potential evapotranspiration; and soil moisture in four layers (covering depths up to 3 m below ground) for detailed hydrometeorological studies. We also derived catchment attributes representing human influences, including the number of dams and their utilization, total volume contents of dams in catchments, population density, and increases in urban and agricultural land cover, to facilitate studies of human influences on catchment hydrology. Furthermore, the dataset includes predicted streamflow time series from a regionally trained long short-term memory (LSTM)-based hydrological model for all 472 catchments, which can fill gaps in observed streamflow data or serve as a benchmark for testing and developing new hydrological models. We envision that CAMELS-IND will provide a strong foundation for a community-led effort toward gaining new hydrological insights from hydrologically distinct Indian catchments and solving pertinent issues related to water management, quantification and risk assessment of hydrologic extremes, unraveling regional-scale hydrologic functioning, and climate change impact assessment of catchments across India. The CAMELS-IND dataset is available at https://doi.org/10.5281/zenodo.14005378 (Mangukiya et al., 2024).