ABSTRACT Oceanic barrier layer thickness (BLT) has been estimated from sea surface parameters through four machine learning (ML) models. The BLT used for training and testing of ML models has been computed from 19 years (2005–2023) of Argo data of vertical temperature and salinity profile in the north Indian ocean. It is observed that thick barrier layer (BL) exists in three regions within the north Indian ocean: the Bay of Bengal (BoB), Eastern Arabian Sea (EAS) and the East Equatorial Indian Ocean (EEIO) off the Sumatra. The observed BLT ranges from 0 to 80 m in the north Indian ocean with a seasonal cycle. Four ML models, i.e. multiple linear regression, artificial neural network (ANN), Random Forest and XGBoost have been trained for 2010–2018 to learn the relationship between sea surface parameters (input features) and BLT (target/output variable) in three thick BL regions within the north Indian Ocean. Thereafter, the trained models have been tested for the estimation of BLT in the same regions for the time period of 2019–2023. The ML models are well able to estimate the BLT from sea surface parameters (SST, SSS) with RMSE less than 5 m in the BoB and EAS, which is less than 8 percentile of the model testing datasets. The ‘Random Forest Regressor’ turns out to perform better than the other three ML models for estimation of BLT. Further, the Random Forest estimated BLT has been compared with the BLT computed using ORAS5 reanalysis data of vertical temperature and salinity profile. This study explores the capability of the ML models for capturing the signature of a subsurface variable on the surface owing to the limitation of costly subsurface measurement.
Read full abstract