The shear wave velocity is among the key parameters that are responsible for damage caused during an earthquake. Determining shear wave velocity is a costly and time-consuming direct geophysical method. In the present paper, a machine learning model has been developed to obtain the subsurface shear wave velocity profile without using direct methods. The bore log information and the subsurface shear wave velocity profile available at various stations of Japan’s Kyoshin network (K-NET) have been utilized for training this machine learning model. The parametric correlation study indicates that simple parameters like rock/soil type, the thickness of the layer in the model, and standard penetration test (SPT-N) value directly correlate with the medium’s shear wave velocity. A stacked ensemble machine learning model named VelProfES (an acronym for Velocity Profiler using Ensemble machine learning models) has been developed in this paper and has been utilized for effective prediction of the shear wave velocity profile using basic information from borelog data. The dataset used in the training and testing of the machine learning model consists of borelog and shear wave velocity information from 1101 stations. Of 1101 stations, 657, 283, and 71 stations have been utilized for training, testing, and validating the machine learning model. Training, testing, and validation of the developed machine learning model consist of parameters from 12351, 5279, and 1388 velocity layers. The problem of data imbalance based on soil type has been addressed using an additional 10510 layers of synthetic borelog data generated from conditional generative adversarial networks (CTGANs). A feature and model ablation study was conducted for the VelProfES model to provide substantiation for the model and feature selection choices. The predicted shear wave velocity profiles were compared at specific stations, focusing on average velocities at 5, 10, 15, and 20 depths. Further, the predicted values have been compared with the empirical relation of Sil and Haloi (2017) and a trained polynomial model. The machine learning model demonstrates close alignment between predicted and actual values across a broad spectrum of velocities, a contrast not observed in the empirical relation and polynomial model. The results show that the machine learning models and augmented data generated using CTGANs can efficiently minimize the error between actual and predicted subsurface shear wave velocity values.
Read full abstract