The prediction of favorable zone (FZ) is the most important step for conventional hydrocarbon accumulations (CHAs) exploration. Recently, the method of coupling multiple hydrocarbon accumulation (HA) elements is widely used to predict the distribution of FZ in the petroleum exploration field. However, the forming mechanism of CHAs is extremely complicated, which causes difficulty in accurately describing the relationship between multiple HA elements and HA probability (HAP). Hence, it is difficult to predict the distribution of FZ quantitatively and credibly using traditional methods. This study proposes a method for predicting FZ for CHAs based on random undersampling boosted (RUSBoosted) tree machine learning (ML) algorithm. First, the characteristics of data in the petroleum exploration field are clarified, and a suitable ML algorithm is selected. Second, the theory and knowledge of the petroleum exploration field is integrated into the data, a HAP prediction model for CHAs is constructed, and then the method for FZ prediction is proposed. Further, the method is applied to Jin 93 Well Block for predicting FZ of CHAs. Finally, this study discussed the difference in performance among models constructed by the RUSBoosted tree and other five ML algorithms and the difference in training results between the original geological data and preprocessed geological data on the RUSBoosted tree ML algorithm. Results show that, currently, datasets in the petroleum exploration field are small and unbalanced, and the RUSBoosted tree ML algorithm has excellent training results on it. Compared with the original geological data, the performance of the HAP prediction model constructed by preprocessed geological data is improved. On a Jin 93 Well Block dataset, the HAP prediction model constructed by the RUSBoosted tree ML algorithm belongs to a good prediction model, and FZ of CHAs predicted by this HAP prediction model agree well with CHAs discovered areas. The results of this study provide an idea for intelligently predicting the distribution of FZ of CHAs and are of great significance to the development of intelligent petroleum exploration technology.
Read full abstract