Geochemical logs are an essential tool for hydrocarbon reservoir characterization. The rock composition given by these logs is useful for porosity calculation, stratigraphic modeling, diagenesis estimation, and well acidification. In reduced logging operations performed in carbonate reservoirs of the Brazilian pre-salt, the geochemical tool is no longer acquired, aiming at optimization and cost reduction. This research aims the development of synthetic geochemical logs using machine learning algorithms. The database includes 22 wells with complete logging acquisition, and the input logs are natural gamma-ray, gamma-ray spectroscopy, density, photoelectric factor, neutron porosity, nuclear magnetic resonance, and sonic. The chemical elements chosen as output are Al, Ca, Fe, Mg, Na, Si, S, and Ti. Five models based on machine learning are trained using 19 wells: Support-vector machine, Multilayer perceptron, Random Forest, AdaBoost, and XGBoost. AdaBoost represents the best algorithm because, in addition to showing the best results, it allowed for a more simplified preprocessing and hyperparameter tuning. The evaluation of the models applies R2 and root mean squared error (RMSE) in validation data and cross-validation. Robust models are acquired for Al, Ca, Fe, Mg, Si, S, and Ti, with R2 above 0.80. Na shows R2 slightly above 0.70. All the models have RMSE between 10-2 to 10-4. The most important logs during training are density, photoelectric factor, K, Th, U, and neutron porosity. Synthetic geochemical logs created for three test wells show a good agreement with acquired logs, being able to reproduce general trends of the pre-salt formations. The machine learning model is capable of substituting the acquisition of geochemical logs with high confidence, representing cost reduction, and supplying engineers and geoscientists with quality data to be used in formation evaluation.
Read full abstract