Comparison of machine learning approaches used to identify the drivers of Bakken oil well productivity

Emil D Attanasi,Philip A Freeman,Timothy C Coburn

doi:10.1002/sam.11487

Abstract

AbstractGeologists and petroleum engineers have struggled to identify the mechanisms that drive productivity in horizontal hydraulically fractured oil wells. The machine learning algorithms of Random Forest (RF), gradient boosting trees (GBT) and extreme gradient boosting (XGBoost) were applied to a dataset containing 7311 horizontal hydraulically fractured wells drilled into the middle member of the Bakken Formation from 2010 through 2017. The initial goal is to use these data‐driven machine learning algorithms to identify the most important explanatory predictors of well productivity within nine subareas and the composite area. Predictor variables representing initial gas production, the initial 180‐day water cut, and vertical depth vary spatially and are identified with geologically favorable areas. Well‐completion predictors include the well lateral length, number of fracture stages, volume of proppant per stage, and the volume of injected fluids per stage. The performance of methods is compared based on a common test sample. The analysis then examines the comparative predictive performance of the three algorithms for 1330 wells that had initiated production after the initial 7311 well sample had been producing. The computations of predictor importance identified the initial 180‐day water cut and the 30‐day initial gas production predictors as having a dominant influence in most subareas and for the composite area. The relative importance of well completion predictor variables, that is, the number of fracture stages per well, volume of injected proppant per stage, volume of injected fluids per stage, and lateral length, varied considerably across the subareas. For the common test or holdout sample, the models calibrated with the XGBoost algorithm had superior predictive power. The predictive power of all the algorithms trained on the data from the original sample suffered some loss when tested with a sample of wells that had started production after the end of that period. Implications of the empirical findings and strategies to mitigate loss of predictive power are discussed in the concluding section.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Statistical Analysis and Data Mining: The ASA Data Science Journal	Publication Date: Nov 20, 2020
Citations: 2	License type: publisher-specific, author manuscript

R Discovery Prime

R Discovery Prime

Comparison of machine learning approaches used to identify the drivers of Bakken oil well productivity

Abstract

Talk to us

Similar Papers

More From: Statistical Analysis and Data Mining: The ASA Data Science Journal

Lead the way for us

Similar Papers

Well predictive performance of play-wide and Subarea Random Forest models for Bakken productivity
Emil D Attanasi ... Timothy C Coburn
Journal of Petroleum Science and Engineering | VOL. 191
Emil D Attanasi, et. al.Emil D Attanasi ... Timothy C Coburn
25 Mar 2020
Journal of Petroleum Science and Engineering | VOL. 191

Oil and Gas Developments in West Virginia in 1964

AAPG Bulletin | VOL. 49

01 Jan 1964
Oil and Gas Developments in West Virginia in 1964

A data-driven model for predicting initial productivity of offshore directional well based on the physical constrained eXtreme gradient boosting (XGBoost) trees
Yintao Dong ... Guanzhong Chen
Journal of Petroleum Science and Engineering | VOL. 211
Yintao Dong, et. al.Yintao Dong ... Guanzhong Chen
15 Jan 2022
Journal of Petroleum Science and Engineering | VOL. 211

A Well Performance Model Based on Multivariate Analysis of Completion and Production Data from Horizontal Wells in the Montney Formation in British Columbia
B Wolters ... J Jochen
-
B Wolters, et. al.B Wolters ... J Jochen
05 Nov 2013
05 Nov 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of machine learning approaches used to identify the drivers of Bakken oil well productivity

Abstract

Talk to us

Similar Papers

More From: Statistical Analysis and Data Mining: The ASA Data Science Journal