An inadequate understanding of fluid transport processes in ultra-low permeability rocks has propelled data-driven modeling as an alternative and complementary tool to create recovery forecasts that honor the available data in the field. However, newly discovered fields lack data, which makes it challenging to apply traditional data-driven approaches to decision-making. Transfer learning provides an opportunity to start the early-stage analysis of the asset before adequate data becomes available. Here, we present a transfer learning framework that allows the basin-wide assessment of new shale gas and tight oil prospects. The proposed method is developed on real-world data from several thousand horizontal hydraulically fractured wells in the Eagle Ford super-basin in South Texas. A novel aspect of our method is integrating reservoir engineering domain expertise and the relationship among geologic and petrophysical variables, drilling and completion parameters, and well productivity in the data pre-processing and feature generation steps. We consider the temporal and spatial balancing of the training data to ensure that the resulting predictive models honor the real-world practices followed during unconventional field development. Our full-cycle transfer learning workflow consists of dimensionality reduction and unsupervised clustering, supervised learning, and hyperparameter fine-tuning. We test the workflow by examining the performance of a data-driven model of the Eagle Ford Basin on potential plays in the Middle East: Tuwaiq Mountain and Hanifa. The existence of multiple types of hydrocarbons–liquid oil, condensate, and dry gas–in the Eagle Ford results in a model flexible enough to be tested on various types of assets in a new location. We address the challenge of model overfitting by using data types with a relatively low resolution, which allows generalization to different geologies with basin-wide accuracy.
Read full abstract