Data Conditioning and Forecasting Methodology using Machine Learning on Production Data for a Well Pad

Maryam Bagheri,Peggy Lindner,Giulia Toti,Li Huang,Manyang Sun,Srinath Madasu,Haoran Zhao

doi:10.4043/30854-ms

Abstract

Abstract A new machine learning (ML)/statistical-based methodology for conditioning and predicting production data for a well pad has been developed. Typically, data conditioning involves outlier detection, missing data, data imputation, and smoothing. Time-series production data prediction can be challenging because the target (wellbore oil production) depends on large-scale, high-dimensional data sets with unknown distributions and is influenced by missing data and outliers. Hence, data conditioning is key for accurate predictions. The current work is the first attempt at using ensemble ML and statistical techniques, such as multilayer perceptron (MLP), principal component analysis (PCA), and support vector regression (SVR), for well pad data conditioning using recently disclosed subsurface and production data from a field in the southern area of the Norwegian North Sea. The time-series forecasting based on large-scale, high-dimensional conditioned and cleaned data sets is also presented. The data with an oil production rate greater than 10 Sm3 have been retained for data cleansing, which reduced the size of the production well data set by 14.9%. Outliers are detected using the z-score method. The missing values are predicted using a trained ML model on all available nonmissing data. The procedure first predicts the downhole missing values from all the wells, including the available neighboring wells, and then uses these features to predict other missing values for the well pad. In this paper, the two approaches implemented and compared for prediction of missing data are MLP and SVR, and PCA is performed to extract the most important data features. Production data with 12 related variables (i.e., dates, hours, temperature, pressure, etc.) are used to explore the complex nonlinearity of features and estimate wellbore oil production with ML and deep-learning models. Conventional SVR and MLP methods are implemented as the benchmark. During this work, more than 60% of the missing and abnormal data from the field data set are detected and imputed using advanced ML methods, such as MLP and SVR with radial basis function kernel. More than 6% of data are outliers and are removed using the z-score method. The modified SVR with time-series data structure and long short-term memory (LSTM) algorithms are used for the comparisons. An R-squared (R2) of 98% is achieved for both the algorithms; however, LSTM has the lowest root mean square error (RMSE) results compared to SVR. Data conditioning is conventionally performed using statistical techniques, but here, an ensemble of ML techniques is used depending on the available data. This paper presents a new methodology to perform data conditioning and production prediction for a well pad using ML and neighboring well data. The ML algorithms used are highly efficient, as demonstrated by the results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Conditioning and Forecasting Methodology using Machine Learning on Production Data for a Well Pad

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Machine learning approaches for formation matrix volume prediction from well logs: Insights and lessons learned
Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
Geoenergy Science and Engineering | VOL. 229
Pamidi Venkata Durga Kannaiah, et. al.Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
08 Jul 2023
Geoenergy Science and Engineering | VOL. 229

Novel Machine Learning (ML) models for predicting the performance of multi-metal binding green adsorbent for the removal of Cd (II), Cu (II), Pb (II) and Zn (II) ions
Rameez Ahmad Aftab ... Khursheed B Ansari
Environmental Advances | VOL. 9
Rameez Ahmad Aftab, et. al.Rameez Ahmad Aftab ... Khursheed B Ansari
01 Oct 2022
Environmental Advances | VOL. 9

Embedding machine learning techniques into a conceptual model to improve monthly runoff simulation: A nested hybrid rainfall-runoff modeling
Umut Okkan ... Okan Fistikoglu
Journal of Hydrology | VOL. 598
Umut Okkan, et. al.Umut Okkan ... Okan Fistikoglu
07 May 2021
Journal of Hydrology | VOL. 598

A machine learning approach for spatiotemporal imputation of MODIS chlorophyll-a
Hamid Mohebzadeh ... Asim Biswas
International Journal of Remote Sensing | VOL. 42
Hamid Mohebzadeh, et. al.Hamid Mohebzadeh ... Asim Biswas
12 Sep 2021
International Journal of Remote Sensing | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Conditioning and Forecasting Methodology using Machine Learning on Production Data for a Well Pad

Abstract

Talk to us

Similar Papers