Abstract
The rate of penetration (ROP) is a key indicator of drilling efficiency. Many researchers have explored the application of machine learning in ROP prediction. However, few studies have focused on the robustness of the constructed models, and developing a ROP prediction model that can achieve both high accuracy and strong robustness remains a challenge. This paper introduces a novel machine learning approach to constructing a ROP prediction model through ensemble learning algorithms. The model is based on field data from oilfields, incorporating 16 input parameters that influence ROP. First, the feasibility of the collected dataset is verified using correlation analysis. Then, ROP prediction models are developed based on various machine learning algorithms, including Decision Tree Regression (DTR), Random Forest (RF), eXtreme Gradient Boosting (XGB), Light Gradient Boosting Machine (LGBM), Support Vector Regression (SVR), and Backpropagation Neural Network (BPNN). By comparing the performance of these models under noise levels of 0%, 1.7%, and 5.1%, RF, LGBM, XGB, and SVR are selected as base learners. These base learners are then combined to construct multiple ensemble models, and the performance of the optimal ensemble model is evaluated under varying noise levels. The results show that the prediction error of the optimal model remains within 10%, and R2 is greater than 0.96. Finally, the Shapley Additive Explanations (SHAP) method is applied to perform interpretability analysis on the optimal ROP prediction model, examining the impact of different input factors on the model's predictive performance. Compared to single models and other ensemble models, the proposed ensemble model not only achieves higher accuracy but also demonstrates strong robustness and generalization capability.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.