Automatic event identification and extraction from daily drilling reports using an expert system and artificial intelligence

Lucas P Cinelli, Sergio L Netto, Jonathas O Ferreira, Rafael Padilla, Breno Galves, José F.L De Oliveira, Marcello L.R De Campos, Clemente J.C Gonçalves, Vinicius M De Pinho, Felipe L De Oliveira, Patrick F Braz, Domenica P Dalvi, Gabriela Lewenfus, Eduardo A.B Da Silva, Wesley L Passos, Anthony Y.Y Ji

doi:10.1016/j.petrol.2021.108939

Abstract

This work addresses the problem of automatically extracting events from human-written daily drilling reports (DDRs). Two distinct approaches based on an expert system and artificial intelligence techniques are proposed: rule-based language processing (RBLP) and deep neural networks (DNN). The RBLP employs regular expressions that are manually constructed, during the so-called building process, to identify the events of interest. The novelty of the present approach lies in handling multi-label classification of DDRs using RBLP and transformers, a powerful DNN architecture. The events of interest are drilling failures such as ‘bump’, ‘drag’, ‘kick’, ‘loss of circulation’, and ‘stuck pipe’. Both algorithms are developed on a training data set of 4,355 DDRs and evaluated on a test data set of 300 DDRs, all written in Brazilian Portuguese; the approaches can nevertheless be readily adapted to other languages. Average true positive rates (TPR) of 97.30% for RBLP and 85.61% for transformers-DNN were obtained, with average false negative rates (FNR) of 2.70% and 14.39%, respectively. The corresponding false positive rates (FPR) were 4.90% and 13.52%. Transformers-DNN performs better when the underrepresented classes are disregarded. In this case, the average TPR was 96.79% for RBLP and 97.32% for transformers-DNN, with average FNRs of 3.21% and 2.68%, respectively. The corresponding FPRs changed to 2.37% and 1.81%. The test results indicate that the two proposed approaches can significantly improve the efficiency of the otherwise manual annotation process, which is typically error-prone and very time-consuming.
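To make the rule-based idea and the reported metrics concrete, the following is a minimal sketch of regex-driven multi-label tagging together with per-class TPR, FNR, and FPR. It is not the authors' implementation: the patterns in `EVENT_PATTERNS`, the English example sentences, and the helper names `label_report` and `rates` are all assumptions made for illustration (the actual rules were built for reports in Brazilian Portuguese).

```python
import re

# Hypothetical patterns, for illustration only; the paper's actual rules
# are hand-built for Brazilian Portuguese reports and are not shown here.
EVENT_PATTERNS = {
    "kick": re.compile(r"\bkick\b", re.IGNORECASE),
    "stuck pipe": re.compile(r"\b(stuck\s+pipe|pipe\s+stuck)\b", re.IGNORECASE),
    "loss of circulation": re.compile(
        r"\b(loss(es)?\s+of\s+circulation|lost\s+circulation)\b", re.IGNORECASE
    ),
}


def label_report(text):
    """Return the set of event labels whose pattern matches the report.

    A report may receive zero, one, or several labels (multi-label).
    """
    return {
        label
        for label, pattern in EVENT_PATTERNS.items()
        if pattern.search(text)
    }


def rates(y_true, y_pred, label):
    """Compute (TPR, FNR, FPR) for one label over paired label sets."""
    tp = fn = fp = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if label in truth:
            if label in pred:
                tp += 1
            else:
                fn += 1
        else:
            if label in pred:
                fp += 1
            else:
                tn += 1
    # Guard against classes with no positive (or no negative) examples.
    tpr = tp / (tp + fn) if (tp + fn) else float("nan")
    fpr = fp / (fp + tn) if (fp + tn) else float("nan")
    # FNR is the complement of TPR by definition.
    return tpr, 1 - tpr, fpr
```

Because FNR = 1 - TPR for each class, the TPR and FNR values quoted in the abstract sum to 100% (for example, 97.30% + 2.70%), whereas FPR is computed over the reports that do not contain the event and so varies independently.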

Full Text