Abstract

Identification of the extent of heat altered coals is important for coal mining and resource estimation because alteration directly affects key economic properties such as ash content, volatile matter, specific energy, sulphur, free swell index and beneficiation characteristics. The most reliable method to identify altered coal is through geochemical analysis of drill core samples; however this method is time consuming and costly. Cheaper alternative methods to identify altered coal is by macroscopic observation of non-core drill cuttings, but this is less reliable than core. Geophysical wireline log data is also used to identify altered coal, but with variable success. Over the past decades, Machine Learning methods have proven popular to automatically classify a variety of lithotypes from geophysical log data. However rarely has there been focus on predicting altered coal. In this paper we apply the most recent machine learning methods which include gradient boosted machines, random forests and artificial neural networks to automatically predict altered and non-altered lithotypes using geophysical log data. We use a massive data set comprising of 1230 samples from 263 boreholes from a highly intruded deposit in the Bowen Basin, Eastern Australia. To do this, we calculate each sample's distance from intrusion and predict their relative density from geophysical log inputs including gamma ray, caliper and compensated density. We then train our machine learning methods on 80% of the data to predict alteration class using the calculated distance to intrusion, predicted relative density, and geophysical log gamma ray, as primary inputs. We evaluated each of these machine learning methods on the remaining 20% of the data to determine the best performing model. Finally, using the best performing model we further split the altered and non-altered classification into lithotypes using a decision tree based on geological knowledge of the case study area. The results indicate that of the machine learning algorithms the random forest produced the best results with only 11 misclassifications across the entire data set of 1230 samples which represents <1% error.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call