Abstract

Cross-domain lithology identification (CDLI) is a common case in lithology identification, which aims to train a machine learning model using the logging data of an interpreted well to predict the lithology of another uninterpreted well. Compared with the general lithology identification problem, the CDLI problem is more challenging for two reasons: the data distribution shift between the wells, and the expensive label acquisition on the uninterpreted well. To tackle these issues, we propose a novel framework that embeds active learning (AL) and domain adaptation into lithology identification. The proposed framework is composed of two components: an AL algorithm that selects the most uncertain and diverse target samples to query their real labels, and a source reweighting method that leverages the target labels to reduce data distribution discrepancy. Experimental results on two real-world data sets demonstrate that the proposed method can more effectively suppress the performance degradation caused by the data distribution shift than the baselines, with fewer target label queries.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call