Automated pulmonary nodule detection using computed tomography scans is vital in the early diagnosis of lung cancer. Although extensive well-performed methods have been proposed for this task, they suffer from the domain shift issue between training and test images. Unsupervised domain adaptation (UDA) methods provide a promising means to mitigate the domain variance; however, their performance is still limited since no target domain supervision is introduced. To make the pulmonary nodule detection algorithm more applicable in clinical practice and further boost the performance across domains, we propose a human-in-the-loop method in a semi-supervised fashion to enhance the model generalization ability when transferred from source domain to target domain. Specifically, we first train a detector model on source domain, and then the pre-trained detector is utilized with our proposed uncertainty-guided sample selection scheme (USSS) to find a few target domain samples worth annotating most and obtain their human annotations. Finally, the annotated and the rest unlabeled target domain samples are used together to refine the pre-trained model via our proposed zoom-in and zoom-out constraint (ZZC) strategy. We evaluate our method on the Nodule Analysis 2016 (LUNA16) and TianChi datasets. Experimental results show that our method surpasses recent competitive methods on source domain and also achieves surprising performance on target domain.
Read full abstract