The purpose of this study was to develop a robust deep learning approach trained with a small in-vivo MRI dataset for multi-label segmentation of all eight carpal bones for therapy planning and wrist dynamic analysis.
Approach: A small dataset of 15 3.0-T MRI scans from five health subjects was employed within this study. The MRI data was variable with respect to the Field Of View (FOV), wide range of image intensity, and joint pose. A two-stage segmentation pipeline using modified 3D U-Net was proposed. In the first stage, a novel architecture, introduced as Expansion Transfer Learning (ETL), cascades the use of a focused Region Of Interest (ROI) cropped around ground truth for pretraining and a subsequent transfer by an expansion to the original FOV for a primary prediction. The bounding box around the ROI generated was utilized in the second stage for high-accuracy,
labeled segmentations of eight carpal bones. Different metrics including Dice Similarity Coefficient (DSC), Average Surface Distance (ASD) and Hausdorff Distance (HD) were used to evaluate performance between proposed and four state-of-the-art approaches.
Main results: With an average DSC of 87.8 %, an ASD of 0.46 mm, an average HD of 2.42mm in all datasets (96.1 %, 0.16 mm, 0.38mm in 12 datasets after exclusion criteria, respectively), the proposed approach showed an overall strongest performance than comparisons.
Significance: To our best knowledge, this is the first CNN-based multi-label segmentation approach for MRI human carpal bones. The ETL introduced in this work improved the ability to localize a small ROI in a large FOV. Overall, the interplay of a two-stage approach and ETL culminated in convincingly accurate segmentation scores despite a very small amount of image data.
Read full abstract