Abstract

Use of machine learning (ML) for automated annotation of heart structures from echocardiographic videos is an active research area, but understanding of comparative, generalizable performance among models is lacking. This study aimed to (1) assess the generalizability of five state-of-the-art ML-based echocardiography segmentation models within a large Geisinger clinical dataset, and (2) test the hypothesis that a quality control (QC) method based on segmentation uncertainty can further improve segmentation results. Five models were applied to 47,431 echocardiography studies that were independent from any training samples. Chamber volume and mass from model segmentations were compared to clinically-reported values. The median absolute errors (MAE) in left ventricular (LV) volumes and ejection fraction exhibited by all five models were comparable to reported inter-observer errors (IOE). MAE for left atrial volume and LV mass were similarly favorable to respective IOE for models trained for those tasks. A single model consistently exhibited the lowest MAE in all five clinically-reported measures. We leveraged the tenfold cross-validation training scheme of this best-performing model to quantify segmentation uncertainty. We observed that removing segmentations with high uncertainty from 14 to 71% studies reduced volume/mass MAE by 6-10%. The addition of convexity filters improved specificity, efficiently removing < 10% studies with large MAE (16-40%). In conclusion, five previously published echocardiography segmentation models generalized to a large, independent clinical dataset-segmenting one or multiple cardiac structures with overall accuracy comparable to manual analyses-with variable performance. Convexity-reinforced uncertainty QC efficiently improved segmentation performance and may further facilitate the translation of such models.

Highlights

  • Structural segmentation is an important step for interpreting 2D echocardiography, which is highly timeconsuming and subject to significant inter- and intra-observer variability [1]–[6]

  • Among the three models that were able to segment the left ventricular (LV) epicardium and estimate LV wall mass (LVM), only the 2D and 3D models developed by Stough et al were associated with median absolute errors (MAE) that were below the reported inter-observer errors (IOE) [6], with the 2D model outperforming the 3D model (Fig. 1)

  • As for LA volume (LAV), the MAE associated with three of the four models that had the capability of segmenting left atrium (LA) endocardium were within one SD of the reported IOE [5] (Fig. 1)

Read more

Summary

Introduction

Structural segmentation is an important step for interpreting 2D echocardiography, which is highly timeconsuming and subject to significant inter- and intra-observer variability [1]–[6]. In 2019, Leclerc et al published the CAMUS dataset for which LV endo- and epicardium, and LA endocardium were manually segmented [21] This dataset greatly facilitated the development and improvement of multi-structural echocardiography segmentation models [19], [22]–[26], such as those trained with adversarial [19] or motion-segmentation co-learning strategy [25], [26]. The performance of these multi-structural segmentation models within a large, independent, clinically-acquired echocardiography dataset remains unknown None of these models has been tested with automated quality control (QC) methods

Objectives
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.