Abstract

Despite the increasing use of lung ultrasound (LUS) in the evaluation of respiratory disease, operators' competence constrains its effectiveness. We developed a deep-learning (DL) model for multi-label classification using LUS and validated its performance and efficacy on inter-reader variability. We retrospectively collected LUS and labeled as normal, B-line, consolidation, and effusion from patients undergoing thoracentesis at a tertiary institution between January 2018 and January 2022. The development and internal testing involved 7580 images fromJanuary 2018and December2020, andthe model's performance was validated on a temporally separated test set (n = 985 imagescollected after January 2021) and two external test sets (n = 319 and 54 images). Two radiologists interpreted LUS with and without DL assistance and compared diagnostic performance and agreement. The model demonstrated robust performancewith AUCs: 0.93 (95% CI 0.92-0.94) for normal, 0.87 (95% CI 0.84-0.89) for B-line, 0.82 (95% CI 0.78-0.86) for consolidation, and 0.94 (95% CI 0.93-0.95) for effusion.The model improved reader accuracy for binary discrimination (normal vs. abnormal; reader 1: 87.5-95.6%, p = 0.004;reader 2: 95.0-97.5%, p = 0.19), and agreement(k = 0.73-0.83, p = 0.01). In conclusion,the DL-based model may assist interpretation, improving accuracy and overcoming operator competence limitationsin LUS.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.