Abstract

Background: The accuracy and consistency of bone age assessments (BAA) using standard methods can vary with physicians' level of experience.

Methods: To assess the impact of information from an artificial intelligence (AI) deep learning convolutional neural network (CNN) model on BAA, specialists with different levels of experience (junior, mid-level, and senior) assessed radiographs from 316 children aged 4–18 years that had been randomly divided into two equal sets, group A and group B. Bone age (BA) was assessed independently by each specialist without additional information (group A) and with information from the model (group B). With the mean assessment of four experts as the reference standard, mean absolute error (MAE) and intraclass correlation coefficient (ICC) were calculated to evaluate accuracy and consistency. Individual assessments of 13 bones (radius, ulna, and short bones) were also compared between group A and group B with the rank-sum test.

Results: The accuracies of senior, mid-level, and junior physicians were significantly better (all P < 0.001) with AI assistance (MAEs 0.325, 0.344, and 0.370, respectively) than without AI assistance (MAEs 0.403, 0.469, and 0.755, respectively). Moreover, for senior, mid-level, and junior physicians, consistency was significantly higher (all P < 0.001) with AI assistance (ICCs 0.996, 0.996, and 0.992, respectively) than without AI assistance (ICCs 0.987, 0.989, and 0.941, respectively). For all levels of experience, accuracy with AI assistance was significantly better than accuracy without AI assistance for assessments of the first and fifth proximal phalanges.

Conclusions: Information from an AI model improves both the accuracy and the consistency of bone age assessments for physicians of all levels of experience. The first and fifth proximal phalanges are difficult to assess and warrant closer attention.
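As a rough illustration of the accuracy measure described in the Methods, the sketch below computes an MAE against a reference standard formed by averaging four expert readings per radiograph. It is a minimal sketch, not the authors' code: the function names and all numeric values are hypothetical, and bone ages are assumed to be expressed in years. Group A and group B are treated as separate sets of radiographs, as in the study design.

```python
from statistics import mean


def reference_bone_age(expert_readings):
    """Average the expert readings for each radiograph (the reference standard)."""
    return [mean(per_image) for per_image in expert_readings]


def mean_absolute_error(readings, reference):
    """Mean absolute difference between one reader's assessments and the reference."""
    if len(readings) != len(reference):
        raise ValueError("readings and reference must have the same length")
    return mean(abs(r - ref) for r, ref in zip(readings, reference))


# Hypothetical data: each inner list holds four expert readings for one radiograph.
group_a_experts = [[10.0, 10.5, 10.0, 10.5], [7.0, 7.5, 7.0, 7.5]]
group_b_experts = [[13.0, 13.0, 13.5, 13.5], [5.0, 5.0, 5.5, 5.5]]

# Hypothetical readings from one physician: unaided (group A), AI-assisted (group B).
group_a_readings = [11.0, 6.5]
group_b_readings = [13.5, 5.0]

mae_a = mean_absolute_error(group_a_readings, reference_bone_age(group_a_experts))
mae_b = mean_absolute_error(group_b_readings, reference_bone_age(group_b_experts))

print(f"MAE without AI (group A): {mae_a:.3f}")
print(f"MAE with AI (group B):    {mae_b:.3f}")
```

A lower MAE means the physician's assessments sit closer to the expert reference. The ICC and the rank-sum comparison reported in the abstract are not reproduced here, since the abstract does not specify which ICC form or software was used.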

Highlights

  • The accuracy and consistency of bone age assessments (BAA) using standard methods can vary with physicians’ level of experience

  • Bone age assessments of the first and fifth proximal phalanges significantly improved with AI assistance for senior, mid-level, and junior specialists

  • For mid-level specialists, in addition to those for the first and fifth proximal phalanges, mean absolute errors (MAEs) for the radius, the ulna, the third proximal phalanx, and the first distal phalanx were significantly lower with AI assistance than without it


