Comparison of clinical geneticist and computer visual attention in assessing genetic conditions.

Dat Duong,Tzung-Chien Hsieh,Ping Hu,Ömer Sümer,Behnam Javanmardi,Benjamin D Solomon,Kendall Flaharty,Rebekah L Waikel,Suzanna Ledgister Hanchard,Fabio Hellmann,Christopher Fortney,Hellen Lesmann,Peter Krawitz,Shahida Moosa,Susan Persky,Anna Rose Johny,Cedrik Tekendo-Ngongang,Elisabeth André,Tanviben Patel

doi:10.1371/journal.pgen.1011168

Abstract

Artificial intelligence (AI) for facial diagnostics is increasingly used in the genetics clinic to evaluate patients with potential genetic conditions. Current approaches focus on one type of AI called Deep Learning (DL). While DL- based facial diagnostic platforms have a high accuracy rate for many conditions, less is understood about how this technology assesses and classifies (categorizes) images, and how this compares to humans. To compare human and computer attention, we performed eye-tracking analyses of geneticist clinicians (n = 22) and non-clinicians (n = 22) who viewed images of people with 10 different genetic conditions, as well as images of unaffected individuals. We calculated the Intersection-over-Union (IoU) and Kullback-Leibler divergence (KL) to compare the visual attentions of the two participant groups, and then the clinician group against the saliency maps of our deep learning classifier. We found that human visual attention differs greatly from DL model's saliency results. Averaging over all the test images, IoU and KL metric for the successful (accurate) clinician visual attentions versus the saliency maps were 0.15 and 11.15, respectively. Individuals also tend to have a specific pattern of image inspection, and clinicians demonstrate different visual attention patterns than non-clinicians (IoU and KL of clinicians versus non-clinicians were 0.47 and 2.73, respectively). This study shows that humans (at different levels of expertise) and a computer vision model examine images differently. Understanding these differences can improve the design and use of AI tools, and lead to more meaningful interactions between clinicians and AI technologies.

Full Text