Development of a Method for Clinical Evaluation of Artificial Intelligence–Based Digital Wound Assessment Tools

Raelina S Howell, Ernest S Chiu, Jon S Woods, Helen H Liu, Michael Castellano, Mayur Saxena, Scott A Gorenstein, Brian M Gillette, Patrizio Petrone, Harshit Saxena, Eric Slone, Aziz A Khan, Lawrence J Lin

doi:10.1001/jamanetworkopen.2021.7234


Open Access

https://doi.org/10.1001/jamanetworkopen.2021.7234


Abstract

Accurate assessment of wound area and percentage of granulation tissue (PGT) is important for optimizing wound care and healing outcomes. Artificial intelligence (AI)-based wound assessment tools have the potential to improve the accuracy and consistency of wound area and PGT measurement, while improving the efficiency of wound care workflows.

The objective of this study was to develop a quantitative and qualitative method to evaluate AI-based wound assessment tools compared with expert human assessments.

This diagnostic study was performed across 2 independent wound centers using deidentified wound photographs collected for routine care (site 1, 110 photographs taken between May 1 and 31, 2018; site 2, 89 photographs taken between January 1 and December 31, 2019). Digital wound photographs of patients were selected chronologically from the electronic medical records from the general population of patients visiting the wound centers. For inclusion in the study, the complete wound edge and a ruler were required to be visible; circumferential ulcers were specifically excluded.

Four wound specialists (2 per site) and an AI-based wound assessment service independently traced wound area and granulation tissue. The quantitative performance of AI tracings was evaluated by statistically comparing error measure distributions between test AI traces and reference human traces (AI vs human) with error distributions between independent traces by 2 humans (human vs human).

Quantitative outcomes included statistically significant differences in error measures of false-negative area (FNA), false-positive area (FPA), and absolute relative error (ARE) between AI vs human and human vs human comparisons of wound area and granulation tissue tracings. Six masked attending physician reviewers (3 per site) viewed randomized area tracings for AI and human annotators and qualitatively assessed them. Qualitative outcomes included a statistically significant difference in the absolute difference between AI-based PGT measurements and mean reviewer visual PGT estimates compared with PGT estimate variability measures (ie, range, standard deviation) across reviewers.

A total of 199 photographs were selected for the study across both sites; mean (SD) patient age was 64 (18) years (range, 17-95 years) and 127 (63.8%) were women. The comparisons of AI vs human with human vs human for FPA and ARE were not statistically significant. AI vs human FNA was slightly elevated compared with human vs human FNA (median [IQR], 7.7% [2.7%-21.2%] vs 5.7% [1.6%-14.9%]; P < .001), indicating that AI traces tended to slightly underestimate the human reference wound boundaries compared with human test traces. Two of 6 reviewers had a statistically significantly higher frequency of agreement that human tracings met the standard area definition, but overall agreement was moderate (352 yes responses of 583 total responses [60.4%] for AI and 793 yes responses of 1166 total responses [68.0%] for human tracings). AI PGT measurements fell in the typical range of variation in interreviewer visual PGT estimates; however, visual PGT estimates varied considerably (mean range, 34.8%; mean SD, 19.6%).

This study provides a framework for evaluating AI-based digital wound assessment tools that can be extended to automated measurements of other wound features or adapted to evaluate other AI-based digital image diagnostic tools. As AI-based wound assessment tools become more common across wound care settings, it will be important to rigorously validate their performance in helping clinicians obtain accurate wound assessments to guide clinical care.
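The abstract names the error measures (FNA, FPA, ARE) and the PGT comparison but does not give their formulas. The sketch below shows one plausible way to compute them from binary tracing masks. It assumes that FNA, FPA, and ARE are each normalized by the reference tracing's area, and that PGT is granulation area divided by wound area; these definitions, along with every function name here, are illustrative assumptions rather than the authors' published implementation.

```python
from statistics import mean, stdev
from typing import Dict, List, Tuple

Mask = List[List[int]]  # 1 = pixel inside the tracing, 0 = outside


def area(mask: Mask) -> int:
    """Count the pixels enclosed by a tracing."""
    return sum(sum(row) for row in mask)


def trace_errors(test: Mask, reference: Mask) -> Tuple[float, float, float]:
    """Return (FNA, FPA, ARE) as fractions of the reference area.

    Assumption: all three measures are normalized by the reference area;
    the abstract does not state the normalization.
    """
    ref_area = area(reference)
    if ref_area == 0:
        raise ValueError("reference tracing is empty")
    pairs = [(t, r) for t_row, r_row in zip(test, reference)
             for t, r in zip(t_row, r_row)]
    false_neg = sum(1 for t, r in pairs if r and not t)  # in reference, missed by test
    false_pos = sum(1 for t, r in pairs if t and not r)  # in test, outside reference
    are = abs(area(test) - ref_area) / ref_area
    return false_neg / ref_area, false_pos / ref_area, are


def pgt(granulation: Mask, wound: Mask) -> float:
    """Percentage of granulation tissue: granulation area over wound area."""
    return 100.0 * area(granulation) / area(wound)


def pgt_vs_reviewers(ai_pgt: float, estimates: List[float]) -> Dict[str, float]:
    """Compare an AI PGT value with the spread of reviewer visual estimates."""
    return {
        "abs_diff": abs(ai_pgt - mean(estimates)),
        "range": max(estimates) - min(estimates),
        "sd": stdev(estimates),
    }


# Toy 3x4 masks (hypothetical data, not from the study).
reference = [[0, 1, 1, 0],
             [0, 1, 1, 0],
             [0, 1, 1, 0]]
test = [[0, 0, 1, 1],
        [0, 1, 1, 0],
        [0, 1, 1, 0]]
granulation = [[0, 0, 0, 0],
               [0, 1, 0, 0],
               [0, 1, 1, 0]]

fna, fpa, are = trace_errors(test, reference)
wound_pgt = pgt(granulation, reference)
summary = pgt_vs_reviewers(55.0, [40.0, 50.0, 60.0])
```

In this toy case the test tracing has the same total area as the reference but is shifted by one pixel, so ARE is zero while FNA and FPA are both nonzero. That is one reason to report the overlap-based measures alongside a pure area comparison.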


Journal: JAMA Network Open	Publication Date: May 19, 2021
Citations: 23	License type: cc-by


