Abstract

To evaluate the performance of artificial intelligence (AI) as a second reviewer within the DistillerSR platform for a systematic literature review (SLR). Originally, a total of 2,613 references identified in the SLR were assessed by 2 independent human reviewers; a third analyst resolved conflicts. For this evaluation, the AI acted as a second reviewer in the screening process (2 possible outcomes: include or exclude) across 3 single-reviewer manually screened reference groups from the original total references (n=300, 400, and 500). Results were analyzed for 3 training and test proportions in each set (% AI train/% AI test: 5%/80%, 20%/65%, and 80%/20%). The AI screening results of the remainder of the 2,613 references were then compared with the original screened references. The accuracy, sensitivity, and specificity were calculated and compared. The accuracy of AI-reviewed references increased with increasing number of manually screened references for the training sets. Overall, the AI screening accuracy was consistently lower using 300 vs 500 manually screened references across the 3 training sets (5%/80%, 93.3% vs 94.6%; 20%/65%, 93.2% vs 94.3%; 80%/20%, 92.5% vs 94.3%). The sensitivity also improved with increasing numbers of manually screened references and with a higher percentage of references used for training (300 references, range: 0.27 to 0.35 vs 500 references, range: 0.31 to 0.37). The specificity was high across all 3 training sets and manually screened reference sets (range, 0.97 to 0.99). Although the accuracy and specificity of the included and excluded references were high, the sensitivity was low across all training sets. AI within DistillerSR may be useful in streamlining the reference-screening process by prioritizing likely inclusions and providing an additional level of security by verifying exclusions; however, more research is needed before substituting AI as a second reviewer.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call