Labelling instructions matter in biomedical image analysis

Tim Rädsch,Minu D Tizabi,Nicholas Schreck,Annika Reinke,Annette Kopp-Schneider,Bünyamin Pekdemir,A Emre Kavur,Lena Maier-Hein,Vivienn Weru,Tobias Roß

doi:10.1038/s42256-023-00625-5

Tim Rädsch, Minu D Tizabi + Show 8 more

Open Access

PDF Available

https://doi.org/10.1038/s42256-023-00625-5

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Biomedical image analysis algorithm validation depends on high-quality annotation of reference datasets, for which labelling instructions are key. Despite their importance, their optimization remains largely unexplored. Here we present a systematic study of labelling instructions and their impact on annotation quality in the field. Through comprehensive examination of professional practice and international competitions registered at the Medical Image Computing and Computer Assisted Intervention Society, the largest international society in the biomedical imaging field, we uncovered a discrepancy between annotators’ needs for labelling instructions and their current quality and availability. On the basis of an analysis of 14,040 images annotated by 156 annotators from four professional annotation companies and 708 Amazon Mechanical Turk crowdworkers using instructions with different information density levels, we further found that including exemplary images substantially boosts annotation performance compared with text-only descriptions, while solely extending text descriptions does not. Finally, professional annotators constantly outperform Amazon Mechanical Turk crowdworkers. Our study raises awareness for the need of quality standards in biomedical image analysis labelling instructions.

Full Text