This study presents a novel approach to skin toxicity assessment in preclinical radiotherapy trials through an advanced imaging setup and deep learning. Skin reactions, commonly associated with undesirable side effects in radiotherapy, were meticulously evaluated in 160 mice across four studies. A comprehensive dataset containing 7542 images was derived from proton/electron trials with matched manual scoring of the acute toxicity on the right hind leg, which was the target area irradiated in the trials. This dataset was the foundation for the subsequent model training. The two-step deep learning framework incorporated an object detection model for hind leg detection and a classification model for toxicity classification. An observer study involving five experts and the deep learning model, was conducted to analyze the retrospective capabilities and inter-observer variations. The results revealed that the hind leg object detection model exhibited a robust performance, achieving an accuracy of almost 99%. Subsequently, the classification model demonstrated an overall accuracy of about 85%, revealing nuanced challenges in specific toxicity grades. The observer study highlighted high inter-observer agreement and showcased the model's superiority in accuracy and misclassification distance. In conclusion, this study signifies an advancement in objective and reproducible skin toxicity assessment. The imaging and deep learning system not only allows for retrospective toxicity scoring, but also presents a potential for minimizing inter-observer variation and evaluation times, addressing critical gaps in manual scoring methodologies. Future recommendations include refining the system through an expanded training dataset, paving the way for its deployment in preclinical research and radiotherapy trials.
Read full abstract