Detection and classification of diatoms has been considered as the most effective method for the drowning diagnosis in forensic practice. However, challenges (such as limited samples, complex background interference, and detecting multiple objects) remain during detection and classification. To address these challenges, we propose a MDMS (multi-scale dynamic multi-head self-attention)-based deep learning network for the diatom-based drowning diagnosis. This framework uses TFFT (two-stage full fine-tuning training strategy) to solve the challenges of limited diatom samples and detecting multiple objects, and builds a multi-scale dynamic multi-head self-attention module to extract the local and global features of diatom images. In the meantime, we introduce an online hard example mining strategy to attenuate the complex background interference. The experimental results show that the proposed framework can effectively reduce the missing and false detection rates of diatom objects, with the mAP (mean Average Precision) reaching 92.434%, which is better than the result using the mainstream methods.
Read full abstract