In recent years, anchor-free object detectors have become predominant in deep learning, the YOLOv8 model as a real-time object detector based on anchor-free frames is universal and influential, it efficiently detects objects across multiple scales. However, the generalization performance of the model is lacking, and the feature fusion within the neck module overly relies on its structural design and dataset size, and it is particularly difficult to localize and detect small objects. To address these issues, we propose the FL-YOLOv8 object detector, which is improved based on YOLOv8s. Firstly, we introduce the FSDI module in the neck, enhancing semantic information across all layers and incorporating rich detailed features through straightforward layer-hopping connections. This module integrates both high-level and low-level information to enhance the accuracy and efficiency of image detection. Meanwhile, the structure of the model was optimized and designed, and the LSCD module is constructed in the detection head; adopting a lightweight shared convolutional detection head reduces the number of parameters and computation of the model by 19% and 10%, respectively. Our model achieves a comprehensive performance of 45.5% on the COCO generalized dataset, surpassing the benchmark by 0.8 percentage points. To further validate the effectiveness of the method, experiments were also performed on specific domain urine sediment data (FCUS22), and the results on category detection also better justify the FL-YOLOv8 object detection algorithm.
Read full abstract