High-accuracy positioning and identification of transmission line components is the premise for their status detection and fault diagnosis. However, due to the limitations of imbalance object scale and distribution in aerial images, the problem of detecting dense-tiny objects still needs to be solved. Considering the prior knowledge of fixed connection scenes, we proposed a novel method named scene iterative reasoning network (SIRN) based on self-adaptive clustering and local scene knowledge. It consists of the coarse-grained detector (CGD), scene self-adaptive clustering (SSAC) subnetwork, and scene structure iterative reasoning (SSIR) subnetwork. The CGD was proposed to obtain global coarse detection results. Then, the SSAC subnetwork utilized unsupervised self-adaptive clustering to obtain accurate dense regions based on the CGD results. Finally, the SSIR subnetwork was designed to extract and fuse the scene semantic structure information of local regions for realizing accurate dense-tiny object detection. In summary, the SIRN model utilizes an iterative inference strategy to detect dense regions accurately. By leveraging scene structure information, the model effectively transforms the challenge of detecting dense objects into an advantage, resulting in precise detection. The SIRN model is compatible with single-stage and two-stage object detection models, achieving a notable mean average precision (mAP) improvement ranging from 4.4% to 12.2% compared to the baseline models. Compared to other state-of-the-art object detection models, SIRN demonstrates significant advantages in accuracy and error rate metrics. In addition, qualitative and quantitative experiments show that the SIRN model considerably enhanced the detection of dense-tiny objects. The code is available at https://github.com/CharmingWang/SIRN.
Read full abstract