Abstract To enhance the intelligent obstacle avoidance capabilities of unmanned aerial vehicles (UAVs), we introduce a refined version of the You Only Look Once (YOLO) algorithm, termed Improved YOLO (I-YOLO). This novel algorithm not only maintains the swift detection speed inherent in the original YOLO but also augments its feature enhancement network. Specifically, we propose a three-branch parallel feature pyramid network (TBPFP) to capture richer semantic information pertaining to the context of small obstacles. Furthermore, we incorporate a generative adversarial network (GAN) in a cascaded manner to synthesize more realistic super-resolution images, thereby bolstering detection accuracy. Experimental evaluations demonstrate that, in comparison to the baseline YOLO algorithm, our I-YOLO variant exhibits superior detection precision and feature extraction capabilities in the perception of intricate environmental targets. While there is a modest decrement in detection speed, it remains adequate to fulfill real-time requirements, particularly in the context of video stream processing.
Read full abstract