Abstract
Multiple object retrieval methods are solved by the CA-YOLO model to find elements in complex spatial images. This addresses weak multiscale feature learning and the delicate balance between model complexity and performance. CA-YOLO is based on YOLOv5 and uses a small coordinate attention module at the lower layer to recover selected features and reduce data. A fast spatial pyramid with tandem design modules at the bottom uses stochastic pooling to speed up thinking and merge features of different sizes. Anchor box interactions and missing features have been updated to make it easier to find items of different sizes and shapes. CA-YOLO outperforms YOLO in multiple object detection and average inference speed of 125 frames/sec. CA-YOLO is a great option in the same conditions and difficulties. The study also investigated V3 tiny, V4, V5s, V8s, CA Yolos, and V5x6 YOLO models. This suggests that YOLO V5x6 can achieve more than 95% mAP on remote sensing object identification datasets. Index terms - Object detection, attention mechanism, coordinate attention, SPPF, SIoU loss.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: INTERANTIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING AND MANAGEMENT
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.