With the development of artificial intelligence technology, intelligent weapon systems that can automatically identify, lock on and strike targets have gradually appeared and can replace humans in executing simple decision-making commands. Target detection is a key part of intelligent weapons. At present, large-scale target detection has serious challenges such as long-tail data distributions, severe occlusion, and category ambiguity. The main detection algorithms only detect each independent area without considering the key semantic dependencies between objects. It has become a hot trend to apply deep learning to prior knowledge to form a model. This article uses both internal and external knowledge to instill a target detection system with human reasoning capabilities. Commonly used external embedded knowledge includes geometric relations, attributes, locations, etc. They have a common shortcoming in that they require large amounts of labeled data, and the integration costs are huge. The purpose of this article is to construct a general external prior knowledge module to guide network learning. By paying attention to the characteristics of each object in different semantic contexts, the characteristics of each object are adaptively enhanced, and the high-level semantics of all categories evolve on a global scale. Internal knowledge uses a convolutional attention module that can learn spatial and channel information at multiple scales. The experimental results show the superiority of our knowledge-YOLOv5. The proposed method achieved 1.7%, 2.2%, 1.1%, and 0.7% improvements over YOLOv5s, YOLOv5m, VOLOv51, and YOLOv5x, respectively, on the COCO data sets; and the proposed method also achieves a 0.9% improvement on the self-built data set. The trained lightweight model Knowledge-YOLOv5s is deployed on an NVIDIA Jetson TX2 through TensorRT acceleration, and the real-time detection frame is 20 ms, which meets the real-time detection requirements. This system can also be used as a module of an intelligent weapon system, which has certain referential significance for autonomous weapons and unmanned combat systems.
Read full abstract