Abstract
In addition to static features, dynamic features are also important for smoke recognition. 3D convolution can extract temporal and spatial information from video sequences. Currently, for video smoke detection, 3D convolution is usually used as a tool for secondary judgment of the detection results of single frame approaches. In this work, an end-to-end object detection neural network based on 3D convolution for video smoke detection, named 3DVSD, is proposed for the first time. The network captures moving objects from the input video sequences by the dynamic feature extraction part first and then inputs the feature tensor to the static feature extraction part for recognition and localization, which makes full use of the spatiotemporal features of smoke and improves the reliability of the algorithm. In addition, a time-series smoke video dataset for network training is proposed. The proposed algorithm is compared with other related studies. The experimental results demonstrated that the 3DVSD is promising with an accuracy rate of 99.54%, a false alarm rate of 1.11%, and a missed detection rate of 0.14%, and meets the requirements of real-time detection.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have