Abstract

For practical use, the relevance of indoor surveillance from multiple cameras to track the movement of people and reidentify them in video sequences is constantly increasing. This is a complex task due to the effect of uneven illumination, background inhomogeneity, overlap, uncertainty of the trajectories of people, and the similarity of their visual features. The article presents an approach to track people by video sequences and reidentify them in multicamera video surveillance systems that are used indoors. At the first step, people are detected using a YOLO v4 convolution neural network (CNN) and described by a rectangular area. Further, the search for the face area and the calculation of its features are carried out, which in the developed method are used when accompanying a person in a video sequence and during his intercamera reidentification. This approach improves the accuracy of tracking with a complex movement trajectory and multiple intersections of people with similar characteristics. The search for faces is carried out on the detected areas based on the multitasking MTCNN, and the MobileFaceNetwork model is used to form the vector of the features of the face. Human features are generated using a modified CNN based on ResNet34 and an HSV color tone channel histogram. The correspondence between people on different frames is established based on the analysis of the spatial coordinates of faces and people, as well as their CNN features, using the Hungarian algorithm. To ensure the accuracy of intercamera tracking, reidentification is performed based on the facial features. Five test video sequences of different numbers of people captured indoors with a fixed video camera were used to test and compare different approaches. The obtained experimental results confirmed the strength of the characteristics of the proposed approach.

Highlights

  • The relevance of indoor surveillance from multiple cameras to track the movement of people and reidentify them in video sequences is constantly increasing

  • The search for the face area and the calculation of its features are carried out, which in the developed method are used when accompanying a person in a video sequence and during his intercamera reidentification

  • The search for faces is carried out on the detected areas based on the multitasking MTCNN, and the MobileFaceNetwork model is used to form the vector of the features of the face

Read more

Summary

Article Google Scholar

X. Wang, “Intelligent multi-camera video surveillance: A review,” Pattern Recognit. Y. Zakharava, “Person tracking algorithm based on convolutional neural network for indoor video surveillance,” Comput. Of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR2017) (Honolulu, HI, 2017), pp. “People tracking and re-identification by face recognition for RGB-D camera networks,” in Proc. Of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Boston, Massachusetts, 2015), pp. Of the IEEE Conference on Computer Vision and Pattern. Of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (Honolulu, HI, 2017), pp. “Multi-view face detection and landmark localization based on MTCNN,” in Proc. “Robust person tracking algorithm based on Convolutional Neural Network for indoor video surveillance,” in Pattern Recognition and Information Processing, PRIP 2019, Ed. by S.

MathSciNet Article Google Scholar
Additional information
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.