Applications of object and visual relationship detection for safety and security applications are in its infancy. State-of-the-art computer vision research is largely focused on improving mean Average Precision and mean Average Recall performance on standard, general datasets such as the Verbs in COmmon objects in COntext and the Visual Genome dataset and rarely mention the potential of such models in safety and security scenarios. We propose to train and develop an object and visual relationship detection neural network to be used as part of the back-end model for a decision support system. We use a naive Bayesian Network to determine scenarios where our proposed object and visual relationship detection network is error-prone. We also release a graphical user interface which demonstrates how our back-end neural network and naive Bayesian Network can be used for hazardous workplace safety and security applications. All models and source code can be found at <uri xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">https://github.com/th-truong/rd-gui</uri> for future research and applications of object and visual relationship detection models in the safety and security sector.
Read full abstract