Abstract

Studies on room monitoring have only focused on objects in a singular and uniform posture or low-density groups. Considering the wide use of convolutional neural networks for object detection, especially person detection, we use deep learning and perspective correction techniques to propose a room monitoring system that can detect persons with different motion states, high-density groups, and small-sized persons owing to the distance from the camera. This system uses consecutive frames from the monitoring camera as input images. Two approaches are used: perspective correction and person detection. First, perspective correction is used to transform an input image into a 2D top-view image. This allows users to observe the system more easily with different views (2D and 3D views). Second, the proposed person detection scheme combines the Mask region-based convolutional neural network (R-CNN) scheme and the tile technique for person detection, especially for detecting small-sized persons. All results are stored in a cloud database. Moreover, new person coordinates in 2D images are generated from the final bounding boxes and heat maps are created according to the 2D images; these enable users to examine the system quickly in different views. Additionally, a system prototype is developed to demonstrate the feasibility of the proposed system. Experimental results prove that our proposed system outperforms existing schemes in terms of accuracy, mean absolute error (MAE), and root mean squared error (RMSE).

Highlights

  • Smart sustainable cities use multiple technologies to increase people’s comfort levels

  • We proposed a new person detection scheme that combines the Mask region-based convolutional neural network (R-convolutional neural networks (CNNs)) scheme and tile technique

  • We propose a new person detection scheme that combines the Mask R-CNN scheme and tile technique

Read more

Summary

Introduction

Smart sustainable cities use multiple technologies to increase people’s comfort levels This concept has attracted many governments’ interest, and many studies have explored the methods of improving the performance of Internet of Things (IoT) applications. This study proposes a room monitoring system using deep learning and perspective correction techniques. The two image sources are explained as follows: The first image source is from the camera manufacturer that provides an ftp client environment It can automatically return one image every second to the ftp server we set up, which can be integrated in the system; the second image source is to use the browser to view the image. We proposed a room monitoring system using deep learning and perspective correction techniques.

Perspective Correction
Object Detection
Room Monitoring System
System Architecture
Perspective Correction Technique
Proposed Person Detection Scheme
Tile Division
Object Detection and a Person Class Filter
Merging of the Bounding Boxes
System Implementation and Prototype
Testing Datasets
Comparison of Schemes in Terms of Precision and Recall
Evaluation of Schemes Using MAE and RMSE
Analysis of Monitoring System
Evaluation of Used Areas in Room
Summary
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.