Towards Robust Object Detection in Floor Plan Images: A Data Augmentation Approach

Shashank Mishra,Marcus Liwicki,Khurram Azeem Hashmi,Muhammad Zeshan Afzal,Alain Pagani,Didier Stricker

doi:10.3390/app112311174

Abstract

Object detection is one of the most critical tasks in the field of Computer vision. This task comprises identifying and localizing an object in the image. Architectural floor plans represent the layout of buildings and apartments. The floor plans consist of walls, windows, stairs, and other furniture objects. While recognizing floor plan objects is straightforward for humans, automatically processing floor plans and recognizing objects is challenging. In this work, we investigate the performance of the recently introduced Cascade Mask R-CNN network to solve object detection in floor plan images. Furthermore, we experimentally establish that deformable convolution works better than conventional convolutions in the proposed framework. Prior datasets for object detection in floor plan images are either publicly unavailable or contain few samples. We introduce SFPI, a novel synthetic floor plan dataset consisting of 10,000 images to address this issue. Our proposed method conveniently exceeds the previous state-of-the-art results on the SESYD dataset with an mAP of 98.1%. Moreover, it sets impressive baseline results on our novel SFPI dataset with an mAP of 99.8%. We believe that introducing the modern dataset enables the researcher to enhance the research in this domain.

Highlights

We present an end-to-end trainable framework that works on Cascade Mask RCNN [15] with conventional and deformable [16] convolutional backbone network to detect various objects in floor plan images
Our backbone ResNeXt-101 [17] is pre-trained on MS-COCO dataset [26]. Using this pre-trained feature extraction backbone helps our architecture to adapt from the domain of natural scenes to floor plan images
We achieve a 0.995 Mean Average Precision (mAP) score and 0.997 Mean Average Recall (mAR) score. This clearly shows that our model performs better on the SFPI dataset where we have sufficient images to train a model as compared to less number of images we have in SESYD [3]

Summary

Introduction

Architectural floor plans contain both structural and semantic information, e.g., room size, type, location of doors, walls, and furniture [3]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Nov 25, 2021
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Towards Robust Object Detection in Floor Plan Images: A Data Augmentation Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Object Detection in Floor Plan Images
Zahra Ziran ... Simone Marinai
-
Zahra Ziran, et. al.Zahra Ziran ... Simone Marinai
01 Jan 2018
01 Jan 2018

Chapter 10 - A computational approach to understand building floor plan images using machine learning techniques
Shreya Goyal ... Gaurav Bhatnagar
Internet of Multimedia Things (IoMT) | VOL. -
Shreya Goyal, et. al.Shreya Goyal ... Gaurav Bhatnagar
01 Jan 2021
Internet of Multimedia Things (IoMT) | VOL. -

An empirical study of multi-scale object detection in high resolution UAV images
Haijun Zhang ... Yuzhu Ji
Neurocomputing | VOL. 421
Haijun Zhang, et. al.Haijun Zhang ... Yuzhu Ji
28 Sep 2020
Neurocomputing | VOL. 421

Object Detection in Image with Complex Background
Li Dong ... Wang Shengjin
-
Li Dong, et. al.Li Dong ... Wang Shengjin
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards Robust Object Detection in Floor Plan Images: A Data Augmentation Approach

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences