Manifesting construction activity scenes via image captioning

Huan Liu,Guangbin Wang,Ting Huang,Ping He,Martin Skitmore,Xiaochun Luo

doi:10.1016/j.autcon.2020.103334

Abstract

This study proposed an automated method for manifesting construction activity scenes by image captioning – an approach rooted in computer vision and natural language generation. A linguistic description schema for manifesting the scenes is developed initially and two unique dedicated image captioning datasets are created for method validation. A general model architecture of image captioning is then instituted by combining an encoder-decoder framework with deep neural networks, followed by three experimental tests involving the selection of model learning strategies and performance evaluation metrics. It is demonstrated the method's performance is comparable with that of state-of-the-art computer vision methods in general. The paper concludes with a discussion of the feasibility of the practical application of the proposed approach at the current technical level.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Automation in Construction	Publication Date: Jul 6, 2020
Citations: 49	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Manifesting construction activity scenes via image captioning

Abstract

Talk to us

Similar Papers

More From: Automation in Construction

Lead the way for us

Similar Papers

Deep Learning in Natural Language Generation from Images
Xiaodong He ... Li Deng
-
Xiaodong He, et. al.Xiaodong He ... Li Deng
01 Jan 2018
01 Jan 2018

From Show to Tell: A Survey on Deep Learning-Based Image Captioning.
Matteo Stefanini ... Marcella Cornia
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
Matteo Stefanini, et. al.Matteo Stefanini ... Marcella Cornia
01 Jan 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45

A thorough review of models, evaluation metrics, and datasets on image captioning
Gaifang Luo ... Guozhu Song
IET Image Processing | VOL. 16
Gaifang Luo, et. al.Gaifang Luo ... Guozhu Song
22 Nov 2021
IET Image Processing | VOL. 16

Image Captioning Based on Deep Neural Networks
Shuang Liu ... Liang Bai
MATEC Web of Conferences | VOL. 232
Shuang Liu, et. al.Shuang Liu ... Liang Bai
01 Jan 2018
MATEC Web of Conferences | VOL. 232

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Manifesting construction activity scenes via image captioning

Abstract

Talk to us

Similar Papers

More From: Automation in Construction