Scene-level buildings damage recognition based on Cross Conv-Transformer

Lingfei Shi,Feng Zhang,Junshi Xia,Jibo Xie

doi:10.1080/17538947.2023.2261770

Abstract

ABSTRACTDifferent to pixel-based and object-based image recognition, a larger perspective based on the scene can improve the efficiency of assessing large-scale building damage. However, the complexity of disaster scenes and the scarcity of datasets are major challenges in identifying building damage. To address these challenges, the Cross Conv-Transformer model is proposed to classify and evaluate the degree of damage to buildings using aerial images taken after earthquake. We employ Conv-Embedding and Conv-Projection to extract features from the images. The integration of convolution and Transformer reduces the computational burden of the model while enhancing its feature extraction capabilities. Furthermore, the two branch Conv-Transformer architecture with global and local attention is designed, allowing each branch to focus on global and local features respectively. The cross-attention fusion module merges feature information from the two branches to enrich classification features. At last, we utilize aerial images captured during the Beichuan and Yushu earthquakes as both the training and test sets to assess the model. The proposed Cross Conv-Transformer model improved classification accuracy by 4.7% and 2.1% compared to the ViT and EfficientNet. The results show that the Cross Conv-Transformer model could significantly reduces misclassification between severely and moderately damaged categories.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Scene-level buildings damage recognition based on Cross Conv-Transformer

Abstract

Talk to us

Similar Papers

More From: International Journal of Digital Earth

Lead the way for us

Journal: International Journal of Digital Earth	Publication Date: Sep 28, 2023
License type: CC BY 4.0

Similar Papers

Joint Coding of Local and Global Deep Features in Videos for Visual Search.
Lin Ding ... Yonghong Tian
IEEE Transactions on Image Processing | VOL. 29
Lin Ding, et. al.Lin Ding ... Yonghong Tian
01 Jan 2020
IEEE Transactions on Image Processing | VOL. 29

Author response: A connectomics-based taxonomy of mammals
Laura E Suarez ... Yossi Yovel
-
Laura E Suarez, et. al.Laura E Suarez ... Yossi Yovel
10 Oct 2022
10 Oct 2022

Damaged building detection in aerial images using shadow Information
Beril Sirmacek ... Cem Unsalan
-
Beril Sirmacek, et. al.Beril Sirmacek ... Cem Unsalan
01 Jun 2009
01 Jun 2009

Human action recognition using bag of global and local Zernike moment features
Saleh Aly ... Asmaa Sayed
Multimedia Tools and Applications | VOL. 78
Saleh Aly, et. al.Saleh Aly ... Asmaa Sayed
15 May 2019
Multimedia Tools and Applications | VOL. 78

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Scene-level buildings damage recognition based on Cross Conv-Transformer

Abstract

Talk to us

Similar Papers

More From: International Journal of Digital Earth