Remote Sensing Image Generation From Audio

Zhiyuan Zheng,Xiaoqiang Lu,Jun Chen,Xiangtao Zheng

doi:10.1109/lgrs.2020.2992324

Abstract

Generating image from other modal data has attracted much attention in cross-modal studies, since the generated image offers intuitive vision information. Unlike the previous works which generate an image from text, a novel task is introduced, generating an image from audio. However, semantic gap intrinsically exists in cross-modal data, which disturbs the generative results. In order to explore the relevance between the audio and image, a novel reranking audio-image translation method is proposed. The proposed method: 1) maps the audio and image into a uniform feature space; 2) designs an audio-audio matching network to match the related audio; and 3) adopts an audio-image matching network for every matched audio to generate a related image, and the most frequent image is voted as the final result. Extensive experiments on two remote sensing cross-modal data sets demonstrate that the proposed method can visualize the content of audio.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Remote Sensing Image Generation From Audio

Abstract

Talk to us

Similar Papers

More From: IEEE Geoscience and Remote Sensing Letters

Lead the way for us

Journal: IEEE Geoscience and Remote Sensing Letters	Publication Date: May 15, 2020
Citations: 4

Similar Papers

Common Semantic Representation Method Based on Object Attention and Adversarial Learning for Cross-Modal Data in IoV
Feifei Kou ... Wanqiu Cui
IEEE Transactions on Vehicular Technology | VOL. 68
Feifei Kou, et. al.Feifei Kou ... Wanqiu Cui
01 Dec 2019
IEEE Transactions on Vehicular Technology | VOL. 68

Model updating using uncorrelated modes
S.V Modak
Journal of Sound and Vibration | VOL. 333
S.V ModakS.V Modak
18 Feb 2014
Journal of Sound and Vibration | VOL. 333

STRUCTURAL DAMAGE DETECTION BASED ON A MICRO-GENETIC ALGORITHM USING INCOMPLETE AND NOISY MODAL TEST DATA
F.T.K Au ... Z.Z Bai
Journal of Sound and Vibration | VOL. 259
F.T.K Au, et. al.F.T.K Au ... Z.Z Bai
31 Dec 2003
Journal of Sound and Vibration | VOL. 259

Simultaneous Recognition and Assessment of Post-Stroke Hemiparetic Gait by Fusing Kinematic, Kinetic, and Electrophysiological Data.
Chengkun Cui ... Weiqun Wang
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 26
Chengkun Cui, et. al.Chengkun Cui ... Weiqun Wang
01 Apr 2018
IEEE Transactions on Neural Systems and Rehabilitation Engineering | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Remote Sensing Image Generation From Audio

Abstract

Talk to us

Similar Papers

More From: IEEE Geoscience and Remote Sensing Letters