Abstract

Ship recognition based on ship-radiated noise is one of the most important and challenging subjects in underwater acoustic signal processing. The recognition methods for ship-radiated noise recognition include traditional methods and deep learning (DL) methods. Developing from the DL methods and inspired by audio–video speech recognition (AVSR), the paper further introduces multimodal deep learning (multimodal-DL) methods for the recognition of ship-radiated noise. In this paper, ship-radiated noise (acoustics modality) and visual observation of the ships (visual modality) are two different modalities that the multimodal-DL methods model on. The paper specially designs a multimodal-DL framework, the multimodal convolutional neural networks (multimodal-CNNs) for the recognition of ship-radiated noise. Then the paper proposes a strategy based on canonical correlation analysis (CCA-based strategy) to build a joint representation and recognition on the two different single-modality (acoustics modality and visual modality). The multimodal-CNNs and the CCA-based strategy are tested on real ship-radiated noise data recorded. Experimental results show that, using the CCA-based strategy, strong-discriminative information can be built from weak-discriminative information provided from a single-modality. Experimental results also show that as long as any one of the single-modalities can provide information for the recognition, the multimodal-DL methods can have a much better multiclass recognition performance than the DL methods. The paper also discusses the advantages and superiorities of the multimodal-Dl methods over the traditional methods for ship-radiated noise recognition.

Highlights

  • When a ship moves on the water, it produces noise, called ship-radiated noise

  • Inspired by audio–video speech recognition (AVSR) [2] based on multimodal-deep learning (DL) methods, the paper introduces multimodal-DL methods to jointly model on different modalities from the ships for ship-radiated noise recognition

  • Inspired by AVSR based on multimodal-DL methods, we introduce multimodal-DL methods for joint representation and recognition of the ship-radiated noise

Read more

Summary

Introduction

When a ship moves on the water, it produces noise, called ship-radiated noise. Inspired by AVSR [2] based on multimodal-DL methods, the paper introduces multimodal-DL methods to jointly model on different modalities from the ships for ship-radiated noise recognition. The ship-radiated noise (acoustics modality) and visual observation of the ships (visual modality) are the two different modalities that the multimodal-DL methods model on. Inspired by [4], the paper specially designs a CNN-based multimodal-DL framework (multimodal-CNNs) to jointly model on the two different modalities (acoustics modality and visual modality) from the ships. The most import issue of multimodal-DL methods for ship-radiated noise recognition is to figure out the joint representation problem, that is to build a more discriminative joint representation over the two different modalities. 3. The paper proposes the CCA-based strategy to build a more discriminative joint representation and recognition on the two single-modality.

Related Work
Multimodal Deep Learning Methods for Ship-Radiated Nose Recognition
Application Scenario
The Multimodal-CNNs Framework
Training Method for Multimodal-CNNs Framework
CCA-Based Strategy
Experiment Setting
Single-Modality Consideration
The Acoustics Modality
The Visual Modality
Joint Representation
Joint Recognition
Findings
Conclusions
Future Work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call