Abstract

This paper presents a learning based method for detecting whistles of toothed whales from underwater hydrophone recordings. Our method represents audio signals as time-frequency spectrogram and employs the Fully Convolution Network (FCN) to estimate for each spectrogram a map of contour confidences that are used for extracting discrete whistle contours. To avoid the expensive efforts of annotating whistle contours, we develop a data synthesis approach to generate spectrogram-contour pairs using spectrogram of background environment and a small set of whistle contours. Our study suggests that the deep contour model can be effectively learned from these synthesized samples. However, it is costly and unnecessary to synthesize equal amount of samples for each spectrogram or contour. Instead, we present an alternative learning algorithm that synthesize samples only for those spectrogram or contours that are not well modeled by the current network, measured by recall rates of contour points for each spectrogram-contour sample. This recall-guided learning algorithm can adaptively synthesize difficult samples to boost learning effectiveness. We applied the proposed method to the public DCLDE2011 dataset to extract whistle contours. Results show that our method can improve state-of-the-art method up to 21.9% in terms of F-score for multiple odontocete species.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.