The study of animal sounds in biology and ecology relies heavily on time-frequency (TF) visualisation, most commonly the short-time Fourier transform (STFT) spectrogram. This method, however, has an inherent bias towards either temporal or spectral detail, which can lead to misinterpretation of complex animal sounds. An ideal TF visualisation should accurately convey the structure of a sound in both frequency and time; the STFT often cannot meet this requirement. We evaluate the accuracy of four TF visualisation methods (the superlet transform [SLT], the continuous wavelet transform [CWT] and two STFTs) using a synthetic test signal. We then apply these methods to visualise sounds of the Chagos blue whale, Asian elephant, southern cassowary, eastern whipbird, mulloway fish and American crocodile. We show that the SLT visualises the test signal with 18.48%–28.08% less error than the other methods. A comparison between our visualisations of animal sounds and their descriptions in the literature indicates that the STFT's bias may have caused misinterpretations in descriptions of pygmy blue whale songs and elephant rumbles. We suggest that using the SLT to visualise low-frequency animal sounds may prevent such misinterpretations. Finally, we employ the SLT to develop 'BASSA', an open-source GUI application for the Windows platform that offers a no-code, user-friendly tool for analysing short-duration recordings of low-frequency animal sounds. The SLT visualises low-frequency animal sounds with improved accuracy and in a user-friendly format, minimising the risk of misinterpretation while requiring less technical expertise than the STFT. Using this method could propel advances in acoustics-driven studies of animal communication, vocal production methods, phonation and species identification.
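The STFT bias described above can be illustrated with a minimal sketch. It assumes a fixed analysis window and uses window duration and DFT bin spacing as rough proxies for temporal and spectral resolution; the true resolution also depends on the window shape. The function name `stft_resolution`, the 1000 Hz sample rate and the two window lengths are hypothetical values chosen for illustration and are not taken from the study.

```python
def stft_resolution(sample_rate_hz, window_length_samples):
    """Return (window duration in s, DFT bin spacing in Hz) for one STFT window.

    These are rough proxies for temporal and spectral resolution; the
    effective values also depend on the window function used.
    """
    time_resolution_s = window_length_samples / sample_rate_hz
    frequency_resolution_hz = sample_rate_hz / window_length_samples
    return time_resolution_s, frequency_resolution_hz


# Hypothetical settings for a low-frequency recording (assumption, not from the paper).
sample_rate_hz = 1000

for window_length_samples in (128, 1024):
    dt, df = stft_resolution(sample_rate_hz, window_length_samples)
    print(f"window={window_length_samples} samples: "
          f"duration={dt:.3f} s, bin spacing={df:.3f} Hz")
```

Under these assumptions, the product of window duration and bin spacing is always 1, so a short window sharpens timing at the cost of frequency detail and a long window does the reverse. This is the trade-off that forces a single STFT to favour one domain over the other.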