Abstract

An automated system capable of reliably segmenting and classifying bird phrases would help analyze field recordings. Here we describe a phrase segmentation method using entropy-based change-point detection. Spectrograms of bird calls are often very sparse while the background noise is relatively white. Therefore, considering the entropy of a sliding time- frequency window on the spectrogram, the entropy dips when detecting a signal and rises back up when the signal ends. Rather than a simple threshold on the entropy to determine the beginning and end of a signal, a Bayesian recursion-based change-point detection(CPD) method is used to detect sudden changes in the entropy sequence. CPD reacts only to those statistical changes, so generates more accurate time labels and reduces the false alarm rate than conventional energy detection methods. The segmented phrases are then used for training and testing a sparse representation(SR) classifier, which performs phrase classification by a sparse linear combination of feature vectors in the training set. With only 7 training tokens for each phrase, the SR classifier achieved 84.17% accuracy on a database containing 852 phrases from Cassins Vireo (Vireo casinii ) phrases that were hand-classified into 32 types. [This work was supported by NSF.]

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call