Abstract

This paper demonstrates automatic recognition of vocalizations of four common bird species (herring gull [Larus argentatus], blue jay [Cyanocitta cristata], Canada goose [Branta canadensis], and American crow [Corvus brachyrhynchos]) using an algorithm that extracts frequency track sets using track properties of importance and harmonic correlation. The main result is that a complex harmonic vocalization is rendered into a set of related tracks that is easily applied to statistical models of the actual bird vocalizations. For each vocalization type, a statistical model of the vocalization was created by transforming the training set frequency tracks into feature vectors. The extraction algorithm extracts sets of frequency tracks from test recordings that closely approximate harmonic sounds in the file being processed. Each extracted set in its final form is then compared with the statistical models generated during the training phase using Mahalanobis distance functions. If it matches one of the models closely, the recognizer declares the set an occurrence of the corresponding vocalization. The method was evaluated against a test set containing vocalizations of both the 4 target species and 16 additional species as well as background noise containing planes, cars, and various natural sounds.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call