Machine learning approaches for pattern recognition are increasingly popular. However, the underlying algorithms are often not open source, may require substantial data for model training, and are not geared toward specific tasks. We used open-source software to build a green toad breeding call detection algorithm that will aid in field data analysis. We provide instructions on how to reproduce our approach for other animal sounds and research questions. Our approach using 34 green toad call sequences and 166 audio files without green toad sounds had an accuracy of 0.99 when split into training (70%) and testing (30%) datasets. The final algorithm was applied to amphibian sounds newly collected by citizen scientists. Our function used three categories: “Green toad(s) detected”, “No green toad(s) detected”, and “Double check”. Ninety percent of files containing green toad calls were classified as “Green toad(s) detected”, and the remaining 10% as “Double check”. Eighty-nine percent of files not containing green toad calls were classified as “No green toad(s) detected”, and the remaining 11% as “Double check”. Hence, none of the files were classified in the wrong category. We conclude that it is feasible for researchers to build their own efficient pattern recognition algorithm.
Read full abstract