Abstract

Audio element detection in wireless sensor networks (WSNs) has great significance in our lives (e.g., in detecting traffic jam and accident, gun shots and explosion, and hurricane). It is particularly useful when video cameras cannot be used effectively (e.g., in darkness, with a wide range to cover); audio sensors are also much cheaper. However, most previous works on audio element detection require a large number of training examples to obtain satisfactory results. This becomes even more infeasible for audio sensors in WSNs where small energy consumption is required. In this paper, we propose a novel approach to solve this difficult problem. We first break down audio clips into a collection of simple “audio elements,” and train these audio elements offline using statistical learning. Then, we train a weighted association graph using the trained audio element models online. This greatly reduces the amount of online training without sacrificing accuracy. We deploy our approach in an audio sensor network for traffic monitoring and venue monitoring to evaluate its performance. The experiments demonstrate that our proposed method achieves better results compared to the state-of-the-art methods while using smaller online training sets.

Highlights

  • Wireless sensor networks (WSNs) which combine the technology of sensor, embedded system, and wireless communications have a bright application prospect

  • To fully evaluate the performance of the three-feature extraction method, 12 basic audio elements are taken into account in the experiments

  • (c) The average recall for the three audio element detection methods in (d) The average precision for the three audio element detection meththe celebration environment ods in the celebration environment graph model in audio element detection, 9 different kinds of weighted association graphs are selected for analysis

Read more

Summary

Introduction

Wireless sensor networks (WSNs) which combine the technology of sensor, embedded system, and wireless communications have a bright application prospect. In some special situations, video cameras cannot work well. In addition to the video cameras, the use of audio sensors in surveillance and monitoring applications is becoming increasingly important [2, 4]. The audio sensors are useful when video cameras cannot be used effectively (e.g., in darkness, with a wide range to cover). Most relevant research focuses on detecting the audio elements in TV programs In these works, most researchers collected enough training samples to train each audio element model. Audio elements are defined as short audio clips, and each represents a basic audio type (such as speech and music). We train a weighted association graph using the trained audio elements online.

Related Works
System Description
Discriminating Principal Component Analysis Feature Extraction Method
Audio Event Modeling and Detection
Implement and Evaluation
Conclusions and Future Works
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call