Abstract

This paper describes a method for non-coplanar microphone arrays that temporally isolates and cleans unknown broadband acoustic impulses for detection, classification, and scene analysis. Possible events are initially identified using a sliding statistical time window. Then the authors posit that most of the false triggers due to environmental noise can be filtered by using generalized cross correlation to phase align the microphone channels and reject implausible velocities. Finally, the phase aligned signals are calibrated and averaged across the microphones. With appropriate hyperparameter tuning, this method appears robust to ambient noise, wind noise and physical interaction. Performance is measured using a simulation and a real historic dataset of over 2 hours of curated acoustic recordings containing 559 gunshots, 120 blasts, and 747 other various weather and non-impulsive events recorded with no prior information under normal operating conditions. Events were found and validated using human listeners with a tool to visualize the waveform and the spectrogram. For this dataset, the model accurately found over 95% of the gunshots with 92% temporal separation and 100% of the blasts identified by the listeners. These results show the method to be a viable solution for impulsive outdoor broadband acoustic signal detection.This paper describes a method for non-coplanar microphone arrays that temporally isolates and cleans unknown broadband acoustic impulses for detection, classification, and scene analysis. Possible events are initially identified using a sliding statistical time window. Then the authors posit that most of the false triggers due to environmental noise can be filtered by using generalized cross correlation to phase align the microphone channels and reject implausible velocities. Finally, the phase aligned signals are calibrated and averaged across the microphones. With appropriate hyperparameter tuning, this method appears robust to ambient noise, wind noise and physical interaction. Performance is measured using a simulation and a real historic dataset of over 2 hours of curated acoustic recordings containing 559 gunshots, 120 blasts, and 747 other various weather and non-impulsive events recorded with no prior information under normal operating conditions. Events were found and validated using human listen...

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.