Abstract

A database of 272 common environmental sounds, recorded for use in the development of a recognition system is described. These recordings represent 5–8 repetitions of 50 individual types of sound. The repetitions span a variety of recording conditions such as microphone positions, location of the source in the recording space, etc. The types of sound include airplanes, sirens, whistles, pouring small objects, telephones, motors, household appliances, footsteps, animal vocalizations, and speech. Simultaneous near- and far-field recordings were made of each sound. Unlike many commercially available collections, our database was produced via a consistent format of equipment use, equipment type, standardized microphone positions, etc. all of which has been exhaustively documented. Beyond those features mentioned above, the database includes information on the proportions and salient features of the recording spaces, the dimensions and salient features of the sources, and details of the recording and computer entry process. Currently, the database is comprised of four parts: (1) digitized audio waveforms; (2) information and notes on recording setup, recording space, source type, equipment, etc.; (3) sketches indicating shape and proportions of the recording setup, and (4) photographs showing sound sources and recording setups. This information is available in machine-readable format on a personal computer. [Work supported by NIH.]

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.