Abstract

The audio track of a video carries abundant semantic information. An audio scene is a temporal audio segment that can be represented by a few basic audio effects. The semantic similarity between a pair of audio scenes is useful for high-level audio semantic understanding. This paper proposes an approach for computing the semantic similarity of audio scenes. First, the audio track is pre-segmented into audio scenes. Then, the basic audio effects that dominate each audio scene are recognized. Finally, the similarity of two audio scenes is calculated with a model consistent with information-theoretic similarity principles and Tversky's set-theoretic similarity. Experimental results indicate that the approach can quantify the semantic similarity of two scenes.
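The final step can be illustrated with a minimal sketch. It is not the paper's model, which the abstract does not spell out. It assumes each scene has already been reduced to a set of effect labels, that each label has a known prior probability (the `effect_prob` table is hypothetical), and that a label's salience is its information content, -log p. Those saliences are combined with Tversky's ratio model, S(A, B) = f(A ∩ B) / (f(A ∩ B) + α f(A − B) + β f(B − A)). The function name, the labels, and the default weights are illustrative assumptions.

```python
import math


def scene_similarity(effects_a, effects_b, effect_prob, alpha=0.5, beta=0.5):
    """Tversky ratio-model similarity between two sets of audio-effect labels.

    Assumption (not from the paper): the salience f(.) of a set of effects is
    the sum of each effect's information content, -log p(effect), so rare
    effects count more than common ones.
    """
    a, b = set(effects_a), set(effects_b)

    def salience(effects):
        # Information content of a set: sum of -log p over its members.
        return sum(-math.log(effect_prob[e]) for e in effects)

    common = salience(a & b)   # effects shared by both scenes
    only_a = salience(a - b)   # effects found only in scene A
    only_b = salience(b - a)   # effects found only in scene B

    denom = common + alpha * only_a + beta * only_b
    # Empty scenes, or scenes made only of zero-information effects, score 0.
    return common / denom if denom > 0 else 0.0


# Hypothetical prior probabilities of basic audio effects in a corpus.
effect_prob = {"speech": 0.5, "music": 0.4, "gunshot": 0.05, "explosion": 0.05}

battle = {"gunshot", "explosion", "music"}
chase = {"gunshot", "music"}
dialogue = {"speech"}

print(scene_similarity(battle, chase, effect_prob))
print(scene_similarity(battle, dialogue, effect_prob))
```

With α = β the measure is symmetric, and it lies in [0, 1]: identical effect sets score 1 and disjoint sets score 0. Choosing α ≠ β makes the comparison directional, which Tversky's model allows when one scene is treated as the reference.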
