Abstract

An earlier work of the authors introduced an adapted version of the Computational Geometry Algorithm (CGA) designed to analyse an audio stream and produce a unique coding-independent fingerprint. As the adaptability and the induced calculation load of the proposed algorithm form a key characteristic for multiple applications, our current investigation aims to measure its performance and stability in dynamic, real-time applications, i.e., in large audio library indexing and dynamic audio recognition. In addition, we investigate the fact that context similarity is also evident across fingerprints; hence a number of comparisons are used to explore the possible uses of this highly desirable algorithmic feature.

Highlights

  • The wide availability of Internet connectivity and media-enabled devices has altered the way content is produced, distributed and reproduced

  • We evaluated the proposed audio fingerprinting and classification algorithm efficiency under three different, but typical and widely-employed application scenarios: a) the compressed application scenario, which aims to assess the identification efficiency of the Computational Geometry Algorithm (CGA) algorithm for compressed quality audio

  • C) the different performance scenario, were we aim at investigating the identification capabilities of the CGA algorithm on the same audio/music material, performed under different conditions

Read more

Summary

Introduction

The wide availability of Internet connectivity and media-enabled devices has altered the way content is produced, distributed and reproduced. These global changes introduce new markets and services (Simpson, 2004), a fact that is clearly evident when observing the evolution of standards such as MPEG-7 linking content to context and offering multimedia accessibility for all as with MPEG-21 (Kosch, 2004). Audio (and/or video streams) are often degraded in terms of quality due to the employment of various compression techniques and the inevitable stream re-compression processes introduced by the wide variety of transmission formats available in all media-enabled platforms These algorithmic-based compression processes such as MPEG 1, 2, 3 and 4 are based on mechanisms of human perception for minimizing the required transmission bandwidth.

Related Study
Implementation Issues-Decision Stages
Results
Case 1
Case 2
Case 3
Results Aggregation-Statistical Evaluation
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.