Abstract

Onset detection still has room for improvement, especially when dealing with polyphonic music signals. For certain purposes in which the correctness of the result is a must, user intervention is hence required to correct the mistakes performed by the detection algorithm. In such interactive paradigm, the exactitude of the detection can be guaranteed at the expense of user’s work, being the effort required to accomplish the task, the value that has to be both quantified and reduced. The present work studies the idea of interactive onset detection and proposes a methodology for assessing the user’s workload, as well as a set of interactive schemes for reducing such workload when carrying out this detection task. Results show that the evaluation strategy proposed is able to quantitatively assess the invested user effort. Also, the presented interactive schemes significantly facilitate the correction task compared with the manual annotation.

Highlights

  • Every onset detection algorithm based on signal processing comprises two different stages: an initial phase, Let NGT denote the total number of onsets to be annotated in an audio file

  • The idea is that the total number of corrections performed in an interactive system CiTnt is lower than, or in the worst-case scenario, equal to, the amount required in a complete manual correction CmTan, i.e. CiTnt ≤ CmTan

  • As a reminder to the reader, the two metrics considered were a) Total Corrections ratio (RTC) that compares the amount of corrections required for correcting a sequence under the interactive paradigm with respect to complete a manual correction; and b) Corrections to Ground Truth ratio (RGT) which contrasts the total amount of interactions performed with the total number of onsets to be annotated

Read more

Summary

Introduction

Every onset detection algorithm based on signal processing comprises two different stages: an initial phase, Fig. 1 Block diagram commonly used for onset detection. The amount of onsets obtained by a detection algorithm may be defined as ND = NOK + NFP whereas the total number of onsets to be estimated can be expressed as NGT = NOK + NFN. Due to the influence of the OSF stage in the overall onset detection process [25], we assume that the detection errors are exclusively produced by considering an inappropriate configuration of this selection function. This constitutes a simplification, we restrict our work to this hypothesis. A set of measures for quantitatively assessing the user effort is proposed

Interaction methodologies
Percentile-based interaction
User effort assessment
Total corrections ratio
Corrections to ground truth ratio
Evaluation methodology
Static metrics
User-centred metrics
Results
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.