Abstract

As music data storage becomes increasingly diverse in the era of big data, ensuring alignment of music works with the same semantics for online music education is crucial. To achieve this, a multi-modal music score alignment algorithm model based on deep learning was developed and optimized. Experimental results demonstrated that Note + DCO feature combination yielded the best MIDI input characteristics (mean value: 13.27 ms), whereas CQT feature comparison produced the best results for audio input (average: 12.85 ms). The ResNet-34 network was noted to have the most effective music score alignment effect with alignment errors averaging less than one frame. Compared with other algorithms, the proposed algorithm had the lowest average value of 9.28 ms, median value of 5.85 ms, and standard deviation of 20.17 ms. Actual music retrieval showed a Top-1 retrieval accuracy of 10.93% that was close to 11%. Overall, the proposed algorithm is significant for score alignment and music retrieval recognition in online music education.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.