The problem of pattern classification between musical or non-musical song audio snippets and non-songs is the main subject of this paper. The machine learning approach accomplishes the goal of automatically interpreting recordings to see if they are songs or not. There are two types of songs: musical and non-musical. This is due to the possibility that it could be difficult to discern between non-musical songs and non-song patterns. Thus, it is necessary to have such systems that differentiate between non-song patterns and songs, whether or not they are melodic. The extraction and selection of characteristics have made advantage of the preprocessing phases. The features that were obtained were pitch, intensity, length (duration), tempo, and sample rate. The Back-propagation Multi-layer Perceptron Neural Network model is used for both dataset testing and model training. This technique will distinguish between a musical or non-musical song and a non-musical piece (such conversation and speech). Training data for the dataset included recordings of speeches and dialogues in Hindi and Urdu for non-song audio files, as well as audio of various male and female singers from Pakistan and India for song audio files in Hindi and Urdu. When the classifier was evaluated using a range of audio samples, 90% of its classifications for songs and non-songs were accurate.
Read full abstract