Abstract
A challenging task in developing real-time Automatic Music Transcription (AMT) methods is directly leveraging inputs from multichannel raw audio without any handcrafted signal transformation or feature extraction steps. The crucial problems are that raw audio contains only an amplitude value at each timestamp, and that the left and right channels have different amplitude intensities and onset times. This study addresses these issues by proposing the IRawNet method, which uses fused feature layers to merge the differing amplitudes from multichannel raw audio. IRawNet aims to transcribe Indonesian classical music notes and was validated on a Gamelan music dataset. The Synthetic Minority Oversampling Technique (SMOTE) was applied to overcome the class imbalance of the dataset. Under various experimental scenarios, the effects of oversampled data, hyperparameter tuning, and fused feature layers on performance are analyzed. Furthermore, the performance of the proposed method was compared with a Temporal Convolutional Network (TCN), Deep WaveNet, and a monochannel IRawNet. The results show that the proposed method achieves superior results on nearly all performance metrics, with an accuracy of 0.871, AUC of 0.988, precision of 0.927, recall of 0.896, and F1 score of 0.896.
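The abstract does not give implementation details for the SMOTE step, but the core idea is simple: synthesize new minority-class samples by interpolating between an existing minority sample and one of its k nearest minority-class neighbours. The following is a minimal, pure-Python sketch of that interpolation step; the 2-D toy feature vectors, function name, and parameters are illustrative assumptions, not the authors' implementation.

```python
import random
from math import dist

def smote(minority, n_new, k=3, seed=0):
    """Sketch of SMOTE: create n_new synthetic samples, each generated by
    moving a randomly chosen minority sample a random fraction of the way
    toward one of its k nearest minority-class neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # k nearest neighbours of x within the minority class (excluding x itself)
        neighbours = sorted((p for p in minority if p is not x),
                            key=lambda p: dist(x, p))[:k]
        nb = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(tuple(xi + gap * (ni - xi) for xi, ni in zip(x, nb)))
    return synthetic

# Toy minority class of 2-D feature vectors (hypothetical data)
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
new_samples = smote(minority, n_new=4)
print(len(new_samples))
```

Because each synthetic point is a convex combination of two minority samples, the oversampled class stays inside the region spanned by the original data rather than duplicating points exactly, which is what distinguishes SMOTE from naive random oversampling.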
Published in: EMITTER International Journal of Engineering Technology