Monophonic Audio Research Articles

In computer music, instrument recognition is a critical part of sound modeling. Pitch, timbre, loudness, duration, and spatialization are all components of musical sound, and each plays a significant part in determining tonal quality. The other four parameters can be altered, but timbre always poses a challenge [6], so it inevitably takes center stage. Musical instruments are distinguished from one another by their distinct sound quality, independent of pitch or volume. This property might also be used to distinguish between monophonic and polyphonic music recordings. In Music Information Retrieval, classification plays a critical role. Monophonic instrument classification appears in the literature with quite a substantial range of feature and classifier combinations. Polyphonic instrument classification has received fewer references in the literature and remains an area to be explored, specifically in the Indian classical domain. The present paper focuses on exactly this experimentation. Several Indian instruments, among them the flute, harmonium, and sitar, were used to produce training data sets for evaluating the proposed approach. Statistical and spectral features are used to classify Indian musical instruments with Artificial Intelligence-based methods. Hybrid features from multiple domains are extracted to capture essential musical properties. Accuracy is demonstrated through SVM and GMM classification of Indian musical instruments. SVM produces average accuracies of 89% and 91% on monophonic and polyphonic sounds, respectively. According to the results of the studies, GMM outperforms SVM, reaching 96.33% on monophonic recordings and 93.33% on polyphonic recordings.
As future scope, this recognition framework could be linked with an Industrial Internet of Things (IIoT) framework to build a standalone Artificial Intelligence system or application for real-time classification of instruments.
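The abstract does not say which spectral features or which GMM configuration the authors used, so the following is only a minimal sketch of the general idea: extract spectral descriptors from an audio frame, fit one generative model per class, and label a new frame by the highest likelihood. Everything specific here is an assumption made for illustration: the two features (spectral centroid and spread), the single diagonal Gaussian per class (the one-component special case of a GMM, not a full mixture), the synthetic "mellow" and "bright" tones standing in for instrument recordings, and all function and constant names.

```python
import cmath
import math

SAMPLE_RATE = 8000  # Hz, assumed for this toy example
FRAME_LEN = 256     # samples per analysis frame (assumed)

# Hypothetical harmonic amplitude profiles, NOT measured from real instruments.
MELLOW = [1.0, 0.2]
BRIGHT = [1.0, 0.8, 0.7, 0.6, 0.5, 0.4]


def synth_tone(f0, harmonics, sample_rate=SAMPLE_RATE, n=FRAME_LEN):
    """Sum of sinusoids at integer multiples of f0 with the given amplitudes."""
    return [
        sum(a * math.sin(2 * math.pi * (h + 1) * f0 * t / sample_rate)
            for h, a in enumerate(harmonics))
        for t in range(n)
    ]


def spectral_features(frame, sample_rate=SAMPLE_RATE):
    """Return (spectral centroid, spectral spread) in Hz for one frame."""
    n = len(frame)
    # Hann window to limit spectral leakage before the transform.
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * t / n))
                for t, x in enumerate(frame)]
    # Naive DFT magnitudes for bins 0..n/2 (slow, but dependency-free).
    mags = [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(windowed)))
        for k in range(n // 2 + 1)
    ]
    freqs = [k * sample_rate / n for k in range(len(mags))]
    total = sum(mags) or 1e-12
    centroid = sum(f * m for f, m in zip(freqs, mags)) / total
    variance = sum((f - centroid) ** 2 * m for f, m in zip(freqs, mags)) / total
    return (centroid, math.sqrt(variance))


def fit_gaussian(vectors):
    """Fit per-dimension mean and variance (a diagonal Gaussian)."""
    count = len(vectors)
    dims = range(len(vectors[0]))
    means = [sum(v[d] for v in vectors) / count for d in dims]
    variances = [
        max(sum((v[d] - means[d]) ** 2 for v in vectors) / count, 1e-6)
        for d in dims
    ]
    return means, variances


def log_likelihood(vector, model):
    """Log density of a feature vector under a diagonal Gaussian."""
    means, variances = model
    return sum(
        -0.5 * (math.log(2 * math.pi * var) + (x - mu) ** 2 / var)
        for x, mu, var in zip(vector, means, variances)
    )


def train_models(f0_values):
    """Fit one Gaussian per class from tones at the given fundamentals."""
    profiles = {"mellow": MELLOW, "bright": BRIGHT}
    return {
        label: fit_gaussian(
            [spectral_features(synth_tone(f0, profile)) for f0 in f0_values]
        )
        for label, profile in profiles.items()
    }


def classify(vector, models):
    """Pick the class whose model gives the highest log-likelihood."""
    return max(models, key=lambda label: log_likelihood(vector, models[label]))


if __name__ == "__main__":
    models = train_models([200, 250, 300, 350, 400])
    unseen = spectral_features(synth_tone(275, BRIGHT))
    print(classify(unseen, models))
```

The features are pitch-dependent here, which is why the sketch trains over several fundamentals; a real system would work from recordings and would likely use richer, more pitch-robust descriptors and multi-component mixtures.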

This combined fMRI and MEG study investigated brain activations during listening and attending to natural auditory scenes. We first recorded, using in-ear microphones, vocal non-speech sounds, and environmental sounds that were mixed to construct auditory scenes containing two concurrent sound streams. During the brain measurements, subjects attended to one of the streams while spatial acoustic information of the scene was either preserved (stereophonic sounds) or removed (monophonic sounds). Compared to monophonic sounds, stereophonic sounds evoked larger blood-oxygenation-level-dependent (BOLD) fMRI responses in the bilateral posterior superior temporal areas, independent of which stimulus attribute the subject was attending to. This finding is consistent with the functional role of these regions in the (automatic) processing of auditory spatial cues. Additionally, significant differences in the cortical activation patterns depending on the target of attention were observed. Bilateral planum temporale and inferior frontal gyrus were preferentially activated when attending to stereophonic environmental sounds, whereas when subjects attended to stereophonic voice sounds, the BOLD responses were larger at the bilateral middle superior temporal gyrus and sulcus, previously reported to show voice sensitivity. In contrast, the time-resolved MEG responses were stronger for mono- than stereophonic sounds in the bilateral auditory cortices at ~360 ms after the stimulus onset when attending to the voice excerpts within the combined sounds. The observed effects suggest that during the segregation of auditory objects from the auditory background, spatial sound cues together with other relevant temporal and spectral cues are processed in an attention-dependent manner at the cortical locations generally involved in sound recognition. 
More synchronous neuronal activation during monophonic than stereophonic sound processing, as well as (local) neuronal inhibitory mechanisms in the auditory cortex, may explain the simultaneous increase of BOLD responses and decrease of MEG responses. These findings highlight the complementary role of electrophysiological and hemodynamic measures in addressing brain processing of complex stimuli.

Articles published on Monophonic Audio

Cross-modal generative model for visual-guided binaural stereo generation

Using deep learning for recreating binaural audio

Multi-space channel representation learning for mono-to-binaural conversion based audio deepfake detection

Teaching Integration of Piano and Traditional Music Elements in Colleges and Universities Based on Network Flow Optimization

Comparative Study of Musical Timbral Variations: Crescendo and Vibrato Using FFT-Acoustic Descriptor

Exploring the relationships between teacher noticing, ambisonic audio, and variance in focus when viewing 360 video.

Bit rate required for mono audio object in object-based audio program compressed with MPEG-H 3D Audio

Similarity of Musical Timbres Using FFT-Acoustic Descriptor Analysis and Machine Learning

Artificial intelligence-based classification performance evaluation in monophonic and polyphonic indian classical instruments recognition with hybrid domain features amalgamation

Points2Sound: from mono to binaural audio using 3D point cloud scenes

Acoustic diversity of forested landscapes: Relationships to habitat structure and anthropogenic pressure

Binaural Synthetic Aperture Imaging of the Field of Audition as the Head Rotates and Localisation Perception of Monophonic Sound Listened to through Headphones

Binaural audio generation via multi-task learning

Hybrid Feature Based Classifier Performance Evaluation of Monophonic and Polyphonic Indian Classical Instruments Recognition

Identification of Fake Stereo Audio Using SVM and CNN

Monophonic Musical Instrument Sound Classification Using Impulse Response Modeling

MUSIC IN DANTE'S DIVINE COMEDY: To the 700th Anniversary of the Memory

SPICE: Self-Supervised Pitch Estimation

Octave Error Reduction in Pitch Detection Algorithms Using Fourier Series Approximation Method

Attention Modulates the Auditory Cortical Processing of Spatial and Category Cues in Naturalistic Auditory Scenes.
