Genre Detection Research Articles

Computational machine intelligence approaches have enabled a variety of music-centric technologies in support of creating, sharing and interacting with music content. A strong performance on specific downstream application tasks, such as music genre detection and music emotion recognition, is paramount to ensuring broad capabilities for computational music understanding and Music Information Retrieval. Traditional approaches have relied on supervised learning to train models to support these music-related tasks. However, such approaches require copious annotated data and still may only provide insight into one view of music—namely, that related to the specific task at hand. We present a new model for generating audio-musical features that support music understanding, leveraging self-supervision and cross-domain learning. After pre-training using masked reconstruction of musical input features using self-attention bidirectional transformers, output representations are fine-tuned using several downstream music understanding tasks. Results show that the features generated by our multi-faceted, multi-task, music transformer model, which we call M3BERT, tend to outperform other audio and music embeddings on several diverse music-related tasks, indicating the potential of self-supervised and semi-supervised learning approaches toward a more generalized and robust computational approach to modeling music. Our work can offer a starting point for many music-related modeling tasks, with potential applications in learning deep representations and enabling robust technology applications.

Read full abstract

Machine learning methods are extensively used for processing and analysing speech signals by virtue of their performance gains over multiple domains. Deep learning and ensemble learning are the two most commonly used techniques, which results in benchmark performance across different downstream tasks. Ensemble deep learning is a recent development which combines these two techniques to result in a robust architecture having substantial performance gains, as well as better generalization performance over the individual techniques. In this paper, we extensively review the use of ensemble deep learning methods for different speech signal related tasks, ranging from general objectives such as automatic speech recognition and voice activity detection, to more specific areas such as biomedical applications involving the detection of pathological speech or music genre detection. We provide a discussion on the use of different ensemble strategies such as bagging, boosting and stacking in the context of speech signals, and identify the various salient features and advantages from a broader perspective when coupled with deep learning architectures. The main objective of this study is to comprehensively evaluate existing works in the area of ensemble deep learning, and highlight the future directions that may be explored to further develop it as a tool for several speech related tasks. To the best of our knowledge, this is the first review study which primarily focuses on ensemble deep learning for speech applications. This study aims to serve as a valuable resource for researchers in academia and in industry working with speech signals, supporting advanced novel applications of ensemble deep learning models towards solving challenges in existing speech processing systems.

Read full abstract

Genre Detection Research Articles

Related Topics

Articles published on Genre Detection

Creating musical features using multi-faceted, multi-task encoders based on transformers

Ensemble deep learning in speech signal tasks: A review

GenSpecVidOnt: a reference ontology for knowledge based video analytics with multimodal genre detection

Music genre detection using deep learning models

Signal Processing and Machine Learning Approaches and Evaluation for Indian Classical Music

APPLICATION OF DEEP LEARNING ALGORITHMS IN MUSICOLOGY: AN OVERVIEW

Multi-label Movie Genre Detection from a Movie Poster Using Knowledge Transfer Learning

On the influence of low-level visual features in film classification.

Improvised emotion and genre detection for songs through signal processing and genetic algorithm

Stylometric analysis of classical Arabic texts for genre detection

Open set evaluation of web genre identification

Tekstgenres analyseren op lexicale complexiteit met T‑Scan

Learning to recognize webpage genres

Learning Subjective Language

Automatic Text Categorization in Terms of Genre and Author

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Genre Detection Research Articles

Related Topics

Articles published on Genre Detection

Creating musical features using multi-faceted, multi-task encoders based on transformers

Ensemble deep learning in speech signal tasks: A review

GenSpecVidOnt: a reference ontology for knowledge based video analytics with multimodal genre detection

Music genre detection using deep learning models

Signal Processing and Machine Learning Approaches and Evaluation for Indian Classical Music

APPLICATION OF DEEP LEARNING ALGORITHMS IN MUSICOLOGY: AN OVERVIEW

Multi-label Movie Genre Detection from a Movie Poster Using Knowledge Transfer Learning

On the influence of low-level visual features in film classification.

Improvised emotion and genre detection for songs through signal processing and genetic algorithm

Stylometric analysis of classical Arabic texts for genre detection

Open set evaluation of web genre identification

Tekstgenres analyseren op lexicale complexiteit met T‑Scan

Learning to recognize webpage genres

Learning Subjective Language

Automatic Text Categorization in Terms of Genre and Author