Design of Neural Network Model for Cross-Media Audio and Video Score Recognition Based on Convolutional Neural Network Model.

Hongxia Liu

doi:10.1155/2022/4626867

Abstract

In this paper, the residual convolutional neural network is used to extract the note features in the music score image to solve the problem of model degradation; then, multiscale feature fusion is used to fuse the feature information of different levels in the same feature map to enhance the feature representation ability of the model. A network composed of a bidirectional simple loop unit and a chained time series classification function is used to identify notes, parallelizing a large number of calculations, thereby speeding up the convergence speed of training, which also makes the data in the dataset no longer need to be strict with labels. Alignment also reduces the requirements on the dataset. Aiming at the problem that the existing cross-modal retrieval methods based on common subspace are insufficient for mining local consistency within modalities, a cross-modal retrieval method fused with graph convolution is proposed. The K-nearest neighbor algorithm is used to construct modal graphs for samples of different modalities, and the original features of samples from different modalities are encoded through a symmetric graph convolutional coding network and a symmetric multilayer fully connected coding network, and the encoded features are fused and input. We jointly optimize the intramodal semantic constraints and intermodal modality-invariant constraints in the common subspace to learn highly locally consistent and semantically consistent common representations for samples from different modalities. The error value of the experimental results is used to illustrate the effect of parameters such as the number of iterations and the number of neurons on the network. In order to more accurately illustrate that the generated music sequence is very similar to the original music sequence, the generated music sequence is also framed, and finally the music sequence spectrogram and spectrogram are generated. The accuracy of the experiment is illustrated by comparing the spectrogram and the spectrogram, and genre classification predictions are also performed on the generated music to show that the network can generate music of different genres.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational intelligence and neuroscience	Publication Date: Jun 13, 2022
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Design of Neural Network Model for Cross-Media Audio and Video Score Recognition Based on Convolutional Neural Network Model.

Abstract

Talk to us

Similar Papers

More From: Computational intelligence and neuroscience

Lead the way for us

Similar Papers

GrapeNet: A Lightweight Convolutional Neural Network Model for Identification of Grape Leaf Diseases
Jianwu Lin ... Jitong Cai
Agriculture | VOL. 12
Jianwu Lin, et. al.Jianwu Lin ... Jitong Cai
20 Jun 2022
Agriculture | VOL. 12

Public Perception Analysis of Tweets During the 2015 Measles Outbreak: Comparative Study Using Convolutional Neural Network Models.
Jingcheng Du ... Cui Tao
Journal of Medical Internet Research | VOL. 20
Jingcheng Du, et. al.Jingcheng Du ... Cui Tao
09 Jul 2018
Journal of Medical Internet Research | VOL. 20

Artificial intelligence: finding the intersection of predictive modeling and clinical utility
Karthik Ravi
Gastrointestinal Endoscopy | VOL. 93
Karthik RaviKarthik Ravi
07 Mar 2021
Gastrointestinal Endoscopy | VOL. 93

NemaNet: A convolutional neural network model for identification of soybean nematodes
André Abade ... Flávio De Barros Vidal
Biosystems Engineering | VOL. 213
André Abade, et. al.André Abade ... Flávio De Barros Vidal
06 Dec 2021
Biosystems Engineering | VOL. 213

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design of Neural Network Model for Cross-Media Audio and Video Score Recognition Based on Convolutional Neural Network Model.

Abstract

Talk to us

Similar Papers

More From: Computational intelligence and neuroscience