Convolutional Restricted Boltzmann Machine Research Articles

To learn auditory filterbanks, recently, we have proposed an unsupervised learning model based on convolutional restricted Boltzmann machine RBM with rectified linear units. In this paper, theory, training algorithm of our proposed model, and detailed analysis of learned filterbank are being presented. Learning of the model with different databases shows that the model is able to learn cochlear-like impulse responses that are localized in frequency-domain. An auditory-like scale obtained from filterbanks learned from clean and noisy datasets resembles the Mel scale, which is known to mimic perceptually relevant aspect of speech. We have experimented with both cepstral denoted as ConvRBM-CC as well as filterbank features denoted as ConvRBM-BANK. On large vocabulary continuous speech recognition task, we achieved relative improvement of 7.21-17.8% in word error rate WER compared to Mel frequency cepstral coefficient MFCC features and 1.35-6.82% compared to Mel filterbank FBANK features. On AURORA 4 multicondition training database, the relative improvement in WER by 4.8-13.65% was achieved using a Hybrid Deep Neural Network-Hidden Markov Model DNN-HMM system with ConvRBM-CC features. Using ConvRBM-BANK features, we achieve absolute reduction of 1.25-3.85% in WER on AURORA 4 test sets compared to FBANK features. A context-dependent DNN-HMM system further improves performance with a relative improvement of 3.6-4.6% on an average for bigram 5k and tri-gram 5k language models. Hence, our proposed learned filterbank performs better than traditional MFCC and Mel-filterbank features for both clean and multicondition automatic speech recognition ASR tasks. A system combination of ConvRBM-BANK and FBANK features further improve performance in all ASR tasks. Cross-domain experiments where subband filters trained on one database are used for the ASR task of another database show that model learns generalized representations of speech signals.

Read full abstract

Extracting local features from 3D shapes is an important and challenging task that usually requires carefully designed 3D shape descriptors. However, these descriptors are hand-crafted and require intensive human intervention with prior knowledge. To tackle this issue, we propose a novel deep learning model, namely circle convolutional restricted Boltzmann machine (CCRBM), for unsupervised 3D local feature learning. CCRBM is specially designed to learn from raw 3D representations. It effectively overcomes obstacles such as irregular vertex topology, orientation ambiguity on the 3D surface, and rigid or slightly non-rigid transformation invariance in the hierarchical learning of 3D data that cannot be resolved by the existing deep learning models. Specifically, by introducing the novel circle convolution, CCRBM holds a novel ring-like multi-layer structure to learn 3D local features in a structure preserving manner. Circle convolution convolves across 3D local regions via rotating a novel circular sector convolution window in a consistent circular direction. In the process of circle convolution, extra points are sampled in each 3D local region and projected onto the tangent plane of the center of the region. In this way, the projection distances in each sector window are employed to constitute a novel local raw 3D representation called projection distance distribution (PDD). In addition, to eliminate the initial location ambiguity of a sector window, the Fourier transform modulus is used to transform the PDD into the Fourier domain, which is then conveyed to CCRBM. Experiments using the learned local features are conducted on three aspects: global shape retrieval, partial shape retrieval, and shape correspondence. The experimental results show that the learned local features outperform other state-of-the-art 3D shape descriptors.

Read full abstract

Convolutional Restricted Boltzmann Machine Research Articles

Related Topics

Articles published on Convolutional Restricted Boltzmann Machine

Unsupervised modulation filter learning for noise-robust speech recognition

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition

Unsupervised 3D Local Feature Learning by Circle Convolutional Restricted Boltzmann Machine.

A novel feature extraction method for scene recognition based on Centered Convolutional Restricted Boltzmann Machines

Mesh Convolutional Restricted Boltzmann Machines for Unsupervised Learning of Features With Structure Preservation on 3-D Meshes.

Combining Generative and Discriminative Representation Learning for Lung CT Analysis With Convolutional Restricted Boltzmann Machines.

Active semi-supervised learning method with hybrid deep belief networks.

Convolutional Deep Networks for Visual Data Classification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Convolutional Restricted Boltzmann Machine Research Articles

Related Topics

Articles published on Convolutional Restricted Boltzmann Machine

Unsupervised modulation filter learning for noise-robust speech recognition

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition

Unsupervised 3D Local Feature Learning by Circle Convolutional Restricted Boltzmann Machine.

A novel feature extraction method for scene recognition based on Centered Convolutional Restricted Boltzmann Machines

Mesh Convolutional Restricted Boltzmann Machines for Unsupervised Learning of Features With Structure Preservation on 3-D Meshes.

Combining Generative and Discriminative Representation Learning for Lung CT Analysis With Convolutional Restricted Boltzmann Machines.

Active semi-supervised learning method with hybrid deep belief networks.

Convolutional Deep Networks for Visual Data Classification