Stochastic Variational Inference Research Articles

Abstract: Significant attention has recently been paid to deep learning as a method for improved catchment modeling. Compared with process‐based models, deep learning is often criticized for its lack of interpretability. One solution is to combine a process‐based hydrological model with a residual error model based on deep learning so that each contributes its respective strengths. In classical residual error models, Bayesian inference via Markov chain Monte Carlo (MCMC) is commonly used to estimate the uncertainty. However, deep neural networks tend to have very large numbers of parameters, which makes MCMC an unsuitable approach. Here, we introduce an alternative to Bayesian MCMC sampling called stochastic variational inference (SVI), which has recently been developed for Bayesian deep learning in Natural Language Processing. We implement SVI in a Long Short‐Term Memory (LSTM) network and construct residual error models for process‐based hydrological models. This approach is examined in two catchments from China with contrasting geographical and climatic characteristics, the Tangnaihai catchment and the Shiquan catchment. Compared with the Bayesian linear regression model, the Bayesian LSTM provides better uncertainty estimates. Specifically, the proposed method improves the Continuous Ranked Probability Score (CRPS) by over 10% in both catchments. In the Tangnaihai catchment, it provides more than 10% narrower uncertainty intervals in terms of Sharpness with slightly superior Reliability. In the Shiquan catchment, it provides comparable uncertainty intervals with better Reliability. Further, our study highlights the scalability of SVI to high‐dimensional parameter spaces in hydrological applications (e.g., distributed hydrological models, groundwater models).
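
The abstract reports gains in the Continuous Ranked Probability Score (CRPS) but does not say how the score is computed. As a rough illustration only, the sketch below uses the common sample-based estimator CRPS ≈ E|X − y| − ½·E|X − X′| for a predictive ensemble X and an observation y. The function name `crps_ensemble` and the example values are assumptions made for this sketch and are not taken from the paper; the authors may have used a different estimator.

```python
def crps_ensemble(samples, observation):
    """Sample-based CRPS estimate for one observation (lower is better).

    Assumed estimator: mean |x_i - y| minus half the mean |x_i - x_j|
    over all ordered pairs of ensemble members.
    """
    n = len(samples)
    if n == 0:
        raise ValueError("samples must not be empty")
    # Average distance between ensemble members and the observation.
    accuracy_term = sum(abs(x - observation) for x in samples) / n
    # Average distance between pairs of ensemble members (spread).
    spread_term = sum(abs(a - b) for a in samples for b in samples) / (n * n)
    return accuracy_term - 0.5 * spread_term


# Hypothetical predictive samples for a single time step.
score = crps_ensemble([0.0, 2.0], 1.0)
```

With a single ensemble member the spread term vanishes and the estimate reduces to the absolute error, which is one reason CRPS is often read as a probabilistic generalization of mean absolute error.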

Deep convolutional neural networks have shown great potential in image recognition tasks. However, the difficulty of explaining the mechanism of deep learning hinders its development. Deep learning involves learning a large number of parameters, which results in high computational complexity. Moreover, deep convolutional neural networks are prone to overfitting when the number of training samples is limited. Conversely, kernel learning methods have a clear mathematical theory, fewer parameters, and can contend with small sample sizes; however, they are not able to handle high-dimensional data, e.g., images. It is important to achieve a trade-off between performance and complexity in complicated tasks. In this paper, we propose a novel scalable deep convolutional random kernel learning in Gaussian process architecture called SDCRKL-GP, which is characterized by excellent performance and low complexity. First, we incorporated the deep convolutional architecture into kernel learning by implementing the random Fourier feature transform for Gaussian processes, which can effectively capture hierarchical and local image-level features. This approach enabled the kernel method to handle image processing problems effectively. Second, we optimized the parameters of deep convolutional filters and Gaussian kernels by stochastic variational inference. Then, we derived the variational lower bound of the marginal likelihood. Finally, we explored a design space selection method to determine the appropriate network architecture for different datasets. The design space consists of the number of layers, the channels per layer, and so on. Different design space selections improved the scalability of the SDCRKL-GP architecture. We evaluated SDCRKL-GP on the MNIST, FMNIST, CIFAR10, and CALTECH4 benchmark datasets. Taking MNIST as an example, the classification error rate is 0.60%, and the number of parameters, number of computations, and memory access cost of the architecture are 19.088k, 0.984M, and 1.057M, respectively. The experimental results verified that the proposed SDCRKL-GP method outperforms several state-of-the-art algorithms in both accuracy and speed in image recognition tasks. The code is available at https://github.com/w-tingting/deep-rff-pytorch.
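
The random Fourier feature transform mentioned in the abstract can be illustrated in its basic, non-convolutional form. The sketch below is not the SDCRKL-GP architecture and is not taken from the linked repository; it only shows, under the assumption of a Gaussian (RBF) kernel with a single length scale, how an inner product of random cosine features approximates the kernel value. The names `make_rff` and `rbf_kernel` are hypothetical.

```python
import math
import random


def make_rff(dim, num_features, lengthscale, seed=0):
    """Return a feature map z(x) with z(x) . z(y) ~= RBF kernel k(x, y)."""
    rng = random.Random(seed)
    # Frequencies drawn from the spectral density of the RBF kernel,
    # i.e. a zero-mean Gaussian with standard deviation 1 / lengthscale.
    weights = [[rng.gauss(0.0, 1.0 / lengthscale) for _ in range(dim)]
               for _ in range(num_features)]
    # Random phase offsets drawn uniformly from [0, 2*pi).
    offsets = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
    scale = math.sqrt(2.0 / num_features)

    def features(x):
        return [scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
                for w, b in zip(weights, offsets)]

    return features


def rbf_kernel(x, y, lengthscale):
    """Exact Gaussian kernel exp(-||x - y||^2 / (2 * lengthscale^2))."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2.0 * lengthscale ** 2))


# Compare the random-feature approximation with the exact kernel value.
features = make_rff(dim=3, num_features=5000, lengthscale=1.0, seed=0)
x, y = [0.1, 0.2, 0.3], [0.4, 0.0, -0.1]
approx = sum(a * b for a, b in zip(features(x), features(y)))
exact = rbf_kernel(x, y, lengthscale=1.0)
```

The approximation error shrinks roughly as one over the square root of the number of features, so the feature count trades accuracy against the parameter and computation budgets that the abstract emphasizes.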

Articles published on Stochastic Variational Inference

Online Gaussian Process State-space Model: Learning and Planning for Partially Observable Dynamical Systems

Infinite Switching Dynamic Probabilistic Network With Bayesian Nonparametric Learning

Stratified Stochastic Variational Inference for High-Dimensional Network Factor Model

Variational Bayesian Approach to Condition-Invariant Feature Extraction for Visual Place Recognition

Bayesian LSTM With Stochastic Variational Inference for Estimating Model Uncertainty in Process‐Based Hydrological Models

Online Downlink Multi-User Channel Estimation for mmWave Systems Using Bayesian Neural Network

Stochastic variational inference for probabilistic optimal power flows

Scalable clustering of segmented trajectories within a continuous time framework: application to maritime traffic data

Deep Gaussian process models for integrating multifidelity experiments with nonstationary relationships

Simultaneous inference of periods and period-luminosity relations for Mira variable stars

SDCRKL-GP: Scalable deep convolutional random kernel learning in Gaussian process for image recognition

Synergetic Learning of Heterogeneous Temporal Sequences for Multi-Horizon Probabilistic Forecasting

Deep Switching Auto-Regressive Factorization: Application to Time Series Forecasting

Robust Deep Gaussian Process-Based Probabilistic Electrical Load Forecasting Against Anomalous Events

Tracking Disease Outbreaks from Sparse Data with Bayesian Inference

Large scale multi-label learning using Gaussian processes

Investigating hypotheses of neurodegeneration by learning dynamical systems of protein propagation in the brain

Two-step hybrid collaborative filtering using deep variational Bayesian autoencoders

Large-Scale Heteroscedastic Regression via Gaussian Process.

Learning Waveform-Based Acoustic Models Using Deep Variational Convolutional Neural Networks
