Stochastic Gradient Estimation Research Articles

Summary The popularity of intelligent wells (I-wells), which provide layer-by-layer monitoring and control capability of production and injection, is growing. However, the number of available techniques for optimal control of I-wells is limited (Sarma et al. 2006; Alghareeb et al. 2009; Almeida et al. 2010; Grebenkin and Davies 2012). Currently, most of the I-wells that are equipped with interval control valves (ICVs) are operated to enhance the current production and to resolve problems associated with breakthrough of the unfavorable phase. This reactive strategy is unlikely to deliver the long-term optimum production. On the other side, the proactive-control strategy of I-wells, with its ambition to provide the optimum control for the entire well's production life, has the potential to maximize the cumulative oil production. This strategy, however, results in a high-dimensional, nonlinear, and constrained optimization problem. This study provides guidelines on selecting a suitable proactive optimization approach, by use of state-of-the-art stochastic gradient-approximation algorithms. A suitable optimization approach increases the practicality of proactive optimization for real field models under uncertain operational and subsurface conditions. We evaluate the simultaneous-perturbation stochastic approximation (SPSA) method (Spall 1992) and the ensemble-based optimization (EnOpt) method (Chen et al. 2009). In addition, we present a new derivation of the EnOpt by use of the concept of directional derivatives. The numerical results show that both SPSA and EnOpt methods can provide a fast solution to a large-scale and multiple I-well proactive optimization problem. A criterion for tuning the algorithms is proposed and the performance of both methods is compared for several test cases. The used methodology for estimating the gradient is shown to affect the application area of each algorithm. SPSA provides a rough estimate of the gradient and performs better in search environments, characterized by several local optima, especially with a large ensemble size. EnOpt was found to provide a smoother estimation of the gradient, resulting in a more-robust algorithm to the choice of the tuning parameters, and a better performance with a small ensemble size. Moreover, the final optimum operation obtained by EnOpt is smoother. Finally, the obtained criteria are used to perform proactive optimization of ICVs in a real field.

Read full abstract

Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is influenced by an environmental signal, termed a reward, which directs the changes in appropriate directions. We model a network of spiking neurons as a Partially Observed Markov Decision Process (POMDP) and apply a recently introduced policy learning algorithm from Machine Learning to the network [1]. Based on computing a stochastic gradient approximation of the average reward, we derive a plasticity rule falling in the class of Spike Time Dependent Plasticity (STDP) rules, which ensures convergence to a local maximum of the average reward. The approach is applicable to a broad class of neuronal models, including the Hodgkin-Huxley model. The obtained update rule is based on the correlation between the reward signal and local data available at the synaptic site. This data depends on local activity (e.g., pre and post synaptic spikes) and requires mechanisms that are available at the cellular level. Simulations on several toy problems demonstrate the utility of the approach. Like most stochastic gradient based methods, the convergence rate is slow, even though the percentage of convergence to global maxima is high. Additionally, through statistical analysis we show that the synaptic plasticity rule established is closely related to the widely used BCM rule [2], for which good biological evidence exists. The relation to the BCM rule captures the nature of the relation between pre and post synaptic spiking rates, and in particular the self-regularizing nature of the BCM rule. Compared to previous work in this field, our model is more realistic than the one used in [3], and the derivation of the update rule applies to a broad class of voltage based neuronal models, eliminating some of the additional statistical assumptions required in [4]. Finally, the connection between Reinforcement Learning and the BCM rule is, to the best of our knowledge, new.

Read full abstract

Stochastic Gradient Estimation Research Articles

Articles published on Stochastic Gradient Estimation

The auxiliary model based hierarchical gradient algorithms and convergence analysis using the filtering technique

Proactive Optimization of Intelligent-Well Production Using Stochastic Gradient-Based Algorithms

Stochastic variational inference for large-scale discrete choice models using adaptive batch sizes

A Stochastic Variational Framework for Fitting and Diagnosing Generalized Linear Mixed Models

STOCHASTIC GRADIENT METHODS FOR UNCONSTRAINED OPTIMIZATION

Integrated Traffic and Emission Simulation: a Model Calibration Approach Using Aggregate Information

Enhancing Stochastic Kriging Metamodels with Gradient Estimators

Maximum likelihood stochastic gradient estimation for Hammerstein systems with colored noise based on the key term separation technique

Blind Adaptive Constrained Constant-Modulus Reduced-Rank Interference Suppression Algorithms Based on Interpolation and Switched Decimation

Efficient price sensitivity estimation of financial derivatives by weak derivatives

Quaternion-Valued Stochastic Gradient-Based Adaptive IIR Filtering

Performance analysis of the auxiliary models based multi-innovation stochastic gradient estimation algorithm for output error systems

A ROBUST ENSEMBLE LEARNING USING ZERO-ONE LOSS FUNCTION

Direct reinforcement learning, spike time dependent plasticity and the BCM rule

Stochastic gradient algorithms for design of minimum error-rate linear dispersion codes in MIMO wireless systems

Model-Based Search for Combinatorial Optimization: A Critical Survey

Constrained SPSA controller for operations processes

A manufacturing system with general stationary failure process: stability and IPA of hedging control policies

Stochastic Approximation Algorithm eith Gradient Averaging and On-Line Stepsize Rules

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Stochastic Gradient Estimation Research Articles

Articles published on Stochastic Gradient Estimation

The auxiliary model based hierarchical gradient algorithms and convergence analysis using the filtering technique

Proactive Optimization of Intelligent-Well Production Using Stochastic Gradient-Based Algorithms

Stochastic variational inference for large-scale discrete choice models using adaptive batch sizes

A Stochastic Variational Framework for Fitting and Diagnosing Generalized Linear Mixed Models

STOCHASTIC GRADIENT METHODS FOR UNCONSTRAINED OPTIMIZATION

Integrated Traffic and Emission Simulation: a Model Calibration Approach Using Aggregate Information

Enhancing Stochastic Kriging Metamodels with Gradient Estimators

Maximum likelihood stochastic gradient estimation for Hammerstein systems with colored noise based on the key term separation technique

Blind Adaptive Constrained Constant-Modulus Reduced-Rank Interference Suppression Algorithms Based on Interpolation and Switched Decimation

Efficient price sensitivity estimation of financial derivatives by weak derivatives

Quaternion-Valued Stochastic Gradient-Based Adaptive IIR Filtering

Performance analysis of the auxiliary models based multi-innovation stochastic gradient estimation algorithm for output error systems

A ROBUST ENSEMBLE LEARNING USING ZERO-ONE LOSS FUNCTION

Direct reinforcement learning, spike time dependent plasticity and the BCM rule

Stochastic gradient algorithms for design of minimum error-rate linear dispersion codes in MIMO wireless systems

Model-Based Search for Combinatorial Optimization: A Critical Survey

Constrained SPSA controller for operations processes

A manufacturing system with general stationary failure process: stability and IPA of hedging control policies

Stochastic Approximation Algorithm eith Gradient Averaging and On-Line Stepsize Rules