AEROSPACE engineering applications greatly stimulated the development of optimal control theory during the 1950s and 1960s, where the objective was to drive the system states in such a way that some defined cost was minimized. This turned out to have very useful applications in the design of regulators (where some steady state is to be maintained) and in tracking control strategies (where some predetermined state trajectory is to be followed). Among such applications was the problem of optimal flight trajectories for aircraft and space vehicles. Linear optimal control theory in particular has been very well documented and widely applied, where the plant that is controlled is assumed linear and the feedback controller is constrained to be linear with respect to its input. However, the availability of powerful low-cost microprocessors has spurred great advances in the theory and applications of nonlinear control. The competitive era of rapid technological change, particularly in aerospace exploration, now demands stringent accuracy and cost requirements in nonlinear control systems. This has motivated the rapid development of nonlinear control theory for application to challenging, complex, dynamical real-world problems, particularly those that bear major practical significance in the aerospace, marine, and defense industries.

Infinite-time horizon nonlinear optimal control (ITHNOC) presents a viable option for synthesizing stabilizing controllers for nonlinear systems by making a state-input tradeoff, where the objective is to minimize the cost given by a performance index. The original theory of nonlinear optimal control dates from the 1960s, and various theoretical and practical aspects of the problem have been addressed in the literature over the decades since. In particular, the continuous-time nonlinear deterministic optimal control problem associated with autonomous (time-invariant) nonlinear regulator systems that are affine (linear) in the controls has been studied by many authors. The long-established theory of optimal control offers quite mature and well-documented techniques for solving this control-affine nonlinear optimization problem, based on dynamic programming or the calculus of variations, but their application is generally a very tedious task. Bellman's dynamic programming approach reduces to solving a nonlinear first-order partial differential equation (PDE), the Hamilton–Jacobi–Bellman (HJB) equation. The solution to the HJB equation gives the optimal performance/cost value (or storage) function and, under some smoothness assumptions, determines an optimal control in feedback form. Alternatively, in the classical calculus of variations, optimal control problems can be characterized locally in terms of the Hamiltonian dynamics arising from Pontryagin's minimum principle. These are the characteristic equations of the HJB PDE, which result in a nonlinear, constrained two-point boundary value problem (TPBVP) that, in general, can only be solved by successive approximation of the optimal control input using iterative numerical techniques for each set of initial conditions. Numerically, even though the nonlinear TPBVP is somewhat easier to solve than the HJB PDE, the control signals can only be determined offline and are thus best suited for feedforward control of plants for which the state trajectories are known a priori. Therefore, contrary to the dynamic programming approach, the resultant control law is not generally in feedback form.
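To fix ideas, the following is a minimal sketch of the control-affine infinite-horizon problem and the associated HJB equation described above; the symbols f, g, q, R, V, H, and \lambda are generic placeholders introduced here for illustration and are not taken from the original text:
\[
\dot{x} = f(x) + g(x)\,u, \qquad
J(x_0,u) = \int_0^{\infty} \big( q(x) + u^{\top} R(x)\, u \big)\, dt ,
\]
\[
\min_{u}\Big\{ q(x) + u^{\top} R(x)\, u + \frac{\partial V}{\partial x}\big( f(x) + g(x)\, u \big) \Big\} = 0 , \qquad V(0) = 0 ,
\]
whose pointwise minimization (assuming a differentiable value function V and R(x) > 0) gives the feedback law
\[
u^{*}(x) = -\tfrac{1}{2}\, R(x)^{-1} g(x)^{\top} \Big( \frac{\partial V}{\partial x}(x) \Big)^{\!\top} .
\]
Likewise, the Hamiltonian H(x,\lambda,u) = q(x) + u^{\top} R(x)\,u + \lambda^{\top}\big(f(x) + g(x)\,u\big) yields, via Pontryagin's minimum principle, the characteristic (state-costate) equations \dot{x} = \partial H/\partial \lambda^{\top} and \dot{\lambda} = -\,\partial H/\partial x^{\top}, with split boundary conditions on x(0) and \lambda; this is the two-point boundary value problem referred to above.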
Open-loop control, however, is sensitive to random disturbances and requires that the initial state be on the optimal trajectory. In contrast, nonlinear optimal feedback has inherent robustness properties (inherent in the sense that they are obtained even though the design ignores uncertainty and disturbances). The principal difficulty with the HJB approach is that no efficient algorithm is available to solve the PDE when it is nonlinear and the problem dimension is high, making it impossible to derive exact expressions for optimal controls for most nontrivial problems of interest. The optimal control can only be computed in special cases, such as linear dynamics with quadratic cost, or very low-dimensional systems. In particular, if the plant is linear time invariant (LTI) and the (infinite-time) performance index is quadratic, then the corresponding HJB equation for this famous linear-quadratic regulator (LQR) problem reduces to an algebraic Riccati equation (ARE). In contrast to the well-developed and widely applied theory and computational tools for the Riccati equation (for example, see [1]), the HJB equation is difficult, if not impossible, to solve for most practical applications; the exact solution for the optimal control policies is very complex.
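As a concrete illustration of the LQR special case just described, the following minimal numerical sketch solves the ARE with SciPy's standard Riccati solver; the double-integrator plant and the weighting matrices Q and R are illustrative choices and do not come from the original text:

# Illustrative LQR example: for an LTI plant and quadratic cost, the HJB
# equation reduces to the algebraic Riccati equation (ARE), which standard
# numerical tools solve directly.
import numpy as np
from scipy.linalg import solve_continuous_are

# Double-integrator plant x_dot = A x + B u (illustrative choice).
A = np.array([[0.0, 1.0],
              [0.0, 0.0]])
B = np.array([[0.0],
              [1.0]])

# Quadratic performance index weights (illustrative choice).
Q = np.eye(2)
R = np.array([[1.0]])

# Solve A'P + P A - P B R^{-1} B' P + Q = 0 for P, so that V(x) = x' P x is
# the optimal value function and u = -K x is the optimal feedback.
P = solve_continuous_are(A, B, Q, R)
K = np.linalg.solve(R, B.T @ P)

print("P =\n", P)
print("K =", K)
# The closed-loop matrix A - B K should be Hurwitz (all eigenvalues in the
# open left half-plane), confirming that the optimal feedback stabilizes the plant.
print("closed-loop eigenvalues:", np.linalg.eigvals(A - B @ K))

No comparably general solver exists for the nonlinear HJB PDE itself, which is the difficulty emphasized above.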