Finite-horizon Performance Research Articles

This paper addresses the problem of sensitivity analysis for finite-horizon performance measures of general Markov chains. We derive closed-form expressions and associated unbiased gradient estimators for the derivatives of finite products of Markov kernels by measure-valued differentiation (MVD). In the MVD setting, the derivatives of Markov kernels, called $\mathcal{D}$ -derivatives, are defined with respect to a class of performance functions $\mathcal{D}$ such that, for any performance measure $g\in\mathcal{D}$ , the derivative of the integral of g with respect to the one-step transition probability of the Markov chain exists. The MVD approach (i) yields results that can be applied to performance functions out of a predefined class, (ii) allows for a product rule of differentiation, that is, analyzing the derivative of the transition kernel immediately yields finite-horizon results, (iii) provides an operator language approach to the differentiation of Markov chains and (iv) clearly identifies the trade-off between the generality of the performance classes that can be analyzed and the generality of the classes of measures (Markov kernels). The $\mathcal{D}$ -derivative of a measure can be interpreted in terms of various (unbiased) gradient estimators and the product rule for $\mathcal {D}$ -differentiation yields a product-rule for various gradient estimators.

Probabilistic Boolean Networks (PBN's) have been recently introduced as a rule-based paradigm for modeling gene regulatory networks. Such networks, which form a subclass of Markovian Genetic Regulatory Networks, provide a convenient tool for studying interactions between different genes while allowing for uncertainty in the knowledge of these relationships. This paper deals with the issue of control in probabilistic Boolean networks. More precisely, given a general Markovian Genetic Regulatory Network whose state transition probabilities depend on an external (control) variable, the paper develops a procedure by which one can choose the sequence of control actions that minimize a given performance index over a finite number of steps. The procedure is based on the theory of controlled Markov chains and makes use of the classical technique of Dynamic Programming. The choice of the finite horizon performance index is motivated by cancer treatment applications where one would ideally like to intervene only over a finite time horizon, then suspend treatment and observe the effects over some additional time before deciding if further intervention is necessary. The undiscounted finite horizon cost minimization problem considered here is the simplest one to formulate and solve, and is selected mainly for clarity of exposition, although more complicated costs could be used, provided appropriate technical conditions are satisfied.

Finite-horizon Performance Research Articles

Related Topics

Articles published on Finite-horizon Performance

Q-learning based tracking control with novel finite-horizon performance index

Finite horizon robust synthesis using integral quadratic constraints

Multi‐input control design for a constrained bilinear biquadratic regulator with external excitation

Guaranteed performance control of switched linear systems: A differential-Riccati-equation-based approach

On the rate of convergence to equilibrium for two-sided reflected Brownian motion and for the Ornstein–Uhlenbeck process

Finite-Length Linear Schemes for Joint Source-Channel Coding over Gaussian Broadcast Channels with Feedback

Separated Design of Encoder and Controller for Networked Linear Quadratic Optimal Control

Portfolios and risk premia for the long run

Dynamic output feedback robust model predictive control

Portfolios and Risk Premia for the Long Run

Nonparametric nearest neighbor based empirical portfolio selection strategies

Measure-Valued Differentiation for Markov Chains

External Control in Markovian Genetic Regulatory Networks

A Differential Calculus for Random Matrices with Applications to (max, +)-Linear Stochastic Systems

Linear nonquadratic optimal control

Receding horizon H tracking control for time-varying discrete linear systems

First- and second-derivative estimators for cyclic closed-queueing networks

Synthesis of suboptimal H∞controllers over a finite horizon

Worst-case optimal control over a finite horizon

Computation of infimalH∞ norm over a finite horizon

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Finite-horizon Performance Research Articles

Related Topics

Articles published on Finite-horizon Performance

Q-learning based tracking control with novel finite-horizon performance index

Finite horizon robust synthesis using integral quadratic constraints

Multi‐input control design for a constrained bilinear biquadratic regulator with external excitation

Guaranteed performance control of switched linear systems: A differential-Riccati-equation-based approach

On the rate of convergence to equilibrium for two-sided reflected Brownian motion and for the Ornstein–Uhlenbeck process

Finite-Length Linear Schemes for Joint Source-Channel Coding over Gaussian Broadcast Channels with Feedback

Separated Design of Encoder and Controller for Networked Linear Quadratic Optimal Control

Portfolios and risk premia for the long run

Dynamic output feedback robust model predictive control

Portfolios and Risk Premia for the Long Run

Nonparametric nearest neighbor based empirical portfolio selection strategies

Measure-Valued Differentiation for Markov Chains

External Control in Markovian Genetic Regulatory Networks

A Differential Calculus for Random Matrices with Applications to (max, +)-Linear Stochastic Systems

Linear nonquadratic optimal control

Receding horizon H tracking control for time-varying discrete linear systems

First- and second-derivative estimators for cyclic closed-queueing networks

Synthesis of suboptimal H∞controllers over a finite horizon

Worst-case optimal control over a finite horizon

Computation of infimalH∞ norm over a finite horizon