This work investigates a different point of view on Markov decision processes by reinterpreting them as a randomized shortest paths problem on a bipartite graph, thereby establishing links with entropy-regularized reinforcement learning. In this bipartite graph, the set of states forms the “left” nodes and the set of actions the “right” nodes. In that context, the action-to-state transition probabilities are provided by the environment, whereas the state-to-action probabilities correspond to the (stochastic) policy to be found. The randomized shortest paths formalism (minimizing the expected cost to the goal state subject to a Shannon or Tsallis relative entropy regularization) is then readily applied to this bipartite structure, providing a possibly sparse stochastic policy that interpolates between a least-cost and a purely random policy. The algorithm computing the policy is closely related to the dual linear programming formulation of the Markov decision process, to which the relative entropy regularization term, multiplied by a scaling factor balancing exploitation and exploration (the temperature), is added. It is derived from well-known techniques of discrete optimal control, relying on the backward computation of costates (Lagrange multipliers). In summary, the proposed algorithm allows the design of optimal stochastic, yet still sparse, policies ranging from purely rational to purely random behavior, depending on the temperature parameter.
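As an illustrative sketch of the Shannon-entropy variant described above (with assumed notation: $c(s_t,a_t)$ for the immediate cost, $T>0$ for the temperature, and $\pi_0$ for a reference, e.g. uniform, policy), the regularized objective can be written as
\[
\min_{\pi}\;\;
\mathbb{E}_{\pi}\!\Big[\sum_{t\ge 0} c(s_t,a_t)\Big]
\;+\;
T\,\mathbb{E}_{\pi}\!\Big[\sum_{t\ge 0} \log \frac{\pi(a_t \mid s_t)}{\pi_0(a_t \mid s_t)}\Big],
\]
where the expectation is taken over trajectories reaching the goal state, the first term is the expected cost to the goal, and the second is the relative entropy (Kullback-Leibler) regularizer. Letting $T \to 0$ recovers the least-cost (purely rational) policy, while a large $T$ drives the policy toward the reference (purely random) behavior.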