KL Divergence Research Articles

Active inference is a theory of perception, learning, and decision making that can be applied to neuroscience, robotics, psychology, and machine learning. Recently, intensive research has been taking place to scale up this framework using Monte Carlo tree search and deep learning. The goal of this activity is to solve more complicated tasks using deep active inference. First, we review the existing literature and then progressively build a deep active inference agent as follows: we (1) implement a variational autoencoder (VAE), (2) implement a deep hidden Markov model (HMM), and (3) implement a deep critical hidden Markov model (CHMM). For the CHMM, we implemented two versions, one minimizing expected free energy, CHMM[EFE] and one maximizing rewards, CHMM[reward]. Then we experimented with three different action selection strategies: the ε-greedy algorithm as well as softmax and best action selection. According to our experiments, the models able to solve the dSprites environment are the ones that maximize rewards. On further inspection, we found that the CHMM minimizing expected free energy almost always picks the same action, which makes it unable to solve the dSprites environment. In contrast, the CHMM maximizing reward keeps on selecting all the actions, enabling it to successfully solve the task. The only difference between those two CHMMs is the epistemic value, which aims to make the outputs of the transition and encoder networks as close as possible. Thus, the CHMM minimizing expected free energy repeatedly picks a single action and becomes an expert at predicting the future when selecting this action. This effectively makes the KL divergence between the output of the transition and encoder networks small. Additionally, when selecting the action down the average reward is zero, while for all the other actions, the expected reward will be negative. Therefore, if the CHMM has to stick to a single action to keep the KL divergence small, then the action down is the most rewarding. We also show in simulation that the epistemic value used in deep active inference can behave degenerately and in certain circumstances effectively lose, rather than gain, information. As the agent minimizing EFE is not able to explore its environment, the appropriate formulation of the epistemic value in deep active inference remains an open question.

Read full abstract

A characteristic feature of “quantum chaotic” systems is that their eigenspectra and eigenstates display universal statistical properties described by random matrix theory (RMT). However, eigenstates of local systems also encode structure beyond RMT. To capture this feature, we introduce a framework that allows us to compare the properties of eigenstates in local systems with those of pure random states. In particular, our framework defines a notion of distance between quantum state ensembles that utilizes the Kullback-Leibler divergence to compare the microcanonical distribution of entanglement entropy (EE) of eigenstates with a reference RMT distribution generated by pure random states (with appropriate constraints). This notion gives rise to a quantitative metric for quantum chaos that not only accounts for averages of the distributions but also higher moments. The differences in moments are compared on a highly resolved scale set by the standard deviation of the RMT distribution, which is exponentially small in system size. As a result, the metric can distinguish between chaotic and integrable behaviors and, in addition, quantify and compare the of chaos (in terms of proximity to RMT behavior) between two systems that are assumed to be chaotic. We implement our framework in local, minimally structured, Floquet random circuits, as well as a canonical family of many-body Hamiltonians, the mixed-field Ising model (MFIM). Importantly, for Hamiltonian systems, we find that the reference random distribution must be appropriately constrained to incorporate the effect of energy conservation in order to describe the ensemble properties of midspectrum eigenstates. The metric captures deviations from RMT across all models and parameters, including those that have been previously identified as strongly chaotic, and for which other diagnostics of chaos such as level spacing statistics look strongly thermal. In Floquet circuits, the dominant source of deviations is the second moment of the distribution, and this persists for all system sizes. For the MFIM, we find significant variation of the KL divergence in parameter space. Notably, we find a small region where deviations from RMT are minimized, suggesting that “maximally chaotic” Hamiltonians may exist in fine-tuned pockets of parameter space. Published by the American Physical Society 2024

Read full abstract

KL Divergence Research Articles

Related Topics

Articles published on KL Divergence

Sonar image segmentation using a multi-spatial information constraint fuzzy C-means clustering algorithm based on KL divergence

Trainability barriers and opportunities in quantum generative modeling

Augmented ELBO regularization for enhanced clustering in variational autoencoders

Deconstructing Deep Active Inference: A Contrarian Information Gatherer.

BEATRICE: Bayesian fine-mapping from summary data using deep variational inference.

Detection and Mitigation of Label-Flipping Attacks in FL Systems With KL Divergence

Diffusion model conditioning on gaussian mixture model and negative gaussian mixture gradient

Mix-DDPM: Enhancing Diffusion Models through Fitting Mixture Noise with Global Stochastic Offset

Suppressed possibilistic fuzzy c-means clustering based on shadow sets for noisy data with imbalanced sizes

3D mineral prospectivity modeling using deep adaptation network transfer learning: A case study of the Xiadian gold deposit, Eastern China

BEATRICE: Bayesian Fine-mapping from Summary Data using Deep Variational Inference.

Disturbances of thalamus and prefrontal cortex contribute to cognitive aging: A structure-function coupling analysis based on KL divergence

GAP: A group-based automatic pruning algorithm via convolution kernel fusion

Fund transfer fraud detection: Analyzing irregular transactions and customer relationships with self-attention and graph neural networks

Learning the feature distribution similarities for online time series anomaly detection

Intrinsic Rewards for Exploration Without Harm From Observational Noise: A Simulation Study Based on the Free Energy Principle.

Multi-modal incomplete label information three-way bidirectional decision-making: Applications of disease assessment

Quantifying Quantum Chaos through Microcanonical Distributions of Entanglement

An uncertainty-aware domain adaptive semantic segmentation framework

IoT traffic classification and anomaly detection method based on deep autoencoders

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

KL Divergence Research Articles

Related Topics

Articles published on KL Divergence

Sonar image segmentation using a multi-spatial information constraint fuzzy C-means clustering algorithm based on KL divergence

Trainability barriers and opportunities in quantum generative modeling

Augmented ELBO regularization for enhanced clustering in variational autoencoders

Deconstructing Deep Active Inference: A Contrarian Information Gatherer.

BEATRICE: Bayesian fine-mapping from summary data using deep variational inference.

Detection and Mitigation of Label-Flipping Attacks in FL Systems With KL Divergence

Diffusion model conditioning on gaussian mixture model and negative gaussian mixture gradient

Mix-DDPM: Enhancing Diffusion Models through Fitting Mixture Noise with Global Stochastic Offset

Suppressed possibilistic fuzzy c-means clustering based on shadow sets for noisy data with imbalanced sizes

3D mineral prospectivity modeling using deep adaptation network transfer learning: A case study of the Xiadian gold deposit, Eastern China

BEATRICE: Bayesian Fine-mapping from Summary Data using Deep Variational Inference.

Disturbances of thalamus and prefrontal cortex contribute to cognitive aging: A structure-function coupling analysis based on KL divergence

GAP: A group-based automatic pruning algorithm via convolution kernel fusion

Fund transfer fraud detection: Analyzing irregular transactions and customer relationships with self-attention and graph neural networks

Learning the feature distribution similarities for online time series anomaly detection

Intrinsic Rewards for Exploration Without Harm From Observational Noise: A Simulation Study Based on the Free Energy Principle.

Multi-modal incomplete label information three-way bidirectional decision-making: Applications of disease assessment

Quantifying Quantum Chaos through Microcanonical Distributions of Entanglement

An uncertainty-aware domain adaptive semantic segmentation framework

IoT traffic classification and anomaly detection method based on deep autoencoders