Bisimulation Metrics Research Articles

How to accurately learn task-relevant state representations from high-dimensional observations with visual distractions is a realistic and challenging problem in visual reinforcement learning. Recently, unsupervised representation learning methods based on bisimulation metrics, contrast, prediction, and reconstruction have shown the ability for task-relevant information extraction. However, due to the lack of appropriate mechanisms for the extraction of task information in the prediction, contrast, and reconstruction-related approaches and the limitations of bisimulation-related methods in domains with sparse rewards, it is still difficult for these methods to be effectively extended to environments with distractions. To alleviate these problems, in the paper, the action sequences, which contain task-intensive signals, are incorporated into representation learning. Specifically, we propose a Sequential Action–induced invariant Representation (SAR) method, which decouples the controlled part (i.e., task-relevant information) and the uncontrolled part (i.e., task-irrelevant information) in noisy observations through sequential actions, thereby extracting effective representations related to decision tasks. To achieve it, the characteristic function of the action sequence’s probability distribution is modeled to specifically optimize the state encoder. We conduct extensive experiments on the distracting DeepMind Control suite while achieving the best performance over strong baselines. We also demonstrate the effectiveness of our method at disregarding task-irrelevant information by applying SAR to real-world CARLA-based autonomous driving with natural distractions. Finally, we provide the analysis results of generalization drawn from the generalization decay and t-SNE visualization. Code and demo videos are available at https://github.com/DMU-XMU/SAR.git.

Differential privacy is a formal definition of privacy ensuring that sensitive information relative to individuals cannot be inferred by querying a database. In this paper, we exploit a modeling of this framework via labeled Markov Chains (LMCs) to provide a logical characterization of differential privacy : we consider a probabilistic variant of the Hennessy-Milner logic and we define a syntactic distance on formulae in it measuring their syntactic disparities. Then, we define a trace distance on LMCs in terms of the syntactic distance between the sets of formulae satisfied by them. We prove that such distance corresponds to the level of privacy of the LMCs. Moreover, we use the distance on formulae to define a real-valued semantics for them, from which we obtain a logical characterization of weak anonymity : the level of anonymity is measured in terms of the formulae distinguishing the considered LMCs. Then, we focus on bisimulation semantics on nondeterministic probabilistic processes and we provide a logical characterization of generalized bisimulation metrics , namely those defined via the generalized Kantorovich lifting . Our characterization is based on the notion of mimicking formula of a process and the syntactic distance on formulae, where the former captures the observable behavior of the corresponding process and allows us to characterize bisimilarity. We show that the generalized bisimulation distance on processes is equal to the syntactic distance on their mimicking formulae. Moreover, we use the distance on mimicking formulae to obtain bounds on differential privacy. • Logical characterization of differential privacy. • Novel characterization technique based on metric over modal formulae. • Logical characterization of generalized bisimulation metrics. • Definition of real-valued semantics of formulae via metric on formulae. • Logical characterization of weak anonymity.

Bisimulation Metrics Research Articles

Related Topics

Articles published on Bisimulation Metrics

Sequential action-induced invariant representation for reinforcement learning

Integrated operations strategies for shared and privately-owned autonomous vehicles: A deep reinforcement learning framework

Invariant Representations Learning with Future Dynamics

Back to the format: A survey on SOS for probabilistic processes

Robust Representation Learning by Clustering with Bisimulation Metrics for Visual Reinforcement Learning with Distractions

Behavioural Pseudometrics for Nondeterministic Probabilistic Systems

A weak semantic approach to bisimulation metrics in models with nondeterminism and continuous state spaces

Bisimulation metrics and norms for real-weighted automata

Scalable Methods for Computing State Similarity in Deterministic Markov Decision Processes

A logical characterization of differential privacy

The metric linear-time branching-time spectrum on nondeterministic probabilistic processes

Variational Bayesian Exploration-Based Active Sarsa Algorithm

Logical characterization of branching metrics for nondeterministic probabilistic transition systems

Logical Characterization of Bisimulation Metrics

Representation Discovery for MDPs Using Bisimulation Metrics

Representation Discovery for MDPs Using Bisimulation Metrics

Fixed-point Characterization of Compositionality Properties of Probabilistic Processes Combinators

Verification of Safety and Liveness Properties of Metric Transition Systems

Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics

Bisimulation Metrics for Continuous Markov Decision Processes

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bisimulation Metrics Research Articles

Related Topics

Articles published on Bisimulation Metrics

Sequential action-induced invariant representation for reinforcement learning

Integrated operations strategies for shared and privately-owned autonomous vehicles: A deep reinforcement learning framework

Invariant Representations Learning with Future Dynamics

Back to the format: A survey on SOS for probabilistic processes

Robust Representation Learning by Clustering with Bisimulation Metrics for Visual Reinforcement Learning with Distractions

Behavioural Pseudometrics for Nondeterministic Probabilistic Systems

A weak semantic approach to bisimulation metrics in models with nondeterminism and continuous state spaces

Bisimulation metrics and norms for real-weighted automata

Scalable Methods for Computing State Similarity in Deterministic Markov Decision Processes

A logical characterization of differential privacy

The metric linear-time branching-time spectrum on nondeterministic probabilistic processes

Variational Bayesian Exploration-Based Active Sarsa Algorithm

Logical characterization of branching metrics for nondeterministic probabilistic transition systems

Logical Characterization of Bisimulation Metrics

Representation Discovery for MDPs Using Bisimulation Metrics

Representation Discovery for MDPs Using Bisimulation Metrics

Fixed-point Characterization of Compositionality Properties of Probabilistic Processes Combinators

Verification of Safety and Liveness Properties of Metric Transition Systems

Basis Function Discovery Using Spectral Clustering and Bisimulation Metrics

Bisimulation Metrics for Continuous Markov Decision Processes