Abstract

Reinforcement Learning is at the core of a recent revolution in Artificial Intelligence. Simultaneously, we are witnessing the emergence of a new field: Quantum Machine Learning. In the context of these two major developments, this work addresses the interplay between Quantum Computing and Reinforcement Learning. Learning by interaction is possible in the quantum setting using the concept of oraculization of environments. The paper extends previous oracular instances to address more general stochastic environments. In this setting, we develop a novel quantum algorithm for near-optimal decision-making based on the Reinforcement Learning paradigm known as Sparse Sampling. The proposed algorithm exhibits a quadratic speedup over its classical counterpart. To the best of the authors' knowledge, this is the first quantum planning algorithm whose time complexity is independent of the number of states of the environment, which makes it suitable for large state space environments, where planning is otherwise intractable.

Highlights

  • We take one step further and require the simulated environment to be fully quantized, a notion first introduced in [6], [7], allowing a quantum agent to act in its environment according to the laws of quantum mechanics. Based on this interaction, we prove that a quantum version of the Sparse Sampling algorithm produces near-optimal actions with quadratically less computational effort than its classical counterpart.

  • This demonstrates that the proposed quantum algorithm suggests an ε-optimal action to be taken in any initial state of a given Markov Decision Process (MDP) with quadratically less computational effort than the original classical Sparse Sampling algorithm.

  • The total number of queries performed by the classical algorithm equals the number of times the condition in line 1 of the EstimateQ() method evaluates to True; a classical sketch of this routine is given after these highlights.
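
The Sparse Sampling baseline referenced in these highlights is the classical planner of Kearns, Mansour and Ng, whose cost is independent of the number of states. The Python sketch below illustrates the EstimateQ/EstimateV recursion and where the counted generative-model queries occur; the MDP interface (sample_transition, actions) and the toy model are hypothetical stand-ins for illustration, not the paper's code.

# Minimal sketch of the classical Sparse Sampling planner (Kearns, Mansour & Ng),
# the baseline whose query count the quantum algorithm improves quadratically.
# The MDP interface used here (sample_transition, actions) is a hypothetical
# stand-in for a generative model returning a next state and a reward for (s, a).
import random

def estimate_v(mdp, state, depth, width, gamma):
    """Estimate V*(state) with a depth-limited sparse lookahead tree."""
    if depth == 0:
        return 0.0
    # Value of the greedy action at this node.
    return max(estimate_q(mdp, state, a, depth, width, gamma) for a in mdp.actions)

def estimate_q(mdp, state, action, depth, width, gamma):
    """Average `width` sampled outcomes of (state, action); each sample is one
    query to the generative model, which is what the highlight above counts."""
    total = 0.0
    for _ in range(width):
        next_state, reward = mdp.sample_transition(state, action)
        total += reward + gamma * estimate_v(mdp, next_state, depth - 1, width, gamma)
    return total / width

def sparse_sampling_action(mdp, state, depth=3, width=4, gamma=0.95):
    """Return a near-optimal action; cost grows as (width * |A|)^depth,
    independent of the number of states."""
    return max(mdp.actions,
               key=lambda a: estimate_q(mdp, state, a, depth, width, gamma))

# Toy generative model used only to exercise the planner.
class TwoStateMDP:
    actions = (0, 1)
    def sample_transition(self, state, action):
        next_state = action if random.random() < 0.8 else 1 - action
        reward = 1.0 if next_state == 1 else 0.0
        return next_state, reward

if __name__ == "__main__":
    print(sparse_sampling_action(TwoStateMDP(), state=0))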


Summary

INTRODUCTION

The oracle prepares a linear combination over all possible transition states, weighted by the product of the state transition probabilities and the corresponding outcome rewards. This reasoning can be extended to allow for h interactions, i.e., sequences of h actions, by resorting to the quantum oracular environment O, as given by Equation (12); this is equivalent to computing a lookahead tree of depth h in superposition. O|ψ_0⟩ acts on the respective transition-step sub-registers, preparing a superposition state |ψ⟩ in which the term with |r⟩ = |1⟩ and the highest amplitude represents the maximum expected reward. The core loop of the procedure reads:

|ψ_i⟩ ← |ψ_i⟩ ⊗ |a_i⟩ ⊗ |s_{i+1}⟩;
|ψ_i⟩ ← T(|s_i⟩ ⊗ |a_i⟩ ⊗ |s_{i+1}⟩);
|ψ_{i+1}⟩ ← R_s(|s_{i+1}⟩ ⊗ |r⟩);
i ← i + 1;
end
action ← QSearch(|ψ_{h−1}⟩);
A[action] ← A[action] + 1;
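
To make the lookahead-in-superposition idea concrete, the following purely classical Python sketch enumerates every depth-h action sequence and weights each branch by its transition probabilities, which is the bookkeeping the quantum oracle performs in superposition before QSearch extracts the best first action. The MDP tables (P, R, ACTIONS) and helper names are illustrative assumptions, not taken from the paper.

# Classical stand-in for the superposition prepared by the oracular environment O:
# it enumerates every depth-h branch explicitly and weights each branch by the
# product of its transition probabilities, where the quantum procedure would hold
# all branches in one register and pick the best one with QSearch.
from itertools import product

P = {  # P[(s, a)] = {next_state: probability}  -- hypothetical toy MDP
    (0, 'left'):  {0: 0.9, 1: 0.1},
    (0, 'right'): {0: 0.2, 1: 0.8},
    (1, 'left'):  {0: 0.5, 1: 0.5},
    (1, 'right'): {0: 0.1, 1: 0.9},
}
R = {0: 0.0, 1: 1.0}          # reward collected on entering each state
ACTIONS = ('left', 'right')

def expected_return(s0, actions):
    """Expected sum of rewards along a fixed action sequence from s0."""
    dist = {s0: 1.0}
    total = 0.0
    for a in actions:
        nxt = {}
        for s, p in dist.items():
            for s2, q in P[(s, a)].items():
                nxt[s2] = nxt.get(s2, 0.0) + p * q
                total += p * q * R[s2]
        dist = nxt
    return total

def best_first_action(s0, h):
    """Classically search every depth-h action sequence; the quantum algorithm
    searches the same branch space quadratically faster via QSearch."""
    best = max(product(ACTIONS, repeat=h), key=lambda seq: expected_return(s0, seq))
    return best[0]

if __name__ == "__main__":
    print(best_first_action(s0=0, h=3))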

COMPLEXITY ANALYSIS
BOUNDING THE SEARCH SPACE
BOUNDING THE SAMPLE SIZE
NUMERICAL EXPERIMENTS AND RESULTS
STOCHASTIC GRIDWORLD
CONCLUSION
