Global optimization of quantum dynamics with AlphaZero deep exploration

Mogens Dalgaard,Jacob Sherson,Felix Motzoi,Jens Jakob Sørensen

doi:10.1038/s41534-019-0241-0

Mogens Dalgaard, Jacob Sherson + Show 2 more

Open Access

https://doi.org/10.1038/s41534-019-0241-0

Copy DOI

Journal: NPJ quantum information	Publication Date: Jan 14, 2020
Citations: 68	License type: open-access

Affiliation: Aarhus University

Abstract

While a large number of algorithms for optimizing quantum dynamics for different objectives have been developed, a common limitation is the reliance on good initial guesses, being either random or based on heuristics and intuitions. Here we implement a tabula rasa deep quantum exploration version of the Deepmind AlphaZero algorithm for systematically averting this limitation. AlphaZero employs a deep neural network in conjunction with deep lookahead in a guided tree search, which allows for predictive hidden-variable approximation of the quantum parameter landscape. To emphasize transferability, we apply and benchmark the algorithm on three classes of control problems using only a single common set of algorithmic hyperparameters. AlphaZero achieves substantial improvements in both the quality and quantity of good solution clusters compared to earlier methods. It is able to spontaneously learn unexpected hidden structure and global symmetry in the solutions, going beyond even human heuristics.

Highlights

The agent must select an action that updates the unitary representing the state of the system
AlphaZero is a policy improvement algorithm that combines a neural network with a Monte Carlo Tree Search (MCTS) as depicted in Fig. 1b, c
A MCTS is a way of looking several steps ahead by only visiting a small subset of possible future states

Summary

Introduction

The agent must select an action that updates the unitary representing the state of the system. Similar to ref., 45 we used a population size of 70 and a mutation probability of 0.001. AlphaZero is a policy improvement algorithm that combines a neural network with a Monte Carlo Tree Search (MCTS) as depicted, c.47,48 The neural network maps from states to policies p 1⁄4 ðp[1]; p2; 1⁄4 Þ and values v.

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Global optimization of quantum dynamics with AlphaZero deep exploration

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: NPJ quantum information

Lead the way for us

Similar Papers

Bài toán điều khiển phân bố và điều khiển biên cho phương trình đạo hàm riêng elliptic nửa tuyến tính
Nguyễn Thành Quí
Can Tho University Journal of Science | VOL. 56(NaturalScience)
Nguyễn Thành QuíNguyễn Thành Quí
01 Jan 2020
Can Tho University Journal of Science | VOL. 56(NaturalScience)

Optimal ergodic control of nonlinear stochastic systems
Fabien Campillo
-
Fabien CampilloFabien Campillo
24 May 2006
24 May 2006

A Class of Control Problems Under Uncertainty
N L Grigorenko ... D G Pivovarchuk
Computational Mathematics and Modeling | VOL. 27
N L Grigorenko, et. al.N L Grigorenko ... D G Pivovarchuk
06 Jun 2016
Computational Mathematics and Modeling | VOL. 27

The intrinsic comparative dynamics of infinite horizon optimal control problems with a time-varying discount rate and time-distance discounting
Michael R Caputo
Journal of economic dynamics & control | VOL. 37
Michael R CaputoMichael R Caputo
07 Dec 2012
Journal of economic dynamics & control | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Global optimization of quantum dynamics with AlphaZero deep exploration

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: NPJ quantum information