Abstract
Quantum neural networks (QNNs) have generated excitement around the possibility of efficiently analyzing quantum data. But this excitement has been tempered by the existence of exponentially vanishing gradients, known as barren plateau landscapes, for many QNN architectures. Recently, quantum convolutional neural networks (QCNNs) have been proposed, involving a sequence of convolutional and pooling layers that reduce the number of qubits while preserving information about relevant data features. In this work, we rigorously analyze the gradient scaling for the parameters in the QCNN architecture. We find that the variance of the gradient vanishes no faster than polynomially, implying that QCNNs do not exhibit barren plateaus. This result provides an analytical guarantee for the trainability of randomly initialized QCNNs, which highlights QCNNs as being trainable under random initialization, unlike many other QNN architectures. To derive our results, we introduce a novel graph-based method to analyze expectation values over Haar-distributed unitaries, which will likely be useful in other contexts. Finally, we perform numerical simulations to verify our analytical results.
Received 12 March 2021; revised 13 July 2021; accepted 2 August 2021. DOI: https://doi.org/10.1103/PhysRevX.11.041011
Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI.
Physics Subject Headings (PhySH), Research Areas: Machine learning; Quantum algorithms; Quantum computation; Quantum information
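Schematically, the scaling claim of the abstract can be contrasted with the barren-plateau case as follows (an illustrative restatement rather than the paper's exact bound; b and F(n) are placeholder symbols):
\[
\text{barren plateau:}\quad \operatorname{Var}\!\big[\partial_\mu C\big] \in O\!\left(b^{-n}\right),\ b>1,
\qquad
\text{QCNN (this work):}\quad \operatorname{Var}\!\big[\partial_\mu C\big] \,\ge\, F(n) \in \Omega\!\left(1/\operatorname{poly}(n)\right),
\]
where n is the number of qubits, C is the cost function, and \(\partial_\mu C\) its partial derivative with respect to a trainable parameter \(\theta_\mu\).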
Highlights
The field of classical machine learning has been revolutionized by the advent of neural networks (NNs).
Our second main result is a lower bound on Var[∂_μ C] for the quantum convolutional neural network (QCNN) architecture, obtained with the graph recursion integration method (GRIM); a minimal numerical sketch of estimating this variance follows these highlights.
We present our results, which guarantee the trainability of a pooling-based QCNN.
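The following is not the paper's numerics and not its QCNN ansatz: it is a minimal NumPy sketch of how one can empirically estimate Var[∂_μ C] under random initialization for a toy two-qubit circuit, using the parameter-shift rule. The ansatz, cost observable, sample count, and all function names are illustrative assumptions.

# Minimal sketch (not the paper's code): Monte Carlo estimate of the gradient
# variance Var[dC/dtheta_0] for a toy two-qubit parameterized circuit under
# random initialization, using the parameter-shift rule.
import numpy as np

I2 = np.eye(2)
Y = np.array([[0, -1j], [1j, 0]])
Z = np.array([[1, 0], [0, -1]], dtype=complex)
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)

def ry(theta):
    """Single-qubit Y rotation RY(theta)."""
    return np.cos(theta / 2) * I2 - 1j * np.sin(theta / 2) * Y

def circuit_state(thetas):
    """Apply RY(theta_0) x RY(theta_1), then a CNOT, to |00>."""
    psi = np.zeros(4, dtype=complex)
    psi[0] = 1.0
    psi = np.kron(ry(thetas[0]), ry(thetas[1])) @ psi
    return CNOT @ psi

def cost(thetas):
    """Cost C = expectation value of Z on qubit 0."""
    psi = circuit_state(thetas)
    obs = np.kron(Z, I2)
    return np.real(np.vdot(psi, obs @ psi))

def grad_param_shift(thetas, mu):
    """Exact derivative dC/dtheta_mu via the parameter-shift rule."""
    shift = np.zeros_like(thetas)
    shift[mu] = np.pi / 2
    return 0.5 * (cost(thetas + shift) - cost(thetas - shift))

rng = np.random.default_rng(0)
samples = [grad_param_shift(rng.uniform(0, 2 * np.pi, size=2), mu=0)
           for _ in range(2000)]
print("estimated Var[dC/dtheta_0] over random initializations:", np.var(samples))

For this toy circuit C = cos(theta_0), so the estimate should be close to the exact value 1/2; the point is only the Monte Carlo procedure for sampling gradients under random initialization, not the scaling result itself.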
Summary
The field of classical machine learning has been revolutionized by the advent of neural networks (NNs). Quantum neural networks (QNNs) are their quantum analog: they employ noisy quantum hardware to evaluate a cost (or loss) function, while leveraging the power of classical optimizers to train the parameters of a quantum circuit. We show that, for the quantum convolutional neural network (QCNN) architecture, the variance of the cost-function partial derivatives vanishes no faster than polynomially in the system size. This implies that the cost-function landscape does not exhibit a barren plateau, and that the QCNN architecture is trainable under random initialization of its parameters. The QCNN takes as input an n-qubit state ρ_in in a Hilbert space H_in, which is sent through a circuit composed of a sequence of convolutional and pooling layers. After the final pooling layer, one applies a fully connected unitary F to the remaining qubits and obtains an output state ρ_out whose dimension is much smaller than that of ρ_in. Note that the nonlinearities in a QCNN arise from the pooling operators (measurement and conditioned unitary) in the pooling layers, which effectively reduce the degrees of freedom in each layer.
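As a rough illustration of the layer structure just described (not the authors' implementation; the pool-by-half convention, the final register size, and all names are assumptions of this sketch), the following tracks how the number of active qubits shrinks through the convolutional and pooling layers before the fully connected unitary F is applied:

# Schematic of the QCNN layer structure (illustrative only: the exact
# convolution/pooling pattern of the paper is not reproduced here).
# Each pooling layer measures roughly half of the remaining qubits and applies
# unitaries conditioned on the outcomes, so the qubit count halves per layer
# until a small register is left for the fully connected unitary F.

def qcnn_layer_schedule(n_qubits: int, n_final: int = 2):
    """Return the number of active qubits after each conv+pool layer."""
    schedule = [n_qubits]
    while schedule[-1] > n_final:
        # convolutional layer: parameterized two-qubit unitaries on
        # neighboring pairs (does not change the number of qubits);
        # pooling layer: measure half the qubits, keep the other half
        schedule.append(max(n_final, schedule[-1] // 2))
    return schedule

n = 16
layers = qcnn_layer_schedule(n)
print("active qubits per layer:", layers)              # [16, 8, 4, 2]
print("number of conv+pool layers:", len(layers) - 1)  # grows as log2(n); here 3

Because each pooling layer roughly halves the qubit count, the number of convolutional-plus-pooling layers grows only logarithmically with n, one structural feature separating QCNNs from the deep, unstructured ansätze associated with barren plateaus.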