Abstract

Deep neural networks have achieved great success in a wide range of real-world applications. Many algorithmic and implementation techniques have been developed; however, the theoretical understanding of many aspects of deep neural networks remains far from clear. A particularly interesting issue is the usefulness of dropout, which was originally motivated by the intuition of preventing complex co-adaptation of feature detectors. In this paper, we study the Rademacher complexity of different types of dropout. Our theoretical results show that for shallow neural networks (with one hidden layer or none) dropout reduces the Rademacher complexity polynomially, whereas for deep neural networks it can lead to an exponential reduction of the Rademacher complexity.
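
For readers unfamiliar with the mechanism under study, the sketch below shows standard Bernoulli ("inverted") dropout applied to a hidden-layer activation vector; this is only an illustrative assumption about the common dropout variant, not code or notation taken from the paper, and the function name and keep rate are hypothetical.

```python
import numpy as np

def dropout(activations, p=0.5, training=True, rng=None):
    """Standard Bernoulli dropout with inverted scaling (illustrative sketch).

    Each unit is dropped independently with probability p; surviving
    activations are rescaled by 1 / (1 - p) so the expected output matches
    the no-dropout forward pass used at test time.
    """
    if not training or p == 0.0:
        return activations
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(activations.shape) >= p  # keep each unit with prob. 1 - p
    return activations * mask / (1.0 - p)

# Example: apply dropout to a small hidden-layer activation vector.
hidden = np.array([0.2, -1.3, 0.7, 2.1])
print(dropout(hidden, p=0.5, training=True))
```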
