A law of data separation in deep learning

Hangfeng He,Weijie J Su

doi:10.1073/pnas.2221704120

Abstract

While deep learning has enabled significant advances in many areas of science, its black-box nature hinders architecture design for future artificial intelligence applications and interpretation for high-stakes decision-makings. We addressed this issue by studying the fundamental question of how deep neural networks process data in the intermediate layers. Our finding is a simple and quantitative law that governs how deep neural networks separate data according to class membership throughout all layers for classification. This law shows that each layer improves data separation at a constant geometric rate, and its emergence is observed in a collection of network architectures and datasets during training. This law offers practical guidelines for designing architectures, improving model robustness and out-of-sample performance, as well as interpreting the predictions.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the National Academy of Sciences of the United States of America	Publication Date: Aug 28, 2023
Citations: 1	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

A law of data separation in deep learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America

Lead the way for us

Similar Papers

Comprehensive Study for Breast Cancer Using Deep Learning and Traditional Machine Learning
-
Zanco journal of pure and applied sciences | VOL. 34
--
12 Apr 2022
Zanco journal of pure and applied sciences | VOL. 34

Artificial intelligence in interdisciplinary life science and drug discovery research.
Jürgen Bajorath
Future Science OA | VOL. 8
Jürgen BajorathJürgen Bajorath
08 Mar 2022
Future Science OA | VOL. 8

Cancer detection in breast cells using a hybrid method based on deep complex neural network and data mining.
Ling Yang ... Leren Qian
Zeitschrift f�r Krebsforschung und Klinische Onkologie | VOL. 149
Ling Yang, et. al.Ling Yang ... Leren Qian
24 Jul 2023
Zeitschrift f�r Krebsforschung und Klinische Onkologie | VOL. 149

Instantaneous Frequency Estimation of FM Signals under Gaussian and Symmetric α-Stable Noise: Deep Learning versus Time–Frequency Analysis
Huda Saleem Razzaq ... Zahir M Hussain
Information | VOL. 14
Huda Saleem Razzaq, et. al.Huda Saleem Razzaq ... Zahir M Hussain
28 Dec 2022
Information | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A law of data separation in deep learning

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America