Abstract

In this chapter, we cover the fundamentals of artificial neural networks and deep learning methods. We describe the inspiration for artificial neural networks and how deep learning methods are built. We define the activation function and its role in capturing nonlinear patterns in the input data. We explain the universal approximation theorem for understanding the power and limitations of these methods and describe the main topologies of artificial neural networks that play an important role in their successful implementation. We also describe loss functions (and their penalized versions) and explain the circumstances under which each of them should be preferred. In addition to the Ridge, Lasso, and Elastic Net regularization methods, we provide details of the dropout and early stopping methods. Finally, we present the backpropagation method and illustrate it with two simple artificial neural networks.
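As a rough illustration of the penalized loss functions mentioned above, the sketch below combines a mean squared error loss with the Ridge (L2), Lasso (L1), and Elastic Net penalties. The function names and the parameters lam and alpha are illustrative assumptions, not notation from the chapter.

```python
import numpy as np

def mse(y, y_hat):
    """Mean squared error: a standard loss for continuous outcomes."""
    return np.mean((y - y_hat) ** 2)

def penalized_loss(y, y_hat, w, lam=0.1, alpha=0.5):
    """Elastic Net-penalized loss (illustrative parameterization):
    alpha=1 gives the Lasso (L1) penalty, alpha=0 gives Ridge (L2),
    and intermediate values mix the two."""
    l1 = np.sum(np.abs(w))   # Lasso penalty on the weights
    l2 = np.sum(w ** 2)      # Ridge penalty on the weights
    return mse(y, y_hat) + lam * (alpha * l1 + (1 - alpha) * l2)

# Illustrative data, not from the chapter
y = np.array([1.0, 2.0, 3.0])
y_hat = np.array([1.1, 1.9, 3.2])
w = np.array([0.5, -0.3])
print(penalized_loss(y, y_hat, w))
```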

Highlights

  • The inspiration for artificial neural networks (ANN), or neural networks, resulted from the admiration for how the human brain computes complex processes, which is entirely different from the way conventional digital computers do this

  • It is estimated that the brain is composed of around 10¹¹ neurons that work in parallel, since the processing done by the neurons and the memory captured by the synapses are distributed together over the network

  • The net input is evaluated by the activation function, and we obtain the output of the network, as shown in Fig. 10.3 (General artificial neural network model); see the sketch after this list
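As a minimal sketch of this computation, the following example forms the net input as a weighted sum plus bias and passes it through an activation function. The specific weights, inputs, and the sigmoid choice are illustrative assumptions, not values from the chapter.

```python
import numpy as np

def sigmoid(z):
    """Sigmoid activation: squashes the net input into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def neuron_output(x, w, b):
    """A single artificial neuron: the net input (weighted sum of the
    inputs plus a bias) is passed through an activation function."""
    net_input = np.dot(w, x) + b   # aggregation of incoming signals
    return sigmoid(net_input)      # activation captures nonlinearity

# Illustrative values (not from the chapter)
x = np.array([0.5, -1.2, 3.0])    # inputs (stimuli)
w = np.array([0.4, 0.1, -0.7])    # synaptic weights
b = 0.2                           # bias term
print(neuron_output(x, w, b))
```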


Summary

10.1 The Inspiration for the Neural Network Model

The inspiration for artificial neural networks (ANN), or neural networks, resulted from the admiration for how the human brain computes complex processes, which is entirely different from the way conventional digital computers do this. One characteristic of biological neurons, to which they owe their great capacity to process and perform highly complex tasks, is that they are highly connected to other neurons, from which they receive stimuli as an event occurs, or hundreds of electrical signals carrying the information learned. When this information reaches the body of the neuron, it affects its behavior and can also affect a neighboring neuron or muscle (Francisco-Caicedo and López-Sotelo 2009). Anderson et al. (1990) were more expressive in this sense and pointed out that “ANN are statistics for amateurs since most neural networks conceal the statistics from the user.”

10.2 The Building Blocks of Artificial Neural Networks
10.3 Activation Functions
10.3.1 Linear
10.3.2 ReLU
10.3.3 Leaky ReLU
10.3.4 Sigmoid
10.3.5 Softmax
10.4 The Universal Approximation Theorem
10.5 Artificial Neural Network Topologies
10.6 Successful Applications of ANN and DL
10.7 Loss Functions
10.7.1 Loss Functions for Continuous Outcomes
10.7.2 Loss Functions for Binary and Ordinal Outcomes
10.7.3 Regularized Loss Functions
10.7.4 Early Stopping Method of Training
10.8 The King Algorithm for Training Artificial Neural Networks
10.8.1.1 Feedforward Part
10.8.1.2 Backpropagation Part
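Sections 10.8.1.1 and 10.8.1.2 above split training into a feedforward part and a backpropagation part. The following self-contained sketch illustrates those two steps for a one-hidden-layer network; the architecture, sigmoid activations, squared-error loss, learning rate, and random data are all illustrative assumptions, not the chapter's own worked example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# A minimal one-hidden-layer network trained on a single example.
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 1))    # input vector
y = np.array([[1.0]])          # target output
W1 = rng.normal(size=(4, 3))   # input -> hidden weights
b1 = np.zeros((4, 1))
W2 = rng.normal(size=(1, 4))   # hidden -> output weights
b2 = np.zeros((1, 1))
lr = 0.5                       # learning rate (assumed)

for step in range(100):
    # ---- feedforward part ----
    z1 = W1 @ x + b1           # net input of the hidden layer
    a1 = sigmoid(z1)           # hidden activations
    z2 = W2 @ a1 + b2          # net input of the output layer
    a2 = sigmoid(z2)           # network output
    loss = 0.5 * np.sum((a2 - y) ** 2)

    # ---- backpropagation part (chain rule) ----
    delta2 = (a2 - y) * a2 * (1 - a2)         # output-layer error signal
    delta1 = (W2.T @ delta2) * a1 * (1 - a1)  # propagated to hidden layer

    # gradient-descent updates of weights and biases
    W2 -= lr * delta2 @ a1.T
    b2 -= lr * delta2
    W1 -= lr * delta1 @ x.T
    b1 -= lr * delta1

    if step % 25 == 0:
        print(f"step {step:3d}  loss {loss:.4f}")
```

The error signal delta2 is the derivative of the loss with respect to the output-layer net input; multiplying by the transposed weights and each layer's activation derivative propagates it backward through the network, which is the chain-rule computation that gives the method its name.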
