Hidden Unit Activation Research Articles

AbstractThis paper makes a simple but previously neglected point with regard to an empirical application of the test of White (1989) and Lee, White, and Granger (LWG, 1993), for neglected nonlinearity in conditional mean, using the feedforward single layer artificial neural network (ANN). Because the activation parameters in the hidden layer are not identified under the null hypothesis of linearity, LWG suggested to activate the ANN hidden units based on the randomly generated activation parameters. Their Monte Carlo experiments demonstrated the excellent performance (good size and power), even if LWG considered a fairly small number (10 or 20) of random hidden unit activations. However, in this paper, we note that the good size and power of Monte Carlo experiments are the average frequencies of rejecting the null hypothsis over multiple replications of the data generating process. The average over many simulations in Monte Carlo smooths out the randomness of the activations. In an empirical study, unlike in a Monte Carlo study, multiple realizations of the data are not possible or available. In this case, the ANN test is sensitive to the randomly generated activation parameters. One solution is the use of Bonferroni bounds as suggested by LWG (1993), which however still exhibits some excessive dependence on the random activations (as shown in this paper). Another solution is to integrate the test statistic over the nuisance parameter space, for which however, bootstrap or simulation should be used to obtain the null distribution of the integrated statistic. In this paper, we consider a much simpler solution that is shown to work very well. That is, we simply increase the number of randomized hidden unit activations to a (very) large number (e.g. 1,000). We show that using many randomly generated activation parameters can robustify the performance of the ANN test when it is applied to a real empirical data. This robustification is reliable and useful in practice and can be achieved at no cost as increasing the number of random activations is almost costless given today’s computer technology.

Read full abstract

Evidence from biological studies suggests that humans are able to predict the sensory consequences of their own actions [1]. Computational studies also demonstrate the advantage of systems that predict sensory consequences of actions over those that predict the value of actions alone [2]. But how could the ability to predict sensory consequences of actions have evolved? One solution suggested by [3] is that prediction mechanisms first evolved to deal with natural sources of delay. Delay is commonly considered to be a purely negative feature of real world systems; however, we argue that delay can actually encourage evolution of the prediction of sensory consequences. We hypothesize that increasing sensory delay to an evolving population of sensory-motor agents will increase reliance on internal prediction of sensory consequences. To test our hypothesis we evolved populations of artificial neural networks at a complex control task (i.e. pole balancing, see figure figure1)1) with varied neural conduction delay (Δt) between sensory neurons and input to the control network (see figure figure2),2), which estimates the long term cost of applying a specific action. For top fitness networks, hidden unit activations were recorded as well as the true consequent sensory state during several evaluation trials. Each sensory variable was associated with the hidden unit that the sensory variable was maximally correlated with. Taking the average of these correlation values provides a measure of how well an agent can predict the sensory consequences of actions. We expected to find that increasing sensory delay also increases the average correlation measure described above. Figure 1 Cart-Pole Balancing Figure 2 Control network structure The result of the experiment (summarized in figure figure3)3) show that with no delay successful agents use a range of strategies, however, as delay increases successful strategies are forced to rely more and more on prediction of the next state to compensate for sensory delay. This seems surprising when considering that under conditions of no delay it is considerably easier to predict the next state than conditions with increased delay. Figure 3 Absolute correlation between hidden until activations and variables of the state at time t+Δt as delay increases. Although the common conception of delay is negative, sensory delay can direct natural selection to favor individuals that are better able to predict the sensory consequences of their actions.

Read full abstract

Hidden Unit Activation Research Articles

Related Topics

Articles published on Hidden Unit Activation

Self-consistent dynamical field theory of kernel evolution in wide neural networks *

A Correspondence Between Normalization Strategies in Artificial and Biological Neural Networks.

Neural state space alignment for magnitude generalization in humans and recurrent networks

An EM Algorithm for Capsule Regression.

Artificial neural networks reveal individual differences in metacognitive monitoring of memory.

Modelling the N400 brain potential as change in a probabilistic representation of meaning.

Improving EEG-Based Driver Fatigue Classification Using Sparse-Deep Belief Networks.

STDP-Compatible Approximation of Backpropagation in an Energy-Based Model.

Conservativeness of Untied Auto-Encoders

Gaussian Cardinality Restricted Boltzmann Machines

Testing for Neglected Nonlinearity Using Artificial Neural Networks with Many Randomized Hidden Unit Activations

Function analysis based rule extraction from artificial neural networks for transformer incipient fault diagnosis

Neural conduction delay forces the emergence of predictive function in simulated evolution

Greedy rule generation from discrete data and its use in neural network rule extraction

Extracting rules from multilayer perceptrons in classification problems: A clustering-based approach

Neural Network Training Algorithm with Positive Correlation

Incremental training of first order recurrent neural networks to predict a context-sensitive language

A Clustering Genetic Algorithm For Extracting Rules From Multilayer Perceptrons Trained In Classification Problems

GENERATING CONCISE SETS OF LINEAR REGRESSION RULES FROM ARTIFICIAL NEURAL NETWORKS

Chapter 16 Connectionist contributions to population coding in the motor cortex

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Hidden Unit Activation Research Articles

Related Topics

Articles published on Hidden Unit Activation

Self-consistent dynamical field theory of kernel evolution in wide neural networks *

A Correspondence Between Normalization Strategies in Artificial and Biological Neural Networks.

Neural state space alignment for magnitude generalization in humans and recurrent networks

An EM Algorithm for Capsule Regression.

Artificial neural networks reveal individual differences in metacognitive monitoring of memory.

Modelling the N400 brain potential as change in a probabilistic representation of meaning.

Improving EEG-Based Driver Fatigue Classification Using Sparse-Deep Belief Networks.

STDP-Compatible Approximation of Backpropagation in an Energy-Based Model.

Conservativeness of Untied Auto-Encoders

Gaussian Cardinality Restricted Boltzmann Machines

Testing for Neglected Nonlinearity Using Artificial Neural Networks with Many Randomized Hidden Unit Activations

Function analysis based rule extraction from artificial neural networks for transformer incipient fault diagnosis

Neural conduction delay forces the emergence of predictive function in simulated evolution

Greedy rule generation from discrete data and its use in neural network rule extraction

Extracting rules from multilayer perceptrons in classification problems: A clustering-based approach

Neural Network Training Algorithm with Positive Correlation

Incremental training of first order recurrent neural networks to predict a context-sensitive language

A Clustering Genetic Algorithm For Extracting Rules From Multilayer Perceptrons Trained In Classification Problems

GENERATING CONCISE SETS OF LINEAR REGRESSION RULES FROM ARTIFICIAL NEURAL NETWORKS

Chapter 16 Connectionist contributions to population coding in the motor cortex