To obtain both good training performance and good generalization in multilayer perceptron (MLP) networks, it is essential to use small networks that avoid overfitting the training data. A common approach is to train a large network and then prune the unnecessary units or weights. This paper presents an effective hidden-unit pruning algorithm, called linear dependence (LD) pruning, that utilizes sets of linear equations. In this approach, each hidden unit's output (basis function) is modeled as a linear combination of the outputs of the other hidden units. The least useful hidden unit is identified as the one predicted to increase the training error the least when replaced by its model. Once this hidden unit is found, the pruning algorithm replaces it with its model and retrains the network output weights with one iteration of training. The LD pruning algorithm's performance is compared with that of a modified optimal brain surgeon (OBS) pruning algorithm. We show that LD pruning performs as well as the OBS method yet requires orders of magnitude fewer multiplies.
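The sketch below illustrates the general idea described above: each hidden unit's activations are regressed onto those of the remaining units, the unit whose replacement is predicted to raise the training error least is removed, and its modeled contribution is folded into the surviving output weights. This is a minimal, hypothetical illustration, not the paper's implementation; the names (`H`, `W_out`, `prune_one_unit`) are assumptions, and a single least-squares solve stands in for the paper's one iteration of output-weight retraining.

```python
# Minimal sketch of linear-dependence-style hidden-unit pruning, assuming a
# single-hidden-layer network whose hidden activations H (N x n_hidden) and
# output weights W_out (n_hidden x n_outputs) are already available.
# Names and details are illustrative, not taken from the paper.
import numpy as np

def prune_one_unit(H, W_out, targets):
    """Remove the hidden unit whose linear model least increases training error."""
    n_hidden = H.shape[1]
    best = None
    for j in range(n_hidden):
        others = np.delete(H, j, axis=1)
        # Model unit j's output as a linear combination of the other units' outputs.
        coeffs, *_ = np.linalg.lstsq(others, H[:, j], rcond=None)
        # Predicted training error if unit j is replaced by its model:
        # fold unit j's output weights into those of the remaining units.
        W_reduced = np.delete(W_out, j, axis=0) + np.outer(coeffs, W_out[j])
        err = np.mean((others @ W_reduced - targets) ** 2)
        if best is None or err < best[0]:
            best = (err, j)
    _, j = best
    H_new = np.delete(H, j, axis=1)
    # One least-squares pass stands in for one iteration of output-weight retraining.
    W_new, *_ = np.linalg.lstsq(H_new, targets, rcond=None)
    return j, H_new, W_new
```

In practice this selection-and-replacement step would be repeated, pruning one unit at a time until validation error begins to rise.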