Abstract

We study the extent to which fixing the second-layer weights reduces the capacity and generalization ability of a two-layer perceptron. Architectures with N inputs, K hidden units, and a single output are considered, with both overlapping and nonoverlapping receptive fields. We obtain from simulations one measure of the strength of a network---its critical capacity, ${\mathrm{\ensuremath{\alpha}}}_{\mathit{c}}$. Using the ansatz ${\mathrm{\ensuremath{\tau}}}_{\mathrm{med}}$\ensuremath{\propto}(${\mathrm{\ensuremath{\alpha}}}_{\mathit{c}}$-\ensuremath{\alpha}${)}^{\mathrm{\ensuremath{-}}2}$ to describe the manner in which the median learning time diverges as ${\mathrm{\ensuremath{\alpha}}}_{\mathit{c}}$ is approached, we estimate ${\mathrm{\ensuremath{\alpha}}}_{\mathit{c}}$ in a manner that does not depend on arbitrary impatience parameters. The chir learning algorithm is used in our simulations. For K=3 and overlapping receptive fields we show that the general machine is equivalent to the committee machine with the same architecture. For K=5 and the same connectivity the general machine is the union of four distinct networks with fixed second layer weights, of which the committee machine is the one with the highest ${\mathrm{\ensuremath{\alpha}}}_{\mathit{c}}$. Since the capacity of the union of a finite set of machines equals that of the strongest constituent, the capacity of the general machine with K=5 equals that of the committee machine. We were not able to prove this for general K , but believe that it does hold. We investigated the internal representations used by different machines, and found that high correlations between the hidden units and the output reduce the capacity. Finally we studied the Boolean functions that can be realized by networks with fixed second layer weights. We discovered that two different machines implement two completely distinct sets of Boolean functions.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.