Neural Network Weight Matrices Research Articles

At the core of any inference procedure, deep neural networks are dot product operations, which are the component that requires the highest computational resources. For instance, deep neural networks, such as VGG-16, require up to 15-G operations in order to perform the dot products present in a single forward pass, which results in significant energy consumption and thus limits their use in resource-limited environments, e.g., on embedded devices or smartphones. One common approach to reduce the complexity of the inference is to prune and quantize the weight matrices of the neural network. Usually, this results in matrices whose entropy values are low, as measured relative to the empirical probability mass distribution of its elements. In order to efficiently exploit such matrices, one usually relies on, inter alia, sparse matrix representations. However, most of these common matrix storage formats make strong statistical assumptions about the distribution of the elements; therefore, cannot efficiently represent the entire set of matrices that exhibit low-entropy statistics (thus, the entire set of compressed neural network weight matrices). In this paper, we address this issue and present new efficient representations for matrices with low-entropy statistics. Alike sparse matrix data structures, these formats exploit the statistical properties of the data in order to reduce the size and execution complexity. Moreover, we show that the proposed data structures can not only be regarded as a generalization of sparse formats but are also more energy and time efficient under practically relevant assumptions. Finally, we test the storage requirements and execution performance of the proposed formats on compressed neural networks and compare them to dense and sparse representations. We experimentally show that we are able to attain up to ×42 compression ratios, ×5 speed ups, and ×90 energy savings when we lossless convert the state-of-the-art networks, such as AlexNet, VGG-16, ResNet152, and DenseNet, into the new data structures and benchmark their respective dot product.

Read full abstract

This paper presents a methodology that reflected functions by reflecting the weight matrices of an artificial neural network. One of the major problems with the connectionist approach is that trained neural networks can only associate fixed sets of input–output mappings. We provide a methodology which allows the post-trained net to associate different input–output mappings. The different mappings are reflected in a horizontal axis, reflected in a vertical axis and scaling of the initial mapping. The methodology does not train the net on the different mappings but it transforms the weight matrix of the neural network. This paper describes a novel way of utilising sigma–pi neural networks. Our new methodology manipulates sigma–pi unit's weight matrices which transform the unit's output. The weights are cast in a matrix formulation, and then transformations can be performed on the weight matrix of the sigma–pi net. To test the new methodology, the following three steps were carried out on a neural network: (1) the network was trained to perform a mapping function, f; (2) the weights of the network were transformed; and (3) the network was tested to evaluate whether it performs the reflection in the vertical axis, f ref−vert( x)= a− f( x). This reflects the function in one dimension. A reflection transformation was used to manipulate the network's weight matrices to obtain a reflection in the vertical axis. Note that the network was not trained to perform the reflection in the vertical axis. The transformation of the weight matrix transformed the function the output performs. This article explains the theory which enables us to perform transformations of sigma–pi networks and obtain reflections of the output by reflecting the weight matrices. These transforms empower the network to perform related mapping tasks once one mapping task has been learnt. This article explains how each transformation is performed and it considers whether a set of ‘standard’ transformations can indeed be derived.

Read full abstract

Neural Network Weight Matrices Research Articles

Related Topics

Articles published on Neural Network Weight Matrices

A method for the dimensionality reduction of neural network weight matrices for natural language processing

Boundary between noise and information applied to filtering neural network weight matrices.

Random matrix analysis of deep neural network weight matrices.

Adaptive neural network hierarchical sliding mode control for six degrees of freedom overhead crane

Compact and Computationally Efficient Representation of Deep Neural Networks.

An FPGA-Based Hardware Emulator for Neuromorphic Chip With RRAM

Sliding mode adaptive neural network control for hybrid visual servoing of underwater vehicles

Adaptive neural network visual servo control for dynamic positioning of underwater vehicles

Sliding mode control for a class of nonlinear systems based on robust adaptive neural network estimation

Intelligent decision making in multi-agent robot soccer system through compounded artificial neural networks

Absolute exponential stability of a class of recurrent neural networks with multiple and variable delays

Transformations of sigma–pi nets: obtaining reflected functions by reflecting weight matrices

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Neural Network Weight Matrices Research Articles

Related Topics

Articles published on Neural Network Weight Matrices

A method for the dimensionality reduction of neural network weight matrices for natural language processing

Boundary between noise and information applied to filtering neural network weight matrices.

Random matrix analysis of deep neural network weight matrices.

Adaptive neural network hierarchical sliding mode control for six degrees of freedom overhead crane

Compact and Computationally Efficient Representation of Deep Neural Networks.

An FPGA-Based Hardware Emulator for Neuromorphic Chip With RRAM

Sliding mode adaptive neural network control for hybrid visual servoing of underwater vehicles

Adaptive neural network visual servo control for dynamic positioning of underwater vehicles

Sliding mode control for a class of nonlinear systems based on robust adaptive neural network estimation

Intelligent decision making in multi-agent robot soccer system through compounded artificial neural networks

Absolute exponential stability of a class of recurrent neural networks with multiple and variable delays

Transformations of sigma–pi nets: obtaining reflected functions by reflecting weight matrices