Abstract

This paper proposes a new neural network structure for image processing whose convolutional layers, instead of using kernels with fixed coefficients, use space-variant coefficients. The adoption of this strategy allows the system to adapt its behavior according to the spatial characteristics of the input data. This type of layers performs, as we demonstrate, a non-linear transfer function. The features generated by these layers, compared to the ones generated by canonical CNN layers, are more complex and more suitable to fit to the local characteristics of the images. Networks composed by these non-linear layers offer performance comparable with or superior to the ones which use canonical Convolutional Networks, using fewer layers and a significantly lower number of features. Several applications of these newly conceived networks to classical image-processing problems are analyzed. In particular, we consider: Single-Image Super-Resolution (SISR), Edge-Preserving Smoothing (EPS), Noise Removal (NR), and JPEG artifacts removal (JAR).

Highlights

  • In digital images, pixels which are spatially close one to each other have highly correlated values

  • In the system we propose in this paper, 1. the linear convolution process used in standard Convolutional Neural Networks (CNNs) is replaced by a Non-Linear Convolution (NLC) that more thoroughly exploits the relationships that exist among adjacent image pixels; 2. the input-dependent weights generated inside the NLC are normalized to preserve the dynamic range of the data while the input image is transformed into a set of features, processed in the hidden layers, and projected into the desired output image; 3. the NLC modules are trained end-to-end by a standard back-propagation algorithm

  • Different approaches for the implementation of CNNs in field programmable gate arrays (FPGAs) have been proposed in the literature, aiming to exploit the different levels of parallelisms: the feature-map-level, operator-level, task-level and layer-level [37,38,39,40], among others, and the same considerations should be taken into account in the context of Non-Linear Convolution Network (NLCN) to have a fully and optimized implementation in FPGA technology

Read more

Summary

Introduction

Pixels which are spatially close one to each other have highly correlated values This well-known fact is exploited by image-processing algorithms for image enhancement, noise reduction, interpolation, etc., and to effectively perform image compression and image analysis or understanding tasks. It has been demonstrated [1] that in natural images such an inter-dependency can be better exploited through non-linear functions, rather than through simple convolution-based linear operators. The NLC process that will be described in the following permits continuous and dynamic modification of the weights of the convolution, making the operator spatially variant and dependent on the local characteristics of the input image.

Non-Linear Convolution Methods in CNN
One-Channel Case
Multi-Input-Output Channel Case
Non-Linear Layer Discussion
Network Architectures
Parallel Architecture to Increase Receptive Field
Performance Comparison
NLCN Performance Showcase for Noise Removal
NLCN Performance Showcase for JPEG Artifacts Removal
Perspectives for an FPGA Realization of the NLCN
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.