Abstract

Using Spatial Domain Correlation Pattern Recognition (CPR) in Internet-of-Things (IoT)-based applications often faces constraints, like inadequate computational resources and limited memory. To reduce the computation workload of inference due to large spatial-domain CPR filters and convert filter weights into hardware-friendly data-types, this paper introduces the power-of-two (Po2) and dynamic-fixed-point (DFP) quantization techniques for weight compression and the sparsity induction in filters. Weight quantization re-training (WQR), the log-polar, and the inverse log-polar geometric transformations are introduced to reduce quantization error. WQR is a method of retraining the CPR filter, which is presented to recover the accuracy loss. It forces the given quantization scheme by adding the quantization error in the training sample and then re-quantizes the filter to the desired quantization levels which reduce quantization noise. Further, Particle Swarm Optimization (PSO) is used to fine-tune parameters during WQR. Both geometric transforms are applied as pre-processing steps. The Po2 quantization scheme showed better performance close to the performance of full precision, while the DFP quantization showed further closeness to the Receiver Operator Characteristic of full precision for the same bit-length. Overall, spatial-trained filters showed a better compression ratio for Po2 quantization after retraining of the CPR filter. The direct quantization approach achieved a compression ratio of 8 at 4.37× speedup with no accuracy degradation. In contrast, quantization with a log-polar transform is accomplished at a compression ratio of 4 at 1.12× speedup, but, in this case, 16% accuracy of degradation is noticed. Inverse log-polar transform showed a compression ratio of 16 at 8.90× speedup and 6% accuracy degradation. All the mentioned accuracies are reported for a common database.

Highlights

  • The resulting properties of these quantization schemes are studied in conjunction with direct, log-polar, inverse log-polar, and filter retraining

  • Equation (33) represents the peak-signal-to-noise ratio (PSNR), while the power of noise in the denominator is defined by the Mean Square Error (MSE), which is the average of the square of the pixel-by-pixel difference between the original image and the approximated version of the image; the MSE depends on the variance of the original and quantized signals

  • We propose the Weight quantization re-training (WQR) approach and preprocessing steps, like log-map and inverse log-map, to improve the accuracy degradation through full-precision weight quantization

Read more

Summary

Introduction

The computer vision system competence has faced many challenges during their early development phases. These challenges impeded the target detection performance of artificial vision systems. CPR filters are trained and tested in the frequency domain. Spatial domain CPR filters are trained in the frequency domain, and they are later converted back to the space domain for inference. The current paper refers to this methodology as frequency-trained (FT) In addition to this approach, this paper considers complete training and inference in the space domain known as spatially-trained (ST). Inference in the spatial domain is computationally expensive as compared to the frequency domain It involves cross-correlation between the test image and the reference template. Inference can be performed on various devices, like CPU, Internet-of-Things (IoT) devices, GPU, or ASICs

Motivation and Research Challenges
Contributions
Mathematical Background and Related Work
Limitations
Log-Polar Transform
Overview
Quantization Schemes
Retraining the CPR Filter
Geometric Transform
Reducing the Standard Deviation Using Log-Polar Transform
Reducing the Standard Deviation Using Inverse Log-Polar Transform
Configurations for Weight Quantization
CPR Filter Implementations and Setting
Database
Evaluation Framework
Parameter Optimization
Rotational Analysis
Scale and Moving Light Analysis
16. Rotation test fortest
18. Scalability test for test
20. CPR for the object under different lighting
ROC Comparative Analysis
F T 4 - 0S
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.