Kernel Design Research Articles

The light guide plate (LGP) is a key component of liquid crystal display (LCD) systems, so its quality directly affects display quality. However, LGPs have complex background texture, low contrast, widely varying defect sizes and numerous defect types, which makes efficient and sufficiently accurate automatic detection of LGP surface defects a significant challenge. Taking into account the optical properties, dot distribution, defect imaging characteristics and detection requirements of LGPs, this paper proposes a surface defect detection algorithm based on LGP-YOLO for practical industrial applications. To enhance the feature extraction ability of the network without dimensionality reduction, expand the effective receptive field and reduce interference from invalid targets, we build the receptive field module (RFM) by combining the efficient channel attention network (ECA-Net) with the large-kernel design of RepLKNet. To optimize the performance of the network in downstream tasks, enhance its expressive ability and improve its ability to detect multi-scale targets, we construct the small detection module (SDM) by combining space-to-depth non-strided convolution (SPDConv) and omni-dimensional dynamic convolution (ODConv). Finally, an LGP defect dataset is constructed from images collected at industrial sites, and multiple rounds of experiments are carried out to test the proposed method on this dataset. The experimental results show that the proposed LGP-YOLO network achieves an mAP of 99.08% and an F1-score of 97.45%, with an inference speed of 81.15 FPS. This demonstrates that LGP-YOLO strikes a good balance between detection accuracy and inference speed and can meet the requirements of high-precision, high-efficiency LGP defect detection in LGP manufacturing factories.
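The space-to-depth step that SPDConv builds on can be illustrated with a minimal sketch. This is not the LGP-YOLO implementation: the function name `space_to_depth`, the single-channel nested-list input and the `scale` parameter are assumptions made for illustration only. The idea is that each `scale` x `scale` neighbourhood is split across separate channels, so resolution is reduced without discarding pixels, and a non-strided convolution can then be applied to the result.

```python
def space_to_depth(image, scale=2):
    """Rearrange an H x W grid into scale*scale sub-grids of size (H/scale) x (W/scale).

    Hypothetical helper for illustration; a real network would operate on
    multi-channel tensors rather than nested Python lists.
    """
    h, w = len(image), len(image[0])
    if h % scale or w % scale:
        raise ValueError("height and width must be divisible by scale")
    channels = []
    # One output channel per (row offset, column offset) inside a scale x scale cell.
    for dy in range(scale):
        for dx in range(scale):
            channels.append(
                [[image[y][x] for x in range(dx, w, scale)]
                 for y in range(dy, h, scale)]
            )
    return channels


if __name__ == "__main__":
    image = [
        [1, 2, 3, 4],
        [5, 6, 7, 8],
        [9, 10, 11, 12],
        [13, 14, 15, 16],
    ]
    for channel in space_to_depth(image):
        print(channel)
```

Every input value appears exactly once in the output, which is why this rearrangement is often preferred over strided convolution or pooling when small targets must be preserved.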

Over the past few years, 2-D convolutional neural networks (CNNs) have been highly successful in a wide range of 2-D computer vision applications, such as image classification and object detection. At the same time, 3-D CNNs, a variant of 2-D CNNs, have shown an excellent ability to analyze 3-D data, such as video and geometric data. However, the heavy algorithmic complexity of 2-D and 3-D CNNs imposes a substantial overhead on the speed of these networks, which limits their deployment in real-life applications. Although various domain-specific accelerators have been proposed to address this challenge, most of them focus only on accelerating 2-D CNNs, without considering their computational efficiency on 3-D CNNs. In this article, we propose a unified hardware architecture to accelerate both 2-D and 3-D CNNs with high hardware efficiency. Our experiments demonstrate that the proposed accelerator can achieve up to 92.4% and 85.2% multiply-accumulate efficiency on 2-D and 3-D CNNs, respectively. To improve the hardware performance, we propose a hardware-friendly quantization approach called static block floating point (BFP), which eliminates the frequent representation conversions required in traditional dynamic BFP arithmetic. Compared with integer linear quantization using a zero-point, static BFP quantization can decrease the logic resource consumption of the convolutional kernel design by nearly 50% on a field-programmable gate array (FPGA). Without time-consuming retraining, the proposed static BFP quantization is able to quantize to an 8-bit mantissa with negligible accuracy loss. As different CNNs on our reconfigurable system require different hardware and software parameters to achieve optimal hardware performance and accuracy, we also propose an automatic tool for parameter optimization. Based on our hardware design and optimization, we demonstrate that the proposed accelerator can achieve 3.8 to 5.6 times higher energy efficiency than a graphics processing unit (GPU) implementation. Compared with state-of-the-art FPGA-based accelerators, our design achieves higher generality and 1.4 to 2.2 times higher resource efficiency on both 2-D and 3-D CNNs.
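The general idea behind block floating point can be sketched as follows: a block of values shares one exponent, and each value keeps only a small signed integer mantissa. This is a generic illustration, not the static BFP scheme of the article. The function names `bfp_quantize` and `bfp_dequantize`, the rule for choosing the shared exponent, and the clamping behaviour are all assumptions; the abstract does not specify how the static variant fixes its exponents ahead of time.

```python
import math


def bfp_quantize(block, mantissa_bits=8):
    """Quantize a block of floats to one shared exponent plus signed integer mantissas.

    Hypothetical helper: the shared exponent is derived from the largest
    magnitude in the block, which is one common choice among several.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0] * len(block)
    # frexp gives max_abs = m * 2**exp with 0.5 <= m < 1.
    _, exp = math.frexp(max_abs)
    # Scale so the largest magnitude lands near the top of the mantissa range.
    shared_exp = exp - (mantissa_bits - 1)
    limit = 2 ** (mantissa_bits - 1) - 1
    mantissas = []
    for x in block:
        q = round(x / 2.0 ** shared_exp)
        # Clamp to the symmetric signed range [-limit, limit].
        mantissas.append(max(-limit, min(limit, q)))
    return shared_exp, mantissas


def bfp_dequantize(shared_exp, mantissas):
    """Reconstruct approximate floats from the shared exponent and mantissas."""
    return [m * 2.0 ** shared_exp for m in mantissas]


if __name__ == "__main__":
    exp, mants = bfp_quantize([0.5, -0.25, 0.125, 0.0])
    print(exp, mants)
    print(bfp_dequantize(exp, mants))
```

Because all mantissas in a block share one exponent, multiply-accumulate operations inside the block reduce to integer arithmetic, which is what makes the format attractive on FPGAs.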

Articles published on Kernel Design

HawkEye Conv-Driven YOLOv10 with Advanced Feature Pyramid Networks for Small Object Detection in UAV Imagery

Compatible and Applicable Platform design of Secure Real-Time Operating System Kernel

NPFormer: Interpretable rotating machinery fault diagnosis architecture design under heavy noise operating scenarios

Asymmetric convolutional modulation network for efficient image super-resolution

Adaptive Joint Carrier and DOA Estimations of FHSS Signals Based on Knowledge-Enhanced Compressed Measurements and Deep Learning.

Fault diagnosis of rolling bearings under varying speeds based on gray level co-occurrence matrix and DCCNN

LSKANet: Long Strip Kernel Attention Network for Robotic Surgical Scene Segmentation.

A distance-based kernel for classification via Support Vector Machines.

Incremental learning-based optimal design of BFN kernel for online spacecraft disturbance rejection control

Spherical harmonic coefficients of isotropic polynomial functions with applications to gravity field modeling

LGP-YOLO: an efficient convolutional neural network for surface defect detection of light guide plate

On kernel design for regularized non-causal system identification

High-Performance Acceleration of 2-D and 3-D CNNs on FPGAs Using Static Block Floating Point.

Physics-constrained Gaussian process model for prediction of hydrodynamic interactions between wave energy converters in an array

A note on microlocal kernel design for some slow–fast stochastic differential equations with critical transitions and application to EEG signals

A Novel Variable Convolution Kernel Design According to Time-frequency Resolution Altering in Bearing Fault Diagnosis

Analysis of Kernel Matrices via the von Neumann Entropy and Its Relation to RVM Performances.

Error Bounds for Kernel-Based Linear System Identification With Unknown Hyperparameters

Is Formal Verification of seL4 Adequate to Address the Key Security Challenges of Kernel Design?

Knowledge-Enhanced Compressed Measurements for Detection of Frequency-Hopping Spread Spectrum Signals Based on Task-Specific Information and Deep Neural Networks
