Abstract

Driver distraction recognition has been studied by many researchers. However, most studies have failed to balance the efficiency and accuracy of models. In this study, a lightweight network called CaTNet is proposed. The CaTNet is a simplified framework based on the existing model ConvNeXt, which prunes redundant feature layers. And a whole new module CaT containing self-attention is introduced in tandem, both of which are combined to enhance the feature characterization. It captures long-range dependencies and retains the local inductive bias provided by the convolutional neural network (CNN). The proposed method is verified on the dataset of American University in Cairo (AUC) and the State Farm Distracted Driver Detection (SFD3) Dataset. The CaTNet achieves 94.82% and 99.91% Top-1 accuracy while the number of model parameters is only 2.832M, with 12.44 frames per second (FPS) running on Jetson Nano. These results are superior to other existing models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call