Abstract

Traffic accidents caused by distracted driving have become a threat to people’s lives and properties, so it is necessary to recognize driver actions for early warning effectively. Since it is sometimes impossible to distinguish similar activities only by global driving image features, we explicitly extract the action-related keypoint features and propose a keypoint-enhanced model for classification. Specifically, we construct an effective frequency channel attention module to generate discriminative global representations. Considering the diversity of keypoint information, we design an adaptive weighted residual bottleneck to make the model weights of the input keypoint features dynamic. Furthermore, we propose a keypoint-guided conditional computation module. Under the guidance of keypoint features, the expert weights generated by conditional computation enable the model to adapt to different categories of driving images. Essentially, the model generates keypoint-enhanced attention to scale the classification feature channels. We also propose a model backbone training strategy combining self-supervised and supervised contrastive learning so that the model can achieve better results without large-scale labeled driver behavior data.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.