Abstract

The increasing concerns of public health and safety lead to a practical need to detect smoking behaviors (or smokers) in public places. Previous smoker detection methods often focus on cigarette detection, which overlook the interaction between the smoker and the cigarette. In light of this, this paper presents a single-image smoker detection framework via human-object interaction with lite transformer network (HOLT-Net). Specifically, a one-stage human–object interaction module is devised to identify the interaction between the smoker and the cigarette. To incorporate the global information for better feature representation, a simple yet powerful lite transformer module is leveraged, where the multi-head self-attention blocks are exploited. Beyond that, a post-refinement module is integrated for taking advantage of an additional fine-grained cigarette detector to enhance the interaction detection accuracy. It is noteworthy that we present a new benchmark dataset named SCAU-Smoker Detection (SCAU-SD), which, to the best of our knowledge, is the first benchmark dataset for the specific task of smoker detection in single images with human–object interaction annotations. Extensive experimental results demonstrate the effectiveness of our HOLT-Net framework. The code is publicly available at https://github.com/JackKoLing/HOLT-Net.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.