Abstract

Human pose activity recognition (HPAR) has a wide range of applications, driven by the widespread use of data-collection devices such as smartphones and video cameras that can capture human activity data. Electronic devices and applications continue to evolve, and breakthroughs in artificial intelligence (AI) have transformed the ability to extract deeply embedded information for accurate recognition and interpretation. We propose a systematic design for integrating conventional networks and constraints into the attention framework to learn long-range dependencies, thereby achieving end-to-end pose estimation with flexibility and scalability. The proposed method adjusts the temporal receptive field using a multi-scale structure of dilated convolutions and can be adapted to a causal model for real-time performance. Our approach achieves state-of-the-art performance on three-dimensional HPAR, outperforming previous methods while maintaining lower complexity.
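
To make the receptive-field and causality ideas concrete, the following is a minimal sketch in plain Python, not the authors' implementation. The function names, the kernel size of 3, and the dilation schedules are assumptions chosen for illustration; the abstract does not specify them. The sketch shows how stacking dilated temporal convolutions widens the window of frames each output depends on, and how left-padding makes a layer causal so that an output frame uses only current and past frames.

```python
def receptive_field(kernel_size, dilations):
    """Number of input frames seen by a stack of stride-1 dilated 1D convolutions.

    Each layer with dilation d widens the window by (kernel_size - 1) * d frames.
    """
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def causal_dilated_conv1d(signal, weights, dilation):
    """Causal dilated 1D convolution over a list of floats.

    output[t] = sum_j weights[j] * signal[t - j * dilation],
    where samples before the start of the signal are treated as zero.
    Only current and past frames contribute, so no future frame is needed.
    """
    k = len(weights)
    pad = (k - 1) * dilation
    padded = [0.0] * pad + list(signal)  # left-pad only: this is what makes it causal
    out = []
    for t in range(len(signal)):
        # padded[t + pad] is signal[t]; stepping back j * dilation reaches the past.
        out.append(sum(weights[j] * padded[t + pad - j * dilation] for j in range(k)))
    return out


# Hypothetical dilation schedules (assumed, not taken from the paper).
small_window = receptive_field(3, [1, 2, 4, 8])    # 31 frames
large_window = receptive_field(3, [1, 3, 9, 27])   # 81 frames
```

Under these assumed settings, changing only the dilation schedule moves the temporal window from 31 to 81 frames without adding layers, which is the sense in which a multi-scale dilated structure adjusts the receptive field. A non-causal variant would pad symmetrically on both sides, which requires future frames and is therefore unsuitable for real-time use.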
