Abstract

With the increasing aging of the society, a series of consequences caused by falls of the elderly have become a serious medical problem and a real social problem. Aiming at the fall behavior of the elderly, the research and development of human fall recognition technology has practical application value. This paper proposes a method to recognize human targets in indoor images using target detection technology, and then use the trained neural network based on Self-attention technique named Vision Transformer (ViT) to recognize the falling posture of the elderly. Firstly, the target detection model is optimized and trained for indoor scene, so that the model can accurately detect human targets in indoor scene; Next, the indoor scene human target image is preprocessed, and the indoor human pose image data set is constructed; Then, the data set is used to train the ViT depth network, and enable it to correctly classify five postures (standing, sitting, lying, bending and crawling). The experimental results show that ViT network achieves high accuracy and excellent generalization capacity for the above 5 human posture and can realize the recognition of elderly falls.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call