Abstract
Remote photoplethysmography (rPPG) is a video-based heart rate measurement technology, which is widely used in special scenes where contact equipment is difficult to measure. However, the rPPG signal is very weak, and it is easily affected by factors such as uneven environmental illumination changes and the tester’s head movement, which leads to the poor robustness of the existing methods in natural scenes. A deep learning model based on vision transformer is proposed to segment the facial skin region to generate the spatiotemporal feature map of the video sequence, inputs the feature map into the model for rPPG physiological feature extraction, and then fits the rPPG signal. The experiments verify the effectiveness of the method on mixed data sets and can ensure that the model has a high degree of signal fitting while significantly reducing the computational complexity of the transformer.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.