Abstract

Objective: Non-contact vital sign monitoring enables the estimation of vital signs, such as heart rate, respiratory rate and oxygen saturation (SpO2), by measuring subtle color changes on the skin surface using a video camera. For patients in a hospital ward, the main challenges in the development of continuous and robust non-contact monitoring techniques are the identification of time periods and the segmentation of skin regions of interest (ROIs) from which vital signs can be estimated. We propose a deep learning framework to tackle these challenges. Approach: This paper presents two convolutional neural network (CNN) models. The first network detects the presence of a patient and segments the patient’s skin area. The second network combines the output of the first network with optical flow to identify time periods of clinical intervention, so that these periods can be excluded from the estimation of vital signs. Both networks were trained using video recordings from a clinical study involving 15 pre-term infants conducted in the high dependency area of the neonatal intensive care unit (NICU) of the John Radcliffe Hospital in Oxford, UK. Main results: Our proposed methods achieved an accuracy of 98.8% for patient detection, a mean intersection-over-union (IoU) score of 88.6% for skin segmentation and an accuracy of 94.5% for clinical intervention detection using two-fold cross-validation. Our deep learning models produced accurate results and were robust to different skin tones, changes in light conditions, pose variations and different clinical interventions by medical staff and family visitors. Significance: Our approach allows cardio-respiratory signals to be derived continuously from the patient’s skin during periods in which the patient is present and no clinical intervention is undertaken.
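
For reference, the intersection-over-union (IoU) score quoted above measures the overlap between a predicted skin mask and its ground-truth annotation. A minimal sketch in Python/NumPy (array names are illustrative, not from the paper):

```python
import numpy as np

def iou(pred_mask: np.ndarray, gt_mask: np.ndarray) -> float:
    """Intersection-over-union between two binary masks of the same shape."""
    pred = pred_mask.astype(bool)
    gt = gt_mask.astype(bool)
    intersection = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    return intersection / union if union > 0 else 1.0

# Mean IoU over a set of evaluation frames (illustrative):
# mean_iou = np.mean([iou(p, g) for p, g in zip(predictions, ground_truths)])
```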

Highlights

  • Non-contact vital sign monitoring using a video camera enables vital signs to be measured from subtle color changes on the surface of the skin at a distance, without any sensors attached to the patient

  • We show that photoplethysmographic imaging (PPGi) and respiratory signals can be derived using our deep learning framework (a sketch of PPGi extraction follows this list)

  • A baseline experiment for clinical intervention detection was implemented using the two-stream deep learning architecture for action recognition proposed by Simonyan and Zisserman (2014), in which the outputs of the two network streams were combined using a support vector machine (SVM); a fusion sketch follows this list
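
To illustrate the second highlight: a PPGi signal is typically obtained by spatially averaging the pixel intensities within the segmented skin region in each frame, yielding one sample per frame, from which the heart rate can be estimated as the dominant frequency in the physiological band. A minimal sketch, assuming a NumPy video array and per-frame binary skin masks (all names are illustrative, not the authors’ implementation):

```python
import numpy as np

def ppgi_signal(frames: np.ndarray, skin_masks: np.ndarray, channel: int = 1) -> np.ndarray:
    """Spatially average one color channel over the skin mask, frame by frame.

    frames:     (T, H, W, 3) video array.
    skin_masks: (T, H, W) binary skin masks from the segmentation network.
    The green channel (index 1) is often used, as it tends to carry the
    strongest pulsatile component.
    """
    signal = np.empty(frames.shape[0])
    for t in range(frames.shape[0]):
        mask = skin_masks[t].astype(bool)
        signal[t] = frames[t, :, :, channel][mask].mean() if mask.any() else np.nan
    return signal
```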

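For the baseline in the third highlight, the two-stream approach can be sketched as late fusion of per-clip class scores. A minimal illustration using scikit-learn, assuming the spatial (RGB) and temporal (optical-flow) streams have already produced softmax scores for each clip (variable names are assumptions, not from the paper):

```python
import numpy as np
from sklearn.svm import SVC

def fit_fusion_svm(spatial_scores: np.ndarray,
                   temporal_scores: np.ndarray,
                   labels: np.ndarray) -> SVC:
    """Train an SVM on the concatenated class scores of the two streams.

    spatial_scores, temporal_scores: (N, C) softmax outputs per clip.
    labels: (N,) ground-truth labels for the training clips.
    """
    fused_features = np.hstack([spatial_scores, temporal_scores])
    svm = SVC(kernel="linear")
    svm.fit(fused_features, labels)
    return svm

# At test time, stack the two streams' scores the same way:
# predictions = svm.predict(np.hstack([spatial_test, temporal_test]))
```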

Summary

Introduction

Non-contact vital sign monitoring using a video camera enables vital signs to be measured from subtle color changes on the surface of the skin at a distance, without any sensors attached to the patient. In the hospital ward, shadows are cast on the infant when clinical staff walk between the ward’s light sources and the incubator. These scenarios present challenges to the development of algorithms for the detection of appropriate time periods and ROIs in which vital signs could be estimated. The proposed framework consists of two deep learning networks: the patient detection and skin segmentation network; and the intervention detection network. These networks operate in sequence to identify appropriate time periods and ROIs from which vital signs can be estimated. Vital signs are estimated from ROIs on the patient’s skin only when the patient is present and no clinical intervention is being undertaken.
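
A minimal sketch of this gating logic, combining the outputs of the two networks per time window (the function and threshold below are illustrative assumptions, not the paper’s implementation):

```python
def usable_for_vital_signs(patient_present: bool,
                           intervention_detected: bool,
                           skin_mask_area: int,
                           min_skin_pixels: int = 1000) -> bool:
    """A time window is used for vital-sign estimation only when the patient
    is present, no clinical intervention is detected, and the segmented skin
    ROI is large enough to yield a stable signal (threshold is hypothetical)."""
    return (patient_present
            and not intervention_detected
            and skin_mask_area >= min_skin_pixels)
```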

Clinical study
Patient detection and skin segmentation network
Network architecture
Training data
Network training
Intervention detection network
Network training
Patient detection and skin segmentation
Intervention detection
Patient detection
Skin segmentation
Intervention detection
Conclusion