Abstract

In this work, a nonfiducial electrocardiogram (ECG) identification algorithm based on statistical features and random forest classifier is presented. Two feature extraction approaches are investigated: direct and band-based approaches. In the former, eleven simple statistical features are directly extracted from a single-lead ECG signal segment. In the latter, the single-lead ECG signal is first decomposed into bands, and the statistical features are extracted from each segment of a given band and concatenated to form the feature vector. Nonoverlapping segments of different lengths (i.e., 1, 3, 5, 7, 10, or 15 sec) are examined. The extracted feature vectors are applied to a random forest classifier, for the purpose of identification. This study considers 290 reference subjects from the ECG database of the Physikalisch-Technische Bundesanstalt (PTB). The proposed identification algorithm achieved an accuracy rate of 99.61% utilizing the single limb lead (I) with the band-based approach. A single chest lead (V1), augmented limb lead (aVF), and Frank’s lead (Vx) achieved an accuracy rate of 99.37%, 99.76%, and 99.76%, respectively, using the same approach.

Highlights

  • The aim of a biometric system is to uniquely identify or authenticate persons based on one or more behavioral and/or physiological characteristics, including the retina, fingerprint, or gait [1, 2]

  • While in the band-based approach, the single-lead ECG signal is first decomposed into bands, and the statistical features are extracted from each band and concatenated to form the feature vector, which is fed to the random forest classifier

  • This paper presents an ECG-based identification system that relies on statistical features and random forest classifier

Read more

Summary

Introduction

The aim of a biometric system is to uniquely identify or authenticate persons based on one or more behavioral and/or physiological characteristics, including the retina, fingerprint, or gait [1, 2]. We propose a new nonfiducial method for subject identification based on statistical features and random forest classifier. Eleven statistical features are extracted directly from the single-lead ECG signal and fed to a random forest classifier. While in the band-based approach, the single-lead ECG signal is first decomposed into bands, and the statistical features are extracted from each band and concatenated to form the feature vector, which is fed to the random forest classifier. We show by the t-distribution stochastic neighbor embedding (t-SNE) algorithm that subjects’ features based on these statistics are separable, which leads to high subject identification rate. (3) It reports high identification accuracy results for 290 (healthy and nonhealthy) subjects using features extracted from simple statistics.

Method
Results and Discussion
II III aVR aVL aVF V1 V2 V3 V4 V5 V6 Vx Vy Vz
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.