Abstract

The success of biological signal pattern recognition depends crucially on the selection of relevant features. Across signal and imaging modalities, a large number of features have been proposed, leading to feature redundancy and the need for optimal feature set identification. A further complication is that, due to the inherent biological variability, even the same classification problem on different datasets can display variations in the respective optimal sets, casting doubts on the generalizability of relevant features. Here, we approach this problem by leveraging topological tools to create charts of features spaces. These charts highlight feature sub-groups that encode similar information (and their respective similarities) allowing for a principled and interpretable choice of features for classification and analysis. Using multiple electromyographic (EMG) datasets as a case study, we use this feature chart to identify functional groups among 58 state-of-the-art EMG features, and to show that they generalize across three different forearm EMG datasets obtained from able-bodied subjects during hand and finger contractions. We find that these groups describe meaningful non-redundant information, succinctly recapitulating information about different regions of feature space. We then recommend representative features from each group based on maximum class separability, robustness and minimum complexity.

Highlights

  • Biological pattern recognition systems are finding a growing number of applications, such as computer-aided diagnosis for breast cancer [1], prosthesis control [2] and brain–computer interfaces [3]

  • The topological network computed using MI 1⁄4 3 intervals overlapped with MO 1⁄4 50% from the 28 principal component (PC) scores for the first EMG dataset is shown in figure 1

  • The resulting topological network consists of 10 nodes and has a main structure shaped like the letter Y composed of three arms connected to a central core, along with two additional components disconnected from the main structure

Read more

Summary

Introduction

Biological pattern recognition systems are finding a growing number of applications, such as computer-aided diagnosis for breast cancer [1], prosthesis control [2] and brain–computer interfaces [3]. Great progress has been made using deep learning techniques when large amounts of labelled data are available [4]. In applications for which limited data are available and full deep learning is not yet viable, it is crucial to be able to identify optimal feature sets for classification and analytical purposes [5]. It is known that the feature sets yielding the best performances can change between very similar classification problems or datasets [6]. One should exercise caution when interpreting the optimal feature set for practical use in biological pattern recognition

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.