Abstract

Individual feature selection algorithms, used for processing high-dimensional multi-source heterogeneous data may lead to weak predictions. The traditional single method process may not ensure the selection of relevant features. The selections of features are susceptible to the changes in input data, and thus fail to perform consistently. These challenges can be overcome by having a robust feature selection algorithm that generates a subset of original features and evaluates the candidate set to check for its relevance. Also, it determines the feasibility of the selected subset of features. The fundamental tasks of selecting feature subset minimize the complexity of the model and help to facilitate the further processing of the model. The limitations of using single feature selection technique can be reduced by combining multiple techniques to generate the effective features. There is a need to design efficient approaches and technique for estimating the feature relevance. This ensemble approach will help to include diversity at input data level, as well as the computational technique. The proposed method—Ensemble Bootstrap Genetic Algorithm (EnBGA)—generates the effective feature subset for the multi-source heterogeneous data. Various univariate and multivariate base selectors are combined together to ensure the robustness and stability of the algorithm. In this pandemic of COVID-19, it’s observed that patients already diagnosed with diseases such as diabetes had an increased mortality rate. The proposed method performs feature analysis for such data, where the Genetic Algorithm searches the feature subset and extracts the most relevant features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.