Size Of Feature Space Research Articles

Thyroid cancer is a life-threatening condition that arises from the cells of the thyroid gland, located at the front of the neck just below the Adam’s apple. Although less prevalent than many other cancers, it is among the most commonly observed cancers of the endocrine system. Machine learning has emerged as a valuable tool in medical diagnostics, including the detection of thyroid abnormalities. Feature selection is central to machine learning because it reduces data dimensionality and concentrates on the most pertinent features, which improves model performance, reduces training time, and enhances interpretability. This study examined binary variants of FOX-optimization algorithms for feature selection. Eight transfer functions (S-shaped and V-shaped) were used to convert the FOX-optimization algorithms into their binary versions. Pre-trained vision transformer models (DeiT and Swin Transformer) were used for feature extraction. The extracted features were transformed using locally linear embedding, and the binary FOX-optimization algorithms were applied for feature selection in conjunction with a Naïve Bayes classifier. The study used two thyroid cancer image datasets (ultrasound and histopathological). Benchmarking was performed using a half-quadratic theory-based ensemble ranking technique: two TOPSIS-based methods (H-TOPSIS and A-TOPSIS) produced the initial model rankings, and an ensemble technique produced the final ranking. The problem was treated as a multi-objective optimization task with accuracy, F2-score, AUC-ROC, and feature space size as the optimization goals. The binary FOX-optimization algorithm based on the V_1 transfer function outperformed the other variants on both datasets and with both feature extraction techniques.
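The conversion of a continuous optimizer into a binary feature selector through transfer functions can be sketched as follows. This is a minimal illustration, not the study's implementation: the sigmoid and |tanh| used here are common textbook choices for S-shaped and V-shaped functions and are assumptions (the abstract does not define its eight functions, including V_1), and the helper names are hypothetical.

```python
import math
import random

def s_shaped(x):
    """Assumed S-shaped transfer function (sigmoid): maps a position to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

def v_shaped(x):
    """Assumed V-shaped transfer function (|tanh|): maps a position to [0, 1)."""
    return abs(math.tanh(x))

def binarize_s(position, rng):
    """S-shaped rule: set each bit to 1 with probability s_shaped(x)."""
    return [1 if rng.random() < s_shaped(x) else 0 for x in position]

def binarize_v(position, current_bits, rng):
    """V-shaped rule: flip the current bit with probability v_shaped(x)."""
    return [1 - b if rng.random() < v_shaped(x) else b
            for x, b in zip(position, current_bits)]

rng = random.Random(0)                # fixed seed so the sketch is repeatable
position = [-2.0, 0.0, 0.5, 3.0]      # hypothetical continuous agent position
bits = binarize_s(position, rng)      # 1 = feature kept, 0 = feature dropped
flipped = binarize_v(position, bits, rng)
selected = [i for i, b in enumerate(flipped) if b == 1]
```

Under the S-shaped rule the bit is resampled from scratch at every step, whereas the V-shaped rule keeps the current bit unless the position has a large magnitude, so a position near zero leaves the selection unchanged.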
The proposed framework, which comprises a Swin Transformer for feature extraction, a FOX-optimization algorithm with the V_1 transfer function for feature selection, and a Naïve Bayes classifier, obtained the best performance on both datasets. On the ultrasound dataset, the best model achieved an accuracy of 94.75%, an AUC-ROC of 0.9848, an F2-score of 0.9365, and an inference time of 0.0353 seconds, with 5 selected features. On the histopathological dataset, it achieved an accuracy of 89.71%, an AUC-ROC of 0.9329, an F2-score of 0.8760, and an inference time of 0.05141 seconds, with 12 selected features. The proposed model achieved results comparable to existing research while using a small feature space.
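Ranking models on several criteria at once, with accuracy, F2-score, and AUC-ROC to be maximized and feature space size to be minimized, can be illustrated with classic TOPSIS. The sketch below is the textbook method, not the H-TOPSIS or A-TOPSIS variants or the half-quadratic ensemble named in the abstract. The candidate rows and the equal weights are hypothetical values chosen only for illustration.

```python
import math

def topsis(matrix, weights, benefit):
    """Classic TOPSIS closeness scores (higher is better).

    matrix  : one row per candidate, one column per criterion
    weights : criterion weights
    benefit : True if larger is better for that criterion, False if smaller is
    """
    n_crit = len(weights)
    # Vector-normalize each column, then apply the weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
                for row in matrix]
    columns = list(zip(*weighted))
    ideal = [max(c) if benefit[j] else min(c) for j, c in enumerate(columns)]
    anti = [min(c) if benefit[j] else max(c) for j, c in enumerate(columns)]
    scores = []
    for row in weighted:
        d_pos = math.dist(row, ideal)   # distance to the ideal solution
        d_neg = math.dist(row, anti)    # distance to the anti-ideal solution
        # Assumes candidates are not all identical (d_pos + d_neg > 0).
        scores.append(d_neg / (d_pos + d_neg))
    return scores

# Hypothetical candidates: [accuracy, F2-score, AUC-ROC, number of features]
candidates = [
    [0.94, 0.93, 0.98, 5],
    [0.90, 0.88, 0.95, 40],
    [0.85, 0.80, 0.90, 100],
]
weights = [0.25, 0.25, 0.25, 0.25]      # assumed equal weights
benefit = [True, True, True, False]     # fewer features is better
scores = topsis(candidates, weights, benefit)
ranking = sorted(range(len(scores)), key=lambda i: -scores[i])
```

Treating feature count as a cost criterion is what lets a compact model outrank a slightly more accurate one that needs many more features.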

Various methods for ensemble selection and classifier combination have been designed to optimize the performance of classifier ensembles. However, a large number of features in the training data can degrade the classification performance of machine learning algorithms. The objective of this paper is to present a novel feature elimination (FE) based ensemble learning method that extends an existing machine learning environment. Standard 12-lead ECG recordings are used to diagnose arrhythmia by classifying subjects as normal or abnormal. The advantage of the proposed approach is that it reduces the size of the feature space by applying several feature elimination methods, whose decisions are then fused into a single data set. The idea behind this work is to find a reduced feature space such that a classifier built on the smaller data set performs no worse than a classifier built on the original data set. A random subspace ensemble classifier is used, with the PART tree as the base classifier. The proposed approach has been implemented and evaluated on the UCI ECG signal data. Classification performance has been evaluated using mean absolute error, root mean squared error, relative absolute error, F-measure, classification accuracy, receiver operating characteristics, and area under the curve. The proposed approach achieved an overall classification accuracy of 91.11% on an unseen test data set, and it performed well with ensemble sizes of 15 and 20.
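The random subspace idea in this abstract, where each ensemble member is trained on a random subset of the features and the members vote, can be sketched as follows. This is an illustration under stated assumptions rather than the paper's method: a nearest-centroid learner stands in for the PART tree base classifier, the feature elimination step is omitted, and the toy data and function names are hypothetical.

```python
import random
from collections import Counter

def fit_centroids(X, y, features):
    """Per-class mean vector, computed only over the chosen feature subset."""
    counts = Counter(y)
    sums = {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for k, f in enumerate(features):
            acc[k] += row[f]
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}

def predict_centroid(centroids, features, row):
    """Label of the nearest class centroid within the member's subspace."""
    def dist2(center):
        return sum((row[f] - center[k]) ** 2 for k, f in enumerate(features))
    return min(centroids, key=lambda label: dist2(centroids[label]))

def fit_random_subspace(X, y, n_members, subspace_size, rng):
    """Train n_members base learners, each on a random subset of the features."""
    n_features = len(X[0])
    members = []
    for _ in range(n_members):
        features = rng.sample(range(n_features), subspace_size)
        members.append((features, fit_centroids(X, y, features)))
    return members

def predict_ensemble(members, row):
    """Majority vote over all ensemble members."""
    votes = Counter(predict_centroid(c, f, row) for f, c in members)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: 4 features, class 0 near 0 and class 1 near 10.
X = [[0, 1, 0, 1], [1, 0, 1, 0], [10, 9, 10, 9], [9, 10, 9, 10]]
y = [0, 0, 1, 1]
members = fit_random_subspace(X, y, n_members=15, subspace_size=2,
                              rng=random.Random(0))
low = predict_ensemble(members, [1, 1, 1, 1])
high = predict_ensemble(members, [9, 9, 9, 9])
```

The ensemble size of 15 mirrors one of the sizes the abstract reports as performing well; the subspace size of 2 is an arbitrary choice for the toy data.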

Articles published on Size Of Feature Space

Comparative performance analysis of binary variants of FOX optimization algorithm with half-quadratic ensemble ranking method for thyroid cancer detection

Conditional Random Fields for Multiview Sequential Data Modeling.

High accuracy multilayer autoencoder trained classification method for diagnosis of Parkinson’s disease using vocal signals

Modified Genetic Algorithm for Feature Selection and Hyper Parameter Optimization: Case of XGBoost in Spam Prediction

Deep mining of open source software bug repositories

Machine learning topological phases in real space

A Compressive Sensing Model for Speeding Up Text Classification.

Machine Learning Approach for Answer Detection in Discussion Forums: An Application of Big Data Analytics

Analysis and prediction in sparse and high dimensional text data: The case of Dow Jones stock market

Locality-adapted kernel densities of term co-occurrences for location prediction of tweets

Extensive Experimental Evaluation of Self-Organizing Maps for Automatic Classification of a Multi-Class Multi-Label Corpus

Incorporating known malware signatures to classify new malware variants in network traffic

Impact of feature selection on the accuracy and spatial uncertainty of per-field crop classification using Support Vector Machines

Feature elimination based random subspace ensembles learning for ECG arrhythmia diagnosis

Prediction of Thermophilic Protein with Pseudo Amino Acid Composition: An Approach from Combined Feature Selection and Reduction

Gas chimney detection based on improving the performance of combined multilayer perceptron and support vector classifier

Language morphology offset: Text classification on a Croatian–English parallel corpus

Genetic engineering of hierarchical fuzzy regional representations for handwritten character recognition
