Supervised Feature Selection Technique Research Articles

Machine Learning (ML) techniques are becoming an invaluable support for network intrusion detection, especially in revealing anomalous flows, which often hide cyber-threats. Typically, ML algorithms are used to classify or recognize data traffic on the basis of statistical features such as inter-arrival times, packet length distribution, and mean number of flows. Dealing with the vast diversity and number of features that typically characterize data traffic is a hard problem, and it raises two issues: (i) a large number of features leads to lengthy training processes (particularly when features are highly correlated), while prediction accuracy does not improve proportionally; (ii) some features may introduce bias during classification, particularly those that have little relation to the data traffic to be classified. By reducing the feature space and retaining only the most significant features, Feature Selection (FS) therefore becomes a crucial pre-processing step in network management and, specifically, in network intrusion detection. In this review paper, we complement other surveys in multiple ways: (i) evaluating more recent datasets (updated with respect to the obsolete KDD 99) by means of a Python-based procedure designed from scratch; (ii) providing a synopsis of the most credited FS approaches in the field of intrusion detection, including Multi-Objective Evolutionary techniques; (iii) assessing various experimental analyses such as feature correlation, time complexity, and performance. Our comparisons offer useful guidelines to network and security managers who are considering incorporating ML concepts into network intrusion detection, where trade-offs between performance and resource consumption are crucial.
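The idea of retaining only the most significant, non-redundant features can be illustrated with a minimal filter-style sketch. It is not the procedure evaluated in the review: the function names, the Pearson-correlation ranking, the redundancy threshold of 0.9, and the toy traffic columns are all assumptions chosen for illustration.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)

def select_features(columns, labels, k, redundancy_threshold=0.9):
    """Pick up to k feature names (hypothetical helper, not from the paper).

    Features are ranked by |correlation with the label|; a candidate is
    skipped when it is highly correlated with an already selected feature.
    """
    ranked = sorted(
        columns,
        key=lambda name: abs(pearson(columns[name], labels)),
        reverse=True,
    )
    selected = []
    for name in ranked:
        if len(selected) == k:
            break
        redundant = any(
            abs(pearson(columns[name], columns[kept])) >= redundancy_threshold
            for kept in selected
        )
        if not redundant:
            selected.append(name)
    return selected

# Toy flow statistics (made-up values); label 1 marks an anomalous flow.
flows = {
    "mean_packet_len": [500, 520, 480, 510, 90, 100, 110, 95],
    "mean_packet_len_x2": [1000, 1040, 960, 1020, 180, 200, 220, 190],
    "flow_count": [1, 2, 1, 3, 3, 4, 2, 4],
    "noise": [3, 1, 2, 2, 2, 3, 1, 2],
}
labels = [0, 0, 0, 0, 1, 1, 1, 1]

print(select_features(flows, labels, k=2))
```

On this toy data the rescaled copy of the packet-length column is skipped as redundant, which mirrors the abstract's point that highly correlated features add training cost without adding information.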

In this paper, we investigate the potential of unsupervised feature selection techniques for classification tasks where only sparse training data are available. The motivation is that unsupervised feature selection techniques combine the advantages of standard dimensionality reduction techniques (which rely only on the given feature vectors and not on the corresponding labels) and supervised feature selection techniques (which retain a subset of the original set of features). Feature selection thus becomes independent of the given classification task and, consequently, a subset of generally versatile features is retained. We present different techniques relying on the topology of the given sparse training data, where the topology is described with an ultrametricity index. We consider the Murtagh Ultrametricity Index (MUI), which is defined on the basis of triangles within the given data, and the Topological Ultrametricity Index (TUI), which is defined on the basis of a specific graph structure. In a case study addressing the classification of high-dimensional hyperspectral data based on sparse training data, we compare the proposed unsupervised feature selection techniques with standard dimensionality reduction and supervised feature selection techniques on four commonly used benchmark datasets. The classification results reveal that supervised and unsupervised feature selection techniques lead to similar classification results, while the latter perform feature selection independently of the given classification task and thus deliver generally versatile features.
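To make the notion of a triangle-based ultrametricity score more concrete, the sketch below counts how many point triplets form (approximately) isosceles triangles whose two longest sides are equal, which is the defining property of triangles in an ultrametric space. This is a simplified illustration and not the MUI or TUI as defined in the paper; the function names and the relative tolerance are assumptions.

```python
import itertools
import math

def euclidean(p, q):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def ultrametric_triangle_fraction(points, rel_tol=0.05):
    """Fraction of triplets whose two longest sides are (almost) equal.

    In an ultrametric space d(x, z) <= max(d(x, y), d(y, z)) holds for every
    triplet, so every triangle is isosceles with a short base. The tolerance
    rel_tol is an assumed parameter, not a value from the paper.
    """
    total = 0
    ultrametric = 0
    for a, b, c in itertools.combinations(points, 3):
        sides = sorted([euclidean(a, b), euclidean(b, c), euclidean(a, c)])
        middle, longest = sides[1], sides[2]
        total += 1
        if longest == 0 or (longest - middle) <= rel_tol * longest:
            ultrametric += 1
    return ultrametric / total if total else 0.0

# An equilateral triangle satisfies the condition; three evenly spaced
# collinear points (sides 1, 1, 2) do not.
equilateral = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
collinear = [(0.0,), (1.0,), (2.0,)]

print(ultrametric_triangle_fraction(equilateral))
print(ultrametric_triangle_fraction(collinear))
```

Because the score uses only the feature vectors and no labels, it fits the unsupervised setting described above. For larger datasets, sampling triplets instead of enumerating all of them would keep the cost manageable.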

Articles published on Supervised Feature Selection Technique

Explainable machine learning models for Medicare fraud detection

Identifying Acoustic Features to Distinguish Highly and Moderately Altered Soundscapes in Colombia

Designing a supervised feature selection technique for mixed attribute data analysis

Machine Learning Algorithms in Corroboration with Isotope and Elemental Profile—An Efficient Tool for Honey Geographical Origin Assessment

Gene Selection in Binary Classification Problems Within Functional Genomics Experiments via Robust Fisher Score

Supervised feature selection techniques in network intrusion detection: A critical review

A rough-GA based optimal feature selection in attribute profiles for classification of hyperspectral imagery

Semantic textual similarity between sentences using bilingual word semantics

Unsupervised Feature Selection Based on Ultrametricity and Sparse Training Data: A Case Study for the Classification of High-Dimensional Hyperspectral Data

Analysis of Supervised Feature Selection Techniques on Animal Husbandry Dataset

A hybrid isotonic separation training algorithm with correlation-based isotonic feature selection for binary classification

RETRACTED ARTICLE: Tolerance rough set firefly-based quick reduct

A novel breast tumor classification algorithm using neutrosophic score features

Self-adaptive differential evolution for feature selection in hyperspectral image data
