Abstract

The technology developments in various domains generate the large amount data with millions of samples/instances and features. Some of the data are from many areas such as bio-informatics, text mining, and microarray data, which are commonly represented in high-dimensional feature vector, and prediction process is difficult task in this kind of data in field of pattern recognition, bioinformatics, statistical analysis, and machine learning. High dimensionality data increases the computational time as well as the space complexity while processing data. In general, most of the pattern recognition and machine learning techniques are available for processing the low-dimensional data; this will not solve the issues of high-dimensional data. To solve this issue, feature selection (FS) plays a vital role which is modeled to select the feature set from the greater number of features from the high-dimensional data; thereby, it builds the simpler model and gives the higher classification accuracy. Also, the FS process focuses on reducing and eliminating the dimensionality nature of the data by removing the irrelevant and redundant data and helps to improve the predictive modeling with the better visualization and understanding capabilities of the data. By considering the issues mentioned above, this chapter aims to provide the detailed introduction on FS techniques and gives the state-of-the-art methods, concerning machine learning and deep learning methods. At last, this chapter provides the various application domains which being in need of FS techniques, and also, further, this chapter also gives the directions to deal with the feature reduction problems occurring in the large voluminous dataset.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call