Abstract

AbstractFeature selection is a nontrivial preprocessing technique in many practical application domains. There are three key challenges with respect to real‐world data. Firstly, the dimensionality of data keeps growing and will achieve hundreds of millions. Secondly, the data has the characteristic of high‐dimensional and small‐size. Thirdly, practical applications need to process each feature in an online manner. However, most of the previous methods only pay much attention to solving the challenges of high dimensionality and online stream. To address all issues above, we propose OFSI, in this article, an nline streaming eature election based on feature nteraction method for feature selection. OFSI can effectively select the streaming features that are strongly related to each other in high‐dimensional and small‐size data, via using feature interaction. Furthermore, to address upcoming features that arrive by groups, we present a new group‐OFSI algorithm for online group feature selection. An extensive experiment using a series of benchmark data sets shows that the proposed two algorithms, OFSI and group‐OFSI, outperform six state‐of‐the‐art online streaming feature selection methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call