Semi-supervised Feature Selection Research Articles

With the development of smart power grids, communication network technology and sensor technology, there has been an exponential growth in complex electricity load data. Irregular electricity load fluctuations caused by the weather and holiday factors disrupt the daily operation of the power companies. To deal with these challenges, this paper investigates a day-ahead electricity peak load interval forecasting problem. It transforms the conventional continuous forecasting problem into a novel interval forecasting problem, and then further converts the interval forecasting problem into the classification forecasting problem. In addition, an indicator system influencing the electricity load is established from three dimensions, namely the load series, calendar data, and weather data. A semi-supervised feature selection algorithm is proposed to address an electricity load classification forecasting issue based on the group method of data handling (GMDH) technology. The proposed algorithm consists of three main stages: (1) training the basic classifier; (2) selectively marking the most suitable samples from the unclassified label data, and adding them to an initial training set; and (3) training the classification models on the final training set and classifying the test samples. An empirical analysis of electricity load dataset from four Chinese cities is conducted. Results show that the proposed model can address the electricity load classification forecasting problem more efficiently and effectively than the FW-Semi FS (forward semi-supervised feature selection) and GMDH-U (GMDH-based semi-supervised feature selection for customer classification) models.

Read full abstract

Quantitative structure-activity relationship (QSAR) is an effective computational technique for drug design that relates the chemical structures of compounds to their biological activities. Feature selection is an important step in QSAR based drug design to select the most relevant descriptors. One of the most popular feature selection methods for classification problems is Fisher score which aim is to minimize the within-class distance and maximize the between-class distance. In this study, the properties of Fisher criterion were extended for QSAR models to define the new distance metrics based on the continuous activity values of compounds with known activities. Then, a semi-supervised feature selection method was proposed based on the combination of Fisher and Laplacian criteria which exploits both compounds with known and unknown activities to select the relevant descriptors. To demonstrate the efficiency of the proposed semi-supervised feature selection method in selecting the relevant descriptors, we applied the method and other feature selection methods on three QSAR data sets such as serine/threonine-protein kinase PLK3 inhibitors, ROCK inhibitors and phenol compounds. The results demonstrated that the QSAR models built on the selected descriptors by the proposed semi-supervised method have better performance than other models. This indicates the efficiency of the proposed method in selecting the relevant descriptors using the compounds with known and unknown activities. The results of this study showed that the compounds with known and unknown activities can be helpful to improve the performance of the combined Fisher and Laplacian based feature selection methods.

Read full abstract

Semi-supervised Feature Selection Research Articles

Related Topics

Articles published on Semi-supervised Feature Selection

Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

Rough set based semi-supervised feature selection via ensemble selector

Semi-supervised feature selection analysis with structured multi-view sparse regularization

Semi-Supervised Feature Selection via Insensitive Sparse Regression with Application to Video Semantic Recognition

Semi-supervised sparse feature selection via graph Laplacian based scatter matrix for regression problems

Discriminative Semi-Supervised Feature Selection via Rescaled Least Squares Regression-Supplement

Feature selection in machine learning: A new perspective

Retail business analytics: Customer visit segmentation using market basket data

GMDH-Based Semi-Supervised Feature Selection for Electricity Load Classification Forecasting

A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities.

Semi-supervised adaptive feature analysis and its application for multimedia understanding

Simple strategies for semi-supervised feature selection

GMDH-based semi-supervised feature selection for customer classification

Constraint score for semi-supervised feature selection in ligand-and receptor-based QSAR on serine/threonine-protein kinase PLK3 inhibitors

Multimedia annotation via semi-supervised shared-subspace feature selection

Semi-supervised feature selection with sparse representation for hyperspectral image classification

Semi-supervised feature selection with sparse representation for hyperspectral image classification

A Systematic Semi-Supervised Self-adaptable Fault Diagnostics approach in an evolving environment

A Survey on semi-supervised feature selection methods

Semi-supervised feature selection with exploiting shared information among multiple tasks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Semi-supervised Feature Selection Research Articles

Related Topics

Articles published on Semi-supervised Feature Selection

Adaptive Semi-Supervised Feature Selection for Cross-Modal Retrieval

Rough set based semi-supervised feature selection via ensemble selector

Semi-supervised feature selection analysis with structured multi-view sparse regularization

Semi-Supervised Feature Selection via Insensitive Sparse Regression with Application to Video Semantic Recognition

Semi-supervised sparse feature selection via graph Laplacian based scatter matrix for regression problems

Discriminative Semi-Supervised Feature Selection via Rescaled Least Squares Regression-Supplement

Feature selection in machine learning: A new perspective

Retail business analytics: Customer visit segmentation using market basket data

GMDH-Based Semi-Supervised Feature Selection for Electricity Load Classification Forecasting

A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities.

Semi-supervised adaptive feature analysis and its application for multimedia understanding

Simple strategies for semi-supervised feature selection

GMDH-based semi-supervised feature selection for customer classification

Constraint score for semi-supervised feature selection in ligand-and receptor-based QSAR on serine/threonine-protein kinase PLK3 inhibitors

Multimedia annotation via semi-supervised shared-subspace feature selection

Semi-supervised feature selection with sparse representation for hyperspectral image classification

Semi-supervised feature selection with sparse representation for hyperspectral image classification

A Systematic Semi-Supervised Self-adaptable Fault Diagnostics approach in an evolving environment

A Survey on semi-supervised feature selection methods

Semi-supervised feature selection with exploiting shared information among multiple tasks