Unsupervised Feature Transformation Research Articles

Extreme learning machine (ELM) has been applied to a wide range of classification and regression problems because of its high accuracy and efficiency. However, ELM assumes that training and testing data are drawn from the same distribution, and in real-world situations this assumption is often violated. As a result, ELM performs poorly in domain adaptation problems, in which the training data (source domain) and testing data (target domain) are differently distributed but related. In this paper, an ELM-based space learning algorithm, domain space transfer ELM (DST-ELM), is developed to deal with unsupervised domain adaptation problems. Specifically, DST-ELM reconstructs the source and target data in a domain-invariant space without using target data labels. Two goals are achieved simultaneously. First, the target data are fed into an ELM-based feature space learning network whose output is trained to approximate its input, so that the structural knowledge and intrinsic discriminative information of the target domain are preserved as much as possible. Second, the source data are projected into the same space as the target data, and the distribution distance between the two domains is minimized in that space. This unsupervised feature transformation network is followed by an adaptive ELM classifier, which is trained on the transferred labeled source samples and used to predict target data labels. Moreover, the ELMs in the proposed method, including both the space learning ELM and the classifier, require only a small number of hidden nodes, thus maintaining low computational complexity. Extensive experiments on real-world image and text datasets verify that the approach outperforms several existing domain adaptation methods in terms of accuracy while maintaining high efficiency.
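For readers unfamiliar with the ELM building block the abstract above relies on, the following is a minimal sketch of a plain single-hidden-layer ELM classifier: a fixed random hidden layer followed by output weights solved in closed form. It is not DST-ELM; it omits the feature space learning network and the distribution distance term entirely. The function names `train_elm` and `predict_elm`, the `tanh` activation, and the ridge parameter `reg` are assumptions chosen for illustration.

```python
import numpy as np

def train_elm(X, y, n_hidden=20, reg=1e-2, seed=0):
    """Fit a basic ELM classifier (illustrative only, not DST-ELM).

    X: (n_samples, n_features) array; y: integer class labels 0..K-1.
    """
    rng = np.random.default_rng(seed)
    n_classes = int(y.max()) + 1
    # Hidden-layer weights and biases are drawn at random and never trained.
    W = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    H = np.tanh(X @ W + b)                 # hidden-layer output matrix
    T = np.eye(n_classes)[y]               # one-hot targets
    # Output weights via ridge-regularised least squares (assumed regulariser).
    beta = np.linalg.solve(H.T @ H + reg * np.eye(n_hidden), H.T @ T)
    return W, b, beta

def predict_elm(model, X):
    """Return the predicted class index for each row of X."""
    W, b, beta = model
    return np.argmax(np.tanh(X @ W + b) @ beta, axis=1)
```

Because only `beta` is learned, training reduces to one linear solve whose size depends on the number of hidden nodes, which is consistent with the abstract's point that a small hidden layer keeps the cost low.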

Datasets with heterogeneous features can lead to inappropriate feature selection results because it is difficult to evaluate heterogeneous features concurrently. Feature transformation (FT) is another way to handle heterogeneous feature subset selection. However, transforming non-numerical features into numerical ones may introduce redundancy with respect to the original numerical features. In this paper, we propose a method to select a feature subset based on mutual information (MI) for classifying heterogeneous features. We use unsupervised feature transformation (UFT) methods and joint mutual information maximisation (JMIM) methods. UFT methods are used to transform non-numerical features into numerical features. JMIM methods are used to select a feature subset with consideration of the class label. The transformed and original features are combined, a feature subset is then determined using JMIM methods, and the selected features are classified using the support vector machine (SVM) algorithm. Classification accuracy is measured for each number of selected features and compared between UFT-JMIM methods and Dummy-JMIM methods. Averaged over all experiments in this study, the classification accuracy achieved by UFT-JMIM methods is about 84.47%, and that achieved by Dummy-JMIM methods is about 84.24%. This result shows that UFT-JMIM methods can minimize information loss between transformed and original features, and select a feature subset that avoids redundant and irrelevant features.
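The quantity underlying the selection step in the abstract above is the mutual information between a feature and the class label. As a rough illustration, the sketch below estimates I(X; Y) in bits from the empirical joint distribution of two discrete sequences. It is only the basic building block, not the JMIM criterion or the UFT transformation; the function name `mutual_information` is hypothetical.

```python
import numpy as np

def mutual_information(x, y):
    """Empirical mutual information I(X; Y) in bits for two discrete sequences."""
    # Map arbitrary category values (strings, ints) to integer codes.
    _, xi = np.unique(np.asarray(x), return_inverse=True)
    _, yi = np.unique(np.asarray(y), return_inverse=True)
    xi, yi = xi.ravel(), yi.ravel()
    # Build the joint count table, then normalise to a joint distribution.
    joint = np.zeros((xi.max() + 1, yi.max() + 1))
    np.add.at(joint, (xi, yi), 1)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)   # marginal of X, shape (a, 1)
    py = joint.sum(axis=0, keepdims=True)   # marginal of Y, shape (1, b)
    nz = joint > 0                          # skip empty cells: 0 * log 0 = 0
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (px @ py)[nz])))
```

A feature that determines a balanced binary label exactly carries one bit about it, while a feature independent of the label carries none, so ranking features by this score favours the informative ones.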

Articles published on Unsupervised Feature Transformation

Improving Deep Forest by Screening

Application of Fuzzy Clustering for Text Data Dimensionality Reduction

Assessing Information Transmission in Data Transformations with the Channel Multivariate Entropy Triangle.

Domain Space Transfer Extreme Learning Machine for Domain Adaptation.

Feature Selection Methods Based on Mutual Information for Classifying Heterogeneous Features

Heterogeneous feature subset selection using mutual information-based feature transformation

Clustering Heterogeneous Data with k-Means by Mutual Information-Based Unsupervised Feature Transformation

A minimax probabilistic approach to feature transformation for multi-class data

Comparison of Processing Chains Based on Support Vector Machine Classifier for Hyperspectral Image Classification
