Optimal heterogeneous domain adaptation for text classification in transfer learning

Anshu Khurana,Om Prakash Verma

doi:10.1016/j.compeleceng.2024.109192

Abstract

The prime challenge of unsupervised symmetric heterogeneous cross-domain adaptation is to train the source domain and apply the trained knowledge to the target domain. Most of the existing algorithms for unsupervised transfer learning create the subspace of the source domain features and target domain features for training purposes. It is an extensive computational process as most of the techniques require labeled source data. Many techniques also suffer from the loss of originality of features in both domains. This paper aims to consider the feature vectors of both the source and target domain for training the data based on the similarity of exemplar (feature) vectors of different instances, known as Instance Similarity Feature (ISF). The use of vectorization method for the similarity of features is proposed in this paper. The exemplar vectors are chosen randomly for the target datasets. Hence, to acquire relevant factual data in the knowledge base for training in our research, we worked to increase the domain separation error between source and target instances. To avoid the instability caused due to poor exemplar vector selection, the K-means clustering approach is followed after feature similarity, known as K-means Instance Similarity Feature (KISF). Many existing transfer learning techniques are based on the original feature set, which can cause degeneracy, hence affecting Accuracy. In order to vanquish the limitations of existing approaches, we have introduced novel optimal models with KISF and Ant Lion Optimizer (KISFA), KISF with Particle Swarm Optimization (KISFP) and KISF with Biogeography Based Optimization (KISFB). High-dimensionality can impact efficacy of the model, hence, feature selection with nature-based optimizer namely: Ant Lion Optimizer, Particle Swarm Optimization and Biogeography-Based Optimization are applied. We measure the performance of the proposed models by using Support Vector Machine, Logistic Regression, Random Forest, Naive Baye’s, K-Nearest Neighbor and Decision Tree as classifiers, and Accuracy and F1-score as fitness functions. Extensive experiments are performed on four datasets with 50 iterations. The proposed model is compared with eleven other techniques and our technique outperforms all other techniques in average Accuracy. The validation is performed on the dataset using 10-fold cross-validation. The statistical test was performed using ANOVA, proving that our technique is significantly better than other techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal heterogeneous domain adaptation for text classification in transfer learning

Abstract

Talk to us

Similar Papers

More From: Computers and Electrical Engineering

Lead the way for us

Similar Papers

A particle swarm optimization-based feature selection for unsupervised transfer learning
Rakesh Kumar Sanodiya ... Jimson Mathew
Soft Computing | VOL. 24
Rakesh Kumar Sanodiya, et. al.Rakesh Kumar Sanodiya ... Jimson Mathew
26 Jun 2020
Soft Computing | VOL. 24

A particle swarm optimization based feature selection approach to transfer learning in classification
Bach Hoai Nguyen ... Bing Xue
-
Bach Hoai Nguyen, et. al.Bach Hoai Nguyen ... Bing Xue
02 Jul 2018
02 Jul 2018

A New Belief-Based Bidirectional Transfer Classification Method.
Zhun-Ga Liu ... Quan Pan
IEEE Transactions on Cybernetics | VOL. 52
Zhun-Ga Liu, et. al.Zhun-Ga Liu ... Quan Pan
01 Aug 2022
IEEE Transactions on Cybernetics | VOL. 52

Heterogeneous domain adaptation by Features Normalization and Data Topology Preserving
Mohammad Amin Pirbonyeh ... Shahab Shamshirband
Knowledge-Based Systems | VOL. 257
Mohammad Amin Pirbonyeh, et. al.Mohammad Amin Pirbonyeh ... Shahab Shamshirband
29 Jul 2022
Knowledge-Based Systems | VOL. 257

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal heterogeneous domain adaptation for text classification in transfer learning

Abstract

Talk to us

Similar Papers

More From: Computers and Electrical Engineering