Abstract

Big data technology has become widely adopted across industries owing to characteristics such as high value, large volume, rapid velocity, wide variety, and significant variability. Nevertheless, big data raises several challenges that must be addressed, including long processing times, high computational complexity, imprecise features, significant sparsity, irrelevant terms, redundancy, and noise, all of which can degrade the performance of feature extraction. This research tackles these issues by using the Partial Least Square Generalized Linear Regression (G-PLSGLR) approach to reduce the high dimensionality of text data. The proposed algorithm consists of four stages: first, collecting feature data in a vector space model (VSM) and training it with a bootstrap technique; second, grouping the trained feature samples using the Pearson correlation coefficient and a graph-based technique; third, removing unimportant features by ranking the significant group features using PLSGLR; and finally, selecting or extracting the significant features using the Bayesian information criterion (BIC). The G-PLSGLR algorithm surpasses existing methods, achieving a high reduction rate and strong classification performance while minimizing feature redundancy, time consumption, and complexity. Furthermore, it improves feature accuracy by 35%.
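The abstract only outlines the four stages, so the following is a minimal, hypothetical Python sketch of how such a pipeline could be wired together. `TfidfVectorizer` stands in for the VSM, scikit-learn's `PLSRegression` approximates the paper's PLS generalized linear regression ranking step, and all function names (`vsm_bootstrap`, `group_features`, `rank_features`, `bic_select`), the 0.7 correlation threshold, and the greedy BIC pass are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of a four-stage G-PLSGLR-style pipeline; not the
# authors' code. Stand-ins: TF-IDF for the VSM, sklearn PLSRegression
# for the PLS generalized linear regression ranking step.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cross_decomposition import PLSRegression
from sklearn.utils import resample
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components


def vsm_bootstrap(docs, labels, random_state=0):
    """Stage 1: represent documents in a vector space model (TF-IDF)
    and draw a bootstrap sample of the rows (stratified so both
    classes survive resampling)."""
    X = TfidfVectorizer().fit_transform(docs).toarray()
    y = np.asarray(labels, dtype=float)
    return resample(X, y, stratify=y, random_state=random_state)


def group_features(X, threshold=0.7):
    """Stage 2: connect feature pairs whose absolute Pearson
    correlation exceeds a threshold, then take the connected
    components of that graph as feature groups."""
    corr = np.nan_to_num(np.corrcoef(X, rowvar=False))  # NaN for constant columns
    adjacency = csr_matrix(np.abs(corr) >= threshold)
    _, component = connected_components(adjacency, directed=False)
    return component  # component[j] = group id of feature j


def rank_features(X, y):
    """Stage 3: rank features by the magnitude of their PLS
    regression coefficients against the class label."""
    pls = PLSRegression(n_components=min(2, X.shape[1])).fit(X, y)
    return np.abs(pls.coef_).ravel()


def bic_select(X, y, order):
    """Stage 4: greedily keep top-ranked features while the Bayesian
    information criterion of a least-squares fit keeps improving."""
    n = len(y)
    best_bic, selected = np.inf, []
    for j in order:
        cols = selected + [j]
        beta, *_ = np.linalg.lstsq(X[:, cols], y, rcond=None)
        rss = np.sum((y - X[:, cols] @ beta) ** 2)
        bic = n * np.log(rss / n + 1e-12) + len(cols) * np.log(n)
        if bic < best_bic:
            best_bic, selected = bic, cols
    return selected


if __name__ == "__main__":
    docs = ["big data mining", "data stream velocity",
            "noisy sparse text", "text feature noise"]
    X, y = vsm_bootstrap(docs, labels=[0, 0, 1, 1])
    groups = group_features(X)
    order = np.argsort(-rank_features(X, y))
    print("feature groups:", groups)
    print("selected feature indices:", bic_select(X, y, order))
```

Treating correlated features as connected components collapses each redundant cluster cheaply, and the BIC pass then penalizes every retained feature by log n, which is what drives the reduction rate the abstract reports.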
