Review of Variable Selection Methods for Discriminant-Type Problems in Chemometrics

Michael D Sorochan Armstrong,James J Harynuk,A Paulina De La Mata

doi:10.3389/frans.2022.867938

Michael D Sorochan Armstrong, James J Harynuk + Show 1 more

Open Access

https://doi.org/10.3389/frans.2022.867938

Copy DOI

Abstract

Discriminant-type analyses arise from the need to classify samples based on their measured characteristics (variables), usually with respect to some observable property. In the case of samples that are difficult to obtain, or using advanced instrumentation, it is very common to encounter situations with many more measured characteristics than samples. The method of Partial Least Squares Regression (PLS-R), and its variant for discriminant-type analyses (PLS-DA) are among the most ubiquitous of these tools. PLS utilises a rank-deficient method to solve the inverse least-squares problem in a way that maximises the co-variance between the known properties of the samples (commonly referred to as the Y-Block), and their measured characteristics (the X-block). A relatively small subset of highly co-variate variables are weighted more strongly than those that are poorly co-variate, in such a way that an ill-posed matrix inverse problem is circumvented. Feature selection is another common way of reducing the dimensionality of the data to a relatively small, robust subset of variables for use in subsequent modelling. The utility of these features can be inferred and tested any number of ways, this are the subject of this review.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Analytical Science	Publication Date: May 19, 2022
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Review of Variable Selection Methods for Discriminant-Type Problems in Chemometrics

Abstract

Talk to us

Similar Papers

More From: Frontiers in Analytical Science

Lead the way for us

Similar Papers

Rapid prediction of multiple wine quality parameters using infrared spectroscopy coupling with chemometric methods
Xinpeng Ma ... Yankun Li
Journal of Food Composition and Analysis | VOL. 91
Xinpeng Ma, et. al.Xinpeng Ma ... Yankun Li
18 May 2020
Journal of Food Composition and Analysis | VOL. 91

Kernel Partial Least Square Regression with High Resistance to Multiple Outliers and Bad Leverage Points on Near-Infrared Spectral Data Analysis
Divo Dharma Silalahi ... Habshah Midi
Symmetry | VOL. 13
Divo Dharma Silalahi, et. al.Divo Dharma Silalahi ... Habshah Midi
26 Mar 2021
Symmetry | VOL. 13

An assessment of Random Forest wrappers for selecting important features of spectroscopy data in the modelling of soil properties
Francisco M Canero ... David Aragones
-
Francisco M Canero, et. al.Francisco M Canero ... David Aragones
27 Mar 2022
27 Mar 2022

A Hybrid Data Mining Technique for Improving the Classification Accuracy of Microarray Data Set
Sujata Dash ... Bichitrananda Patra
International Journal of Information Engineering and Electronic Business | VOL. 4
Sujata Dash, et. al.Sujata Dash ... Bichitrananda Patra
18 Apr 2012
International Journal of Information Engineering and Electronic Business | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Review of Variable Selection Methods for Discriminant-Type Problems in Chemometrics

Abstract

Talk to us

Similar Papers

More From: Frontiers in Analytical Science