Abstract
In quantitative structure-activity relationship (QSAR), multivariate statistical methods are commonly used for data analysis. Partial least squares (PLS) is of particular interest because it can analyze data with strongly collinear, noisy and numerous X variables, and also simultaneously models several activity variables Y. PLS provides several prediction regions and diagnostic plots as statistical measures. PLS has evolved for coping with the severe demands imposed by complex data structures. In this review article, we outline the algorithms of five advanced PLS techniques and provide some representative examples of each. The selected models are Nonlinear PLS, Multiway PLS, Hierarchical PLS, Orthogonal PLS, and Bi-modal PLS. Studies of particular aspects of living cells (such as the set of genes or proteins in the cell and their interactions) are collectively known as the -omics fields. Omics integrate heterogeneous scientific disciplines and include chemogenomics, proteomics, and metabolomics. The datasets produced within the omics fields are numerous, megavariate and extremely complex. The data structures are frequently incomplete, noisy, nonlinear and collinear demanding modern and powerful multivariate data analysis methods. In particular, the omics technologies have steered biology towards the adoption of orthogonal PLS. We also describe future prospects for the use of PLS algorithms in the omics fields.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Similar Papers
More From: Molecular Informatics
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.