Logistic Biplot by Conjugate Gradient Algorithms and Iterated SVD

Jose Giovany Babativa-Márquez, José Luis Vicente-Villardón

doi:10.3390/math9162015

Open Access

https://doi.org/10.3390/math9162015

Abstract

Multivariate binary data are increasingly frequent in practice. Although some adaptations of principal component analysis are used to reduce dimensionality for this kind of data, none of them provides a simultaneous representation of rows and columns (biplot). Recently, a technique named logistic biplot (LB) has been developed to represent the rows and columns of a binary data matrix simultaneously, but the algorithm used to fit its parameters is too computationally demanding to be useful in the presence of sparsity or when the matrix is large. We propose fitting the LB model using nonlinear conjugate gradient (CG) or majorization–minimization (MM) algorithms, and we introduce a cross-validation procedure to select the hyperparameter that represents the number of dimensions in the model. A Monte Carlo study that considers scenarios with several sparsity levels and different dimensions of the binary data set shows that the procedure based on cross-validation is successful in selecting the model for all algorithms studied. The comparison of running times shows that the CG algorithm is more efficient in the presence of sparsity and when the matrix is not very large, while the MM algorithm performs better when the binary matrix is balanced or large. As a complement to the proposed methods and to give practical support, a package called BiplotML has been written in the R language. To complete the study, real binary data on gene expression methylation are used to illustrate the proposed methods.
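The MM idea mentioned in the abstract can be illustrated with a minimal sketch of the iterated-SVD scheme commonly described for logistic PCA: at each step the Bernoulli negative log-likelihood is majorized by a quadratic (using the 1/4 bound on the logistic curvature), and the quadratic is minimized by a truncated SVD of a centered working matrix. This is not the BiplotML implementation; the function names (`fit_logistic_biplot_mm`, `neg_log_lik`), the zero initialization, and the stopping rule are assumptions made for illustration.

```python
import numpy as np


def sigmoid(t):
    # Numerically stable logistic function.
    return 0.5 * (1.0 + np.tanh(0.5 * t))


def neg_log_lik(X, Theta):
    # Bernoulli negative log-likelihood: sum of log(1 + exp(theta)) - x * theta.
    return float(np.sum(np.logaddexp(0.0, Theta) - X * Theta))


def fit_logistic_biplot_mm(X, k=2, n_iter=100, tol=1e-6):
    """Hypothetical helper: fit Theta = 1 mu' + A B' by iterated SVD (MM)."""
    n, p = X.shape
    Theta = np.zeros((n, p))          # assumed starting point
    mu = np.zeros(p)
    A = np.zeros((n, k))
    B = np.zeros((p, k))
    losses = [neg_log_lik(X, Theta)]
    for _ in range(n_iter):
        P = sigmoid(Theta)
        # Working matrix from the quadratic majorizer (curvature bound 1/4).
        Z = Theta + 4.0 * (X - P)
        mu = Z.mean(axis=0)           # column offsets
        U, s, Vt = np.linalg.svd(Z - mu, full_matrices=False)
        A = U[:, :k] * s[:k]          # row markers
        B = Vt[:k].T                  # column markers
        Theta = mu + A @ B.T
        losses.append(neg_log_lik(X, Theta))
        # Stop when the relative decrease in the loss is negligible.
        if losses[-2] - losses[-1] < tol * abs(losses[-2]):
            break
    return mu, A, B, losses


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X_demo = (rng.random((30, 6)) < 0.3).astype(float)  # synthetic binary data
    _, A_demo, B_demo, trace = fit_logistic_biplot_mm(X_demo, k=2)
    print(A_demo.shape, B_demo.shape, len(trace))
```

Because each iteration minimizes a majorizer of the loss, the recorded loss values should never increase, which makes the trace a convenient sanity check. A CG fit would instead optimize the same loss directly over the parameters with a general-purpose nonlinear optimizer.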

Highlights

In many studies, researchers have a binary multivariate data matrix and aim to reduce its dimensionality to investigate the structure of the data.
We propose estimating the parameters of the Logistic Biplot (LB) model in two different ways: with conjugate gradient (CG) methods, or with a coordinate descent MM algorithm.
The Logistic Biplot (LB) model is a dimensionality reduction technique that generalizes principal component analysis (PCA) to deal with binary variables and has the advantage of simultaneously representing individuals and variables.
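For orientation, the LB model is usually written as a low-rank logistic model. The notation below is an assumption chosen for illustration and is not taken from the text above: X is an n × p binary matrix, π_ij is the probability that x_ij = 1, a_is are row (individual) coordinates, b_js are column (variable) coordinates, b_j0 is a column offset, and k is the number of dimensions.

```latex
\operatorname{logit}(\pi_{ij})
  = \log\frac{\pi_{ij}}{1-\pi_{ij}}
  = b_{j0} + \sum_{s=1}^{k} a_{is}\, b_{js},
\qquad i = 1,\dots,n,\quad j = 1,\dots,p,
\qquad\text{or, in matrix form,}\qquad
\boldsymbol{\Theta} = \mathbf{1}_n \mathbf{b}_0^{\top} + \mathbf{A}\mathbf{B}^{\top}.
```

Plotting the rows of A and B in the same k-dimensional space gives the simultaneous representation of individuals and variables.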

Summary

Introduction

In many studies, researchers have a binary multivariate data matrix and aim to reduce its dimensionality to investigate the structure of the data. In biological research, and in particular in the analysis of genetic and epigenetic alterations, the amount of binary data has been increasing over time [5]. In these cases, classical methods to reduce dimensionality, such as principal component analysis (PCA), are not appropriate. Collins et al. [6] provide a generalization of PCA to exponential family data using the generalized linear model framework. This approach suggests the possibility of having proper likelihood loss functions depending on the type of data.
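As an illustration of what a likelihood loss means for binary data, the Bernoulli case replaces the squared-error criterion of classical PCA with the negative log-likelihood. The symbols are assumptions for this sketch: x_ij is an observed binary entry and θ_ij is the corresponding natural (log-odds) parameter, which the low-rank model constrains.

```latex
\mathcal{L}(\boldsymbol{\Theta})
  = -\sum_{i=1}^{n}\sum_{j=1}^{p}
    \Bigl[\, x_{ij}\log \pi_{ij} + (1 - x_{ij})\log(1 - \pi_{ij}) \Bigr],
\qquad
\pi_{ij} = \frac{1}{1 + e^{-\theta_{ij}}}.
```

For Gaussian data the analogous loss reduces to the sum of squared residuals, which is the sense in which this framework generalizes PCA.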

Journal: Mathematics
Publication Date: Aug 23, 2021
Citations: 2
License type: CC BY 4.0
