Multi-scale affinities with missing data: Estimation and applications.

Min Zhang,Gal Mishne,Eric C Chi

doi:10.1002/sam.11561

Abstract

Many machine learning algorithms depend on weights that quantify row and column similarities of a data matrix. The choice of weights can dramatically impact the effectiveness of the algorithm. Nonetheless, the problem of choosing weights has arguably not been given enough study. When a data matrix is completely observed, Gaussian kernel affinities can be used to quantify the local similarity between pairs of rows and pairs of columns. Computing weights in the presence of missing data, however, becomes challenging. In this paper, we propose a new method to construct row and column affinities even when data are missing by building off a co-clustering technique. This method takes advantage of solving the optimization problem for multiple pairs of cost parameters and filling in the missing values with increasingly smooth estimates. It exploits the coupled similarity structure among both the rows and columns of a data matrix. We show these affinities can be used to perform tasks such as data imputation, clustering, and matrix completion on graphs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-scale affinities with missing data: Estimation and applications.

Abstract

Talk to us

Similar Papers

More From: Statistical analysis and data mining

Lead the way for us

Similar Papers

Missing data in multiple correspondence analysis under the available data principle of the NIPALS algorithm
Andrés Felipe Ochoa Muñoz ... Víctor Manuel Gonzalez Rojas
DYNA | VOL. 86
Andrés Felipe Ochoa Muñoz, et. al.Andrés Felipe Ochoa Muñoz ... Víctor Manuel Gonzalez Rojas
01 Oct 2019
DYNA | VOL. 86

Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data
Jean-Benoit Hardouin ... Véronique Sébille
BMC Medical Research Methodology | VOL. 11
Jean-Benoit Hardouin, et. al.Jean-Benoit Hardouin ... Véronique Sébille
14 Jul 2011
BMC Medical Research Methodology | VOL. 11

Effect of Missing Data on Test Equating Methods Under NEAT Design
Semih Aşiret ... Seçil Ömür Sünbül
International Journal of Psychology and Educational Studies | VOL. 10
Semih Aşiret, et. al.Semih Aşiret ... Seçil Ömür Sünbül
01 Aug 2023
International Journal of Psychology and Educational Studies | VOL. 10

Deep probabilistic graphical modeling for robust multivariate time series anomaly detection with missing data
Jingyu Yang ... Ye Yuan
Reliability Engineering & System Safety | VOL. 238
Jingyu Yang, et. al.Jingyu Yang ... Ye Yuan
30 May 2023
Reliability Engineering & System Safety | VOL. 238

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-scale affinities with missing data: Estimation and applications.

Abstract

Talk to us

Similar Papers

More From: Statistical analysis and data mining