Functional Modeling of High-Dimensional Data: A Manifold Learning Approach

Harold A Hernández-Roig, M Carmen Aguilera-Morillo, Rosa E Lillo

doi:10.3390/math9040406

Open Access

https://doi.org/10.3390/math9040406

Abstract

This paper introduces stringing via Manifold Learning (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of Functional Data Analysis (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data.
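The rearrangement idea behind stringing can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' implementation: smooth curves are simulated, their columns are scrambled, and a one-dimensional embedding of the predictors is used to recover an ordering. Classical (Torgerson) scaling in one dimension is used here as a simple stand-in for UDS, and an Isomap-style embedding on k-nearest-neighbour geodesic distances is used as one familiar manifold learning option; the abstract does not say which algorithms or distances the paper uses. All function names, the Euclidean distance between predictors, and the simulation settings are hypothetical choices made for illustration.

```python
import numpy as np


def column_distances(X):
    """Euclidean distances between predictors (columns of X). Assumed metric."""
    cols = X.T
    sq = (cols ** 2).sum(axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * cols @ cols.T
    D = np.sqrt(np.maximum(d2, 0.0))
    np.fill_diagonal(D, 0.0)
    return D


def geodesic_distances(D, k):
    """Shortest-path distances on a symmetric k-nearest-neighbour graph."""
    p = D.shape[0]
    G = np.full((p, p), np.inf)
    np.fill_diagonal(G, 0.0)
    neighbours = np.argsort(D, axis=1)[:, 1:k + 1]  # column 0 is the point itself
    for i in range(p):
        G[i, neighbours[i]] = D[i, neighbours[i]]
        G[neighbours[i], i] = D[i, neighbours[i]]
    for m in range(p):  # Floyd-Warshall update through intermediate node m
        G = np.minimum(G, G[:, [m]] + G[[m], :])
    if not np.all(np.isfinite(G)):
        raise ValueError("k-NN graph is disconnected; increase k")
    return G


def embed_1d(D):
    """One-dimensional classical scaling of a distance matrix."""
    p = D.shape[0]
    J = np.eye(p) - np.full((p, p), 1.0 / p)  # centering matrix
    B = -0.5 * J @ (D ** 2) @ J
    eigvals, eigvecs = np.linalg.eigh(B)  # eigenvalues in ascending order
    return eigvecs[:, -1] * np.sqrt(max(eigvals[-1], 0.0))


def rank_agreement(coord, true_pos):
    """Spearman-type correlation between embedded ranks and true positions."""
    ranks = np.argsort(np.argsort(coord))
    return np.corrcoef(ranks, true_pos)[0, 1]


# Simulated smooth curves observed on a grid of p points (hypothetical setup).
rng = np.random.default_rng(0)
n, p = 200, 40
t = np.linspace(0.0, 1.0, p)
omega = rng.uniform(0.0, 3.0, size=(n, 1))
phase = rng.uniform(0.0, 2.0 * np.pi, size=(n, 1))
X_smooth = np.sin(omega * t + phase)

# Scramble the columns: observed column c sits at true grid position perm[c].
perm = rng.permutation(p)
X_obs = X_smooth[:, perm]

D = column_distances(X_obs)
coord_linear = embed_1d(D)                            # stand-in for UDS
coord_geodesic = embed_1d(geodesic_distances(D, k=5))  # Isomap-style

order = np.argsort(coord_geodesic)  # recovered ordering of the predictors
X_strung = X_obs[:, order]          # "stringed" data, ready for smoothing

rho_linear = rank_agreement(coord_linear, perm)
rho_geodesic = rank_agreement(coord_geodesic, perm)
print("rank agreement, linear embedding:  ", abs(rho_linear))
print("rank agreement, geodesic embedding:", abs(rho_geodesic))
```

The sign of an embedding is arbitrary, so an ordering can only be recovered up to reversal; this is why the agreement is reported in absolute value. In this toy setting the rows of `X_strung` are again smooth in the column index and could be passed to a basis-expansion smoother before fitting a functional model.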

Highlights

To study the benefits of using ML-stringing instead of the Unidimensional Scaling (UDS)-based version, we focus mainly on three aspects: (1) the visual representation of the stringed high-dimensional data achieved by the estimated functional predictors; (2) the interpretability of the estimated coefficient function; and (3) the accuracy of the predictions achieved by the SOF (scalar-on-function) regression model.
We realized that stringing based on UDS rearranged data according to linear relationships between predictors.
Motivated by these findings, we introduced ML-stringing, a version of the method that takes into account more complex structure in the data, such as nonlinearities.

Summary

Introduction

A considerable literature has grown up around the topic of high-dimensional data. In this scenario, classical statistical tools are insufficient to study the data, as the number of features is generally higher than the sample size. For example, microarrays measure gene expression and in most cases can contain up to 10^5 genes (features or predictors) for fewer than one hundred subjects (samples). It is common to deal with a huge difference between the sample size n and the number p of features (written as n ≪ p). If the data come with an associated response (say, a category indicating an ill or healthy patient), tasks such as modeling become very difficult.

Journal: Mathematics
Publication Date: Feb 19, 2021
Citations: 3
License type: CC BY 4.0

Similar Papers

Functional exploratory data analysis for high-resolution measurements of urban particulate matter.
M Giovanna Ranalli ... Beatrice Moroni
Biometrical journal. Biometrische Zeitschrift | VOL. 58
13 Apr 2016

Functional Data Analysis and Person Response Functions
Kyle T Turner ... George Engelhard
Measurement: Interdisciplinary Research and Perspectives | VOL. 21
03 Jul 2023

Supervised classification of curves via a combined use of functional data analysis and tree-based methods
Fabrizio Maturo ... Rosanna Verde
Computational Statistics | VOL. 38
30 May 2022

The Use of Functional Data Analysis to Evaluate Activity in a Spontaneous Model of Degenerative Joint Disease Associated Pain in Cats.
Margaret E Gruen ... Alicia C Worth
PLOS ONE | VOL. 12
18 Jan 2017
