Functional Modeling of High-Dimensional Data: A Manifold Learning Approach

This paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, a...

Full description

Bibliographic Details
Main Authors: Harold A. Hernández-Roig, M. Carmen Aguilera-Morillo, Rosa E. Lillo
Format: Article
Language:English
Published: MDPI AG 2021-02-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/9/4/406
_version_ 1797395983504506880
author Harold A. Hernández-Roig
M. Carmen Aguilera-Morillo
Rosa E. Lillo
author_facet Harold A. Hernández-Roig
M. Carmen Aguilera-Morillo
Rosa E. Lillo
author_sort Harold A. Hernández-Roig
collection DOAJ
description This paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of <i>Functional Data Analysis</i> (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data.
first_indexed 2024-03-09T00:43:43Z
format Article
id doaj.art-7098eadeb3f24ade8a78fd258ef85395
institution Directory Open Access Journal
issn 2227-7390
language English
last_indexed 2024-03-09T00:43:43Z
publishDate 2021-02-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj.art-7098eadeb3f24ade8a78fd258ef853952023-12-11T17:39:24ZengMDPI AGMathematics2227-73902021-02-019440610.3390/math9040406Functional Modeling of High-Dimensional Data: A Manifold Learning ApproachHarold A. Hernández-Roig0M. Carmen Aguilera-Morillo1Rosa E. Lillo2Department of Statistics, Universidad Carlos III de Madrid, Calle Madrid, 126, 28903 Getafe, Madrid, Spainuc3m-Santander Big Data Institute, Calle Madrid, 135, 28903 Getafe, Madrid, SpainDepartment of Statistics, Universidad Carlos III de Madrid, Calle Madrid, 126, 28903 Getafe, Madrid, SpainThis paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of <i>Functional Data Analysis</i> (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data.https://www.mdpi.com/2227-7390/9/4/406stringing<i>Functional Data Analysis</i><i>Manifold Learning</i><i>Multidimensional Scaling</i>high-dimensional datafunctional regression
spellingShingle Harold A. Hernández-Roig
M. Carmen Aguilera-Morillo
Rosa E. Lillo
Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
Mathematics
stringing
<i>Functional Data Analysis</i>
<i>Manifold Learning</i>
<i>Multidimensional Scaling</i>
high-dimensional data
functional regression
title Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
title_full Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
title_fullStr Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
title_full_unstemmed Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
title_short Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
title_sort functional modeling of high dimensional data a manifold learning approach
topic stringing
<i>Functional Data Analysis</i>
<i>Manifold Learning</i>
<i>Multidimensional Scaling</i>
high-dimensional data
functional regression
url https://www.mdpi.com/2227-7390/9/4/406
work_keys_str_mv AT haroldahernandezroig functionalmodelingofhighdimensionaldataamanifoldlearningapproach
AT mcarmenaguileramorillo functionalmodelingofhighdimensionaldataamanifoldlearningapproach
AT rosaelillo functionalmodelingofhighdimensionaldataamanifoldlearningapproach