Functional Modeling of High-Dimensional Data: A Manifold Learning Approach
This paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, a...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-02-01
|
Series: | Mathematics |
Subjects: | |
Online Access: | https://www.mdpi.com/2227-7390/9/4/406 |
_version_ | 1797395983504506880 |
---|---|
author | Harold A. Hernández-Roig M. Carmen Aguilera-Morillo Rosa E. Lillo |
author_facet | Harold A. Hernández-Roig M. Carmen Aguilera-Morillo Rosa E. Lillo |
author_sort | Harold A. Hernández-Roig |
collection | DOAJ |
description | This paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of <i>Functional Data Analysis</i> (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data. |
first_indexed | 2024-03-09T00:43:43Z |
format | Article |
id | doaj.art-7098eadeb3f24ade8a78fd258ef85395 |
institution | Directory Open Access Journal |
issn | 2227-7390 |
language | English |
last_indexed | 2024-03-09T00:43:43Z |
publishDate | 2021-02-01 |
publisher | MDPI AG |
record_format | Article |
series | Mathematics |
spelling | doaj.art-7098eadeb3f24ade8a78fd258ef853952023-12-11T17:39:24ZengMDPI AGMathematics2227-73902021-02-019440610.3390/math9040406Functional Modeling of High-Dimensional Data: A Manifold Learning ApproachHarold A. Hernández-Roig0M. Carmen Aguilera-Morillo1Rosa E. Lillo2Department of Statistics, Universidad Carlos III de Madrid, Calle Madrid, 126, 28903 Getafe, Madrid, Spainuc3m-Santander Big Data Institute, Calle Madrid, 135, 28903 Getafe, Madrid, SpainDepartment of Statistics, Universidad Carlos III de Madrid, Calle Madrid, 126, 28903 Getafe, Madrid, SpainThis paper introduces <i>stringing via Manifold Learning</i> (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of <i>Functional Data Analysis</i> (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data.https://www.mdpi.com/2227-7390/9/4/406stringing<i>Functional Data Analysis</i><i>Manifold Learning</i><i>Multidimensional Scaling</i>high-dimensional datafunctional regression |
spellingShingle | Harold A. Hernández-Roig M. Carmen Aguilera-Morillo Rosa E. Lillo Functional Modeling of High-Dimensional Data: A Manifold Learning Approach Mathematics stringing <i>Functional Data Analysis</i> <i>Manifold Learning</i> <i>Multidimensional Scaling</i> high-dimensional data functional regression |
title | Functional Modeling of High-Dimensional Data: A Manifold Learning Approach |
title_full | Functional Modeling of High-Dimensional Data: A Manifold Learning Approach |
title_fullStr | Functional Modeling of High-Dimensional Data: A Manifold Learning Approach |
title_full_unstemmed | Functional Modeling of High-Dimensional Data: A Manifold Learning Approach |
title_short | Functional Modeling of High-Dimensional Data: A Manifold Learning Approach |
title_sort | functional modeling of high dimensional data a manifold learning approach |
topic | stringing <i>Functional Data Analysis</i> <i>Manifold Learning</i> <i>Multidimensional Scaling</i> high-dimensional data functional regression |
url | https://www.mdpi.com/2227-7390/9/4/406 |
work_keys_str_mv | AT haroldahernandezroig functionalmodelingofhighdimensionaldataamanifoldlearningapproach AT mcarmenaguileramorillo functionalmodelingofhighdimensionaldataamanifoldlearningapproach AT rosaelillo functionalmodelingofhighdimensionaldataamanifoldlearningapproach |