Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
Abstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-12-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-018-2537-z |
_version_ | 1818292834742042624 |
---|---|
author | Xianchao Zhu Xianjun Shen Xingpeng Jiang Kaiping Wei Tingting He Yuanyuan Ma Jiaqi Liu Xiaohua Hu |
author_facet | Xianchao Zhu Xianjun Shen Xingpeng Jiang Kaiping Wei Tingting He Yuanyuan Ma Jiaqi Liu Xiaohua Hu |
author_sort | Xianchao Zhu |
collection | DOAJ |
description | Abstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. Conclusions The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity. |
first_indexed | 2024-12-13T03:06:16Z |
format | Article |
id | doaj.art-14c0e46d88b94926b06f6df2b7f42430 |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-12-13T03:06:16Z |
publishDate | 2018-12-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-14c0e46d88b94926b06f6df2b7f424302022-12-22T00:01:43ZengBMCBMC Bioinformatics1471-21052018-12-0119S20374810.1186/s12859-018-2537-zNonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome dataXianchao Zhu0Xianjun Shen1Xingpeng Jiang2Kaiping Wei3Tingting He4Yuanyuan Ma5Jiaqi Liu6Xiaohua Hu7School of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversityAbstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. Conclusions The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity.http://link.springer.com/article/10.1186/s12859-018-2537-zMultiple maps t-SNEData visualizationNon-metric similaritiesNesterov momentum |
spellingShingle | Xianchao Zhu Xianjun Shen Xingpeng Jiang Kaiping Wei Tingting He Yuanyuan Ma Jiaqi Liu Xiaohua Hu Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data BMC Bioinformatics Multiple maps t-SNE Data visualization Non-metric similarities Nesterov momentum |
title | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_full | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_fullStr | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_full_unstemmed | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_short | Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
title_sort | nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data |
topic | Multiple maps t-SNE Data visualization Non-metric similarities Nesterov momentum |
url | http://link.springer.com/article/10.1186/s12859-018-2537-z |
work_keys_str_mv | AT xianchaozhu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT xianjunshen nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT xingpengjiang nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT kaipingwei nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT tingtinghe nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT yuanyuanma nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT jiaqiliu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata AT xiaohuahu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata |