Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data

Abstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results...

Full description

Bibliographic Details
Main Authors: Xianchao Zhu, Xianjun Shen, Xingpeng Jiang, Kaiping Wei, Tingting He, Yuanyuan Ma, Jiaqi Liu, Xiaohua Hu
Format: Article
Language:English
Published: BMC 2018-12-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2537-z
_version_ 1818292834742042624
author Xianchao Zhu
Xianjun Shen
Xingpeng Jiang
Kaiping Wei
Tingting He
Yuanyuan Ma
Jiaqi Liu
Xiaohua Hu
author_facet Xianchao Zhu
Xianjun Shen
Xingpeng Jiang
Kaiping Wei
Tingting He
Yuanyuan Ma
Jiaqi Liu
Xiaohua Hu
author_sort Xianchao Zhu
collection DOAJ
description Abstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. Conclusions The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity.
first_indexed 2024-12-13T03:06:16Z
format Article
id doaj.art-14c0e46d88b94926b06f6df2b7f42430
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-13T03:06:16Z
publishDate 2018-12-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-14c0e46d88b94926b06f6df2b7f424302022-12-22T00:01:43ZengBMCBMC Bioinformatics1471-21052018-12-0119S20374810.1186/s12859-018-2537-zNonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome dataXianchao Zhu0Xianjun Shen1Xingpeng Jiang2Kaiping Wei3Tingting He4Yuanyuan Ma5Jiaqi Liu6Xiaohua Hu7School of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversitySchool of Computer, Central China Normal UniversityAbstract Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. Conclusions The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity.http://link.springer.com/article/10.1186/s12859-018-2537-zMultiple maps t-SNEData visualizationNon-metric similaritiesNesterov momentum
spellingShingle Xianchao Zhu
Xianjun Shen
Xingpeng Jiang
Kaiping Wei
Tingting He
Yuanyuan Ma
Jiaqi Liu
Xiaohua Hu
Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
BMC Bioinformatics
Multiple maps t-SNE
Data visualization
Non-metric similarities
Nesterov momentum
title Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_full Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_fullStr Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_full_unstemmed Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_short Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
title_sort nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data
topic Multiple maps t-SNE
Data visualization
Non-metric similarities
Nesterov momentum
url http://link.springer.com/article/10.1186/s12859-018-2537-z
work_keys_str_mv AT xianchaozhu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT xianjunshen nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT xingpengjiang nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT kaipingwei nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT tingtinghe nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT yuanyuanma nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT jiaqiliu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata
AT xiaohuahu nonlinearexpressionandvisualizationofnonmetricrelationshipsingeneticdiseasesandmicrobiomedata