mixOmics: An R package for 'omics feature selection and multiple data integration.

The advent of high throughput technologies has led to a wealth of publicly available 'omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided...

Full description

Bibliographic Details
Main Authors: Florian Rohart, Benoît Gautier, Amrit Singh, Kim-Anh Lê Cao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-11-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5687754?pdf=render
_version_ 1818197267220267008
author Florian Rohart
Benoît Gautier
Amrit Singh
Kim-Anh Lê Cao
author_facet Florian Rohart
Benoît Gautier
Amrit Singh
Kim-Anh Lê Cao
author_sort Florian Rohart
collection DOAJ
description The advent of high throughput technologies has led to a wealth of publicly available 'omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided that relevant information can be extracted in a holistic manner. Current statistical approaches have been focusing on identifying small subsets of molecules (a 'molecular signature') to explain or predict biological conditions, but mainly for a single type of 'omics. In addition, commonly used methods are univariate and consider each biological feature independently. We introduce mixOmics, an R package dedicated to the multivariate analysis of biological data sets with a specific focus on data exploration, dimension reduction and visualisation. By adopting a systems biology approach, the toolkit provides a wide range of methods that statistically integrate several data sets at once to probe relationships between heterogeneous 'omics data sets. Our recent methods extend Projection to Latent Structure (PLS) models for discriminant analysis, for data integration across multiple 'omics data or across independent studies, and for the identification of molecular signatures. We illustrate our latest mixOmics integrative frameworks for the multivariate analyses of 'omics data available from the package.
first_indexed 2024-12-12T01:47:15Z
format Article
id doaj.art-eae8489e565d44bcaf942dc3e6317c33
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-12T01:47:15Z
publishDate 2017-11-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-eae8489e565d44bcaf942dc3e6317c332022-12-22T00:42:33ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-11-011311e100575210.1371/journal.pcbi.1005752mixOmics: An R package for 'omics feature selection and multiple data integration.Florian RohartBenoît GautierAmrit SinghKim-Anh Lê CaoThe advent of high throughput technologies has led to a wealth of publicly available 'omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided that relevant information can be extracted in a holistic manner. Current statistical approaches have been focusing on identifying small subsets of molecules (a 'molecular signature') to explain or predict biological conditions, but mainly for a single type of 'omics. In addition, commonly used methods are univariate and consider each biological feature independently. We introduce mixOmics, an R package dedicated to the multivariate analysis of biological data sets with a specific focus on data exploration, dimension reduction and visualisation. By adopting a systems biology approach, the toolkit provides a wide range of methods that statistically integrate several data sets at once to probe relationships between heterogeneous 'omics data sets. Our recent methods extend Projection to Latent Structure (PLS) models for discriminant analysis, for data integration across multiple 'omics data or across independent studies, and for the identification of molecular signatures. We illustrate our latest mixOmics integrative frameworks for the multivariate analyses of 'omics data available from the package.http://europepmc.org/articles/PMC5687754?pdf=render
spellingShingle Florian Rohart
Benoît Gautier
Amrit Singh
Kim-Anh Lê Cao
mixOmics: An R package for 'omics feature selection and multiple data integration.
PLoS Computational Biology
title mixOmics: An R package for 'omics feature selection and multiple data integration.
title_full mixOmics: An R package for 'omics feature selection and multiple data integration.
title_fullStr mixOmics: An R package for 'omics feature selection and multiple data integration.
title_full_unstemmed mixOmics: An R package for 'omics feature selection and multiple data integration.
title_short mixOmics: An R package for 'omics feature selection and multiple data integration.
title_sort mixomics an r package for omics feature selection and multiple data integration
url http://europepmc.org/articles/PMC5687754?pdf=render
work_keys_str_mv AT florianrohart mixomicsanrpackageforomicsfeatureselectionandmultipledataintegration
AT benoitgautier mixomicsanrpackageforomicsfeatureselectionandmultipledataintegration
AT amritsingh mixomicsanrpackageforomicsfeatureselectionandmultipledataintegration
AT kimanhlecao mixomicsanrpackageforomicsfeatureselectionandmultipledataintegration