Improving the analysis of designed studies by combining statistical modelling with study design information

<p>Abstract</p> <p>Background</p> <p>In the fields of life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing un...

Full description

Bibliographic Details
Main Authors: Kleemann Robert, Bobeldijk Ivana, van den Berg Sjoerd AA, Wopereis Suzan, Thissen Uwe, Kooistra Teake, Willems van Dijk Ko, van Ommen Ben, Smilde Age K
Format: Article
Language:English
Published: BMC 2009-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/10/52
_version_ 1818581962201235456
author Kleemann Robert
Bobeldijk Ivana
van den Berg Sjoerd AA
Wopereis Suzan
Thissen Uwe
Kooistra Teake
Willems van Dijk Ko
van Ommen Ben
Smilde Age K
author_facet Kleemann Robert
Bobeldijk Ivana
van den Berg Sjoerd AA
Wopereis Suzan
Thissen Uwe
Kooistra Teake
Willems van Dijk Ko
van Ommen Ben
Smilde Age K
author_sort Kleemann Robert
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>In the fields of life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing unwanted variation (noise). Knowledge about the study design can be used to decompose the total data into data blocks that are associated with specific effects. Subsequent statistical analysis can be improved by this decomposition if these are applied on selected combinations of effects.</p> <p>Results</p> <p>The benefit of this approach was demonstrated with an analysis that combines multivariate PLS (Partial Least Squares) regression with data decomposition from ANOVA (Analysis of Variance): ANOVA-PLS. As a case, a nutritional intervention study is used on Apoliprotein E3-Leiden (APOE3Leiden) transgenic mice to study the relation between liver lipidomics and a plasma inflammation marker, Serum Amyloid A. The ANOVA-PLS performance was compared to PLS regression on the non-decomposed data with respect to the quality of the modelled relation, model reliability, and interpretability.</p> <p>Conclusion</p> <p>It was shown that ANOVA-PLS leads to a better statistical model that is more reliable and better interpretable compared to standard PLS analysis. From a following biological interpretation, more relevant metabolites were derived from the model. The concept of combining data composition with a subsequent statistical analysis, as in ANOVA-PLS, is however not limited to PLS regression in metabolomics but can be applied for many statistical methods and many different types of data.</p>
first_indexed 2024-12-16T07:41:49Z
format Article
id doaj.art-994d7a4c734e4fda87f527624b05684d
institution Directory Open Access Journal
issn 1471-2105
language English
last_indexed 2024-12-16T07:41:49Z
publishDate 2009-02-01
publisher BMC
record_format Article
series BMC Bioinformatics
spelling doaj.art-994d7a4c734e4fda87f527624b05684d2022-12-21T22:39:03ZengBMCBMC Bioinformatics1471-21052009-02-011015210.1186/1471-2105-10-52Improving the analysis of designed studies by combining statistical modelling with study design informationKleemann RobertBobeldijk Ivanavan den Berg Sjoerd AAWopereis SuzanThissen UweKooistra TeakeWillems van Dijk Kovan Ommen BenSmilde Age K<p>Abstract</p> <p>Background</p> <p>In the fields of life sciences, so-called designed studies are used for studying complex biological systems. The data derived from these studies comply with a study design aimed at generating relevant information while diminishing unwanted variation (noise). Knowledge about the study design can be used to decompose the total data into data blocks that are associated with specific effects. Subsequent statistical analysis can be improved by this decomposition if these are applied on selected combinations of effects.</p> <p>Results</p> <p>The benefit of this approach was demonstrated with an analysis that combines multivariate PLS (Partial Least Squares) regression with data decomposition from ANOVA (Analysis of Variance): ANOVA-PLS. As a case, a nutritional intervention study is used on Apoliprotein E3-Leiden (APOE3Leiden) transgenic mice to study the relation between liver lipidomics and a plasma inflammation marker, Serum Amyloid A. The ANOVA-PLS performance was compared to PLS regression on the non-decomposed data with respect to the quality of the modelled relation, model reliability, and interpretability.</p> <p>Conclusion</p> <p>It was shown that ANOVA-PLS leads to a better statistical model that is more reliable and better interpretable compared to standard PLS analysis. From a following biological interpretation, more relevant metabolites were derived from the model. The concept of combining data composition with a subsequent statistical analysis, as in ANOVA-PLS, is however not limited to PLS regression in metabolomics but can be applied for many statistical methods and many different types of data.</p>http://www.biomedcentral.com/1471-2105/10/52
spellingShingle Kleemann Robert
Bobeldijk Ivana
van den Berg Sjoerd AA
Wopereis Suzan
Thissen Uwe
Kooistra Teake
Willems van Dijk Ko
van Ommen Ben
Smilde Age K
Improving the analysis of designed studies by combining statistical modelling with study design information
BMC Bioinformatics
title Improving the analysis of designed studies by combining statistical modelling with study design information
title_full Improving the analysis of designed studies by combining statistical modelling with study design information
title_fullStr Improving the analysis of designed studies by combining statistical modelling with study design information
title_full_unstemmed Improving the analysis of designed studies by combining statistical modelling with study design information
title_short Improving the analysis of designed studies by combining statistical modelling with study design information
title_sort improving the analysis of designed studies by combining statistical modelling with study design information
url http://www.biomedcentral.com/1471-2105/10/52
work_keys_str_mv AT kleemannrobert improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT bobeldijkivana improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT vandenbergsjoerdaa improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT wopereissuzan improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT thissenuwe improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT kooistrateake improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT willemsvandijkko improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT vanommenben improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation
AT smildeagek improvingtheanalysisofdesignedstudiesbycombiningstatisticalmodellingwithstudydesigninformation