Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.

Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer m...

Full description

Bibliographic Details
Main Authors: Kimberly A Walters, Yungui Huang, Marco Azaro, Kathleen Tobin, Thomas Lehner, Linda M Brzustowicz, Veronica J Vieland
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3891773?pdf=render
_version_ 1811194124176457728
author Kimberly A Walters
Yungui Huang
Marco Azaro
Kathleen Tobin
Thomas Lehner
Linda M Brzustowicz
Veronica J Vieland
author_facet Kimberly A Walters
Yungui Huang
Marco Azaro
Kathleen Tobin
Thomas Lehner
Linda M Brzustowicz
Veronica J Vieland
author_sort Kimberly A Walters
collection DOAJ
description Human geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer many advantages, including the ability to return to previously collected data to apply new analytic techniques, they also have some limitations. To illustrate, we reviewed data from seven older schizophrenia studies available from the NIMH-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI), and assessed the impact of data cleaning and regularization on linkage analyses. Extensive data regularization protocols were developed and applied to both genotypic and phenotypic data. Genome-wide nonparametric linkage (NPL) statistics were computed for each study, over various stages of data processing. To assess the impact of data processing on aggregate results, Genome-Scan Meta-Analysis (GSMA) was performed. Examples of increased, reduced and shifted linkage peaks were found when comparing linkage results based on original HGI data to results using post-processed data within the same set of pedigrees. Interestingly, reducing the number of affected individuals tended to increase rather than decrease linkage peaks. But most importantly, while the effects of data regularization within individual data sets were small, GSMA applied to the data in aggregate yielded a substantially different picture after data regularization. These results have implications for analyses based on other types of data (e.g., case-control GWAS or sequencing data) as well as data obtained from other repositories.
first_indexed 2024-04-12T00:20:42Z
format Article
id doaj.art-dc2b954ca51b441591b375ee63ac776e
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-12T00:20:42Z
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-dc2b954ca51b441591b375ee63ac776e2022-12-22T03:55:44ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0191e8469610.1371/journal.pone.0084696Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.Kimberly A WaltersYungui HuangMarco AzaroKathleen TobinThomas LehnerLinda M BrzustowiczVeronica J VielandHuman geneticists are increasingly turning to study designs based on very large sample sizes to overcome difficulties in studying complex disorders. This in turn almost always requires multi-site data collection and processing of data through centralized repositories. While such repositories offer many advantages, including the ability to return to previously collected data to apply new analytic techniques, they also have some limitations. To illustrate, we reviewed data from seven older schizophrenia studies available from the NIMH-funded Center for Collaborative Genomic Studies on Mental Disorders, also known as the Human Genetics Initiative (HGI), and assessed the impact of data cleaning and regularization on linkage analyses. Extensive data regularization protocols were developed and applied to both genotypic and phenotypic data. Genome-wide nonparametric linkage (NPL) statistics were computed for each study, over various stages of data processing. To assess the impact of data processing on aggregate results, Genome-Scan Meta-Analysis (GSMA) was performed. Examples of increased, reduced and shifted linkage peaks were found when comparing linkage results based on original HGI data to results using post-processed data within the same set of pedigrees. Interestingly, reducing the number of affected individuals tended to increase rather than decrease linkage peaks. But most importantly, while the effects of data regularization within individual data sets were small, GSMA applied to the data in aggregate yielded a substantially different picture after data regularization. These results have implications for analyses based on other types of data (e.g., case-control GWAS or sequencing data) as well as data obtained from other repositories.http://europepmc.org/articles/PMC3891773?pdf=render
spellingShingle Kimberly A Walters
Yungui Huang
Marco Azaro
Kathleen Tobin
Thomas Lehner
Linda M Brzustowicz
Veronica J Vieland
Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
PLoS ONE
title Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
title_full Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
title_fullStr Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
title_full_unstemmed Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
title_short Meta-analysis of repository data: impact of data regularization on NIMH schizophrenia linkage results.
title_sort meta analysis of repository data impact of data regularization on nimh schizophrenia linkage results
url http://europepmc.org/articles/PMC3891773?pdf=render
work_keys_str_mv AT kimberlyawalters metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT yunguihuang metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT marcoazaro metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT kathleentobin metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT thomaslehner metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT lindambrzustowicz metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults
AT veronicajvieland metaanalysisofrepositorydataimpactofdataregularizationonnimhschizophrenialinkageresults