Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array

Accurate assignment of copy number at known copy number variant (CNV) loci is important for both increasing understanding of the structural evolution of genomes as well as for carrying out association studies of copy number with disease. As with calling SNP genotypes, the task can be framed as a clu...

Mô tả đầy đủ

Chi tiết về thư mục
Những tác giả chính: Cardin, N, Holmes, C, Donnelly, P, Marchini, J
Định dạng: Journal article
Ngôn ngữ:English
Được phát hành: 2011
_version_ 1826266106716749824
author Cardin, N
Holmes, C
Donnelly, P
Marchini, J
author_facet Cardin, N
Holmes, C
Donnelly, P
Marchini, J
author_sort Cardin, N
collection OXFORD
description Accurate assignment of copy number at known copy number variant (CNV) loci is important for both increasing understanding of the structural evolution of genomes as well as for carrying out association studies of copy number with disease. As with calling SNP genotypes, the task can be framed as a clustering problem but for a number of reasons assigning copy number is much more challenging. CNV assays have lower signal-to-noise ratios than SNP assays, often display heavy tailed and asymmetric intensity distributions, contain outlying observations and may exhibit systematic technical differences among different cohorts. In addition, the number of copy-number classes at a CNV in the population may be unknown a priori. Due to these complications, automatic and robust assignment of copy number from array data remains a challenging problem. We have developed a copy number assignment algorithm, CNVCALL, for a targeted CNV array, such as that used by the Wellcome Trust Case Control Consortium's recent CNV association study. We use a Bayesian hierarchical mixture model that robustly identifies both the number of different copy number classes at a specific locus as well as relative copy number for each individual in the sample. This approach is fully automated which is a critical requirement when analyzing large numbers of CNVs. We illustrate the methods performance using real data from the Wellcome Trust Case Control Consortium's CNV association study and using simulated data. © 2011 Wiley-Liss, Inc.
first_indexed 2024-03-06T20:33:54Z
format Journal article
id oxford-uuid:31f8c8f6-a578-493d-bb5c-78cba9acb5f7
institution University of Oxford
language English
last_indexed 2024-03-06T20:33:54Z
publishDate 2011
record_format dspace
spelling oxford-uuid:31f8c8f6-a578-493d-bb5c-78cba9acb5f72022-03-26T13:11:12ZBayesian hierarchical mixture modeling to assign copy number from a targeted CNV arrayJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:31f8c8f6-a578-493d-bb5c-78cba9acb5f7EnglishSymplectic Elements at Oxford2011Cardin, NHolmes, CDonnelly, PMarchini, JAccurate assignment of copy number at known copy number variant (CNV) loci is important for both increasing understanding of the structural evolution of genomes as well as for carrying out association studies of copy number with disease. As with calling SNP genotypes, the task can be framed as a clustering problem but for a number of reasons assigning copy number is much more challenging. CNV assays have lower signal-to-noise ratios than SNP assays, often display heavy tailed and asymmetric intensity distributions, contain outlying observations and may exhibit systematic technical differences among different cohorts. In addition, the number of copy-number classes at a CNV in the population may be unknown a priori. Due to these complications, automatic and robust assignment of copy number from array data remains a challenging problem. We have developed a copy number assignment algorithm, CNVCALL, for a targeted CNV array, such as that used by the Wellcome Trust Case Control Consortium's recent CNV association study. We use a Bayesian hierarchical mixture model that robustly identifies both the number of different copy number classes at a specific locus as well as relative copy number for each individual in the sample. This approach is fully automated which is a critical requirement when analyzing large numbers of CNVs. We illustrate the methods performance using real data from the Wellcome Trust Case Control Consortium's CNV association study and using simulated data. © 2011 Wiley-Liss, Inc.
spellingShingle Cardin, N
Holmes, C
Donnelly, P
Marchini, J
Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title_full Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title_fullStr Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title_full_unstemmed Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title_short Bayesian hierarchical mixture modeling to assign copy number from a targeted CNV array
title_sort bayesian hierarchical mixture modeling to assign copy number from a targeted cnv array
work_keys_str_mv AT cardinn bayesianhierarchicalmixturemodelingtoassigncopynumberfromatargetedcnvarray
AT holmesc bayesianhierarchicalmixturemodelingtoassigncopynumberfromatargetedcnvarray
AT donnellyp bayesianhierarchicalmixturemodelingtoassigncopynumberfromatargetedcnvarray
AT marchinij bayesianhierarchicalmixturemodelingtoassigncopynumberfromatargetedcnvarray