QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data

Array-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally valida...

Full description

Bibliographic Details
Main Authors: Colella, S, Yau, C, Taylor, J, Mirza, G, Butler, H, Clouston, P, Bassett, A, Seller, A, Holmes, C, Ragoussis, J
Format: Journal article
Language:English
Published: Oxford University Press 2007
Subjects:
_version_ 1797071848840626176
author Colella, S
Yau, C
Taylor, J
Mirza, G
Butler, H
Clouston, P
Bassett, A
Seller, A
Holmes, C
Ragoussis, J
author_facet Colella, S
Yau, C
Taylor, J
Mirza, G
Butler, H
Clouston, P
Bassett, A
Seller, A
Holmes, C
Ragoussis, J
author_sort Colella, S
collection OXFORD
description Array-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally validated, a novel computational framework (QuantiSNP) for detecting regions of copy number variation from BeadArray™ SNP genotyping data using an Objective Bayes Hidden-Markov Model (OB-HMM). Objective Bayes measures are used to set certain hyperparameters in the priors using a novel re-sampling framework to calibrate the model to a fixed Type I (false positive) error rate. Other parameters are set via maximum marginal likelihood to prior training data of known structure. QuantiSNP provides probabilistic quantification of state classifications and significantly improves the accuracy of segmental aneuploidy identification and mapping, relative to existing analytical tools (Beadstudio, Illumina), as demonstrated by validation of breakpoint boundaries. QuantiSNP identified both novel and validated CNVs. QuantiSNP was developed using BeadArray™ SNP data but it can be adapted to other platforms and we believe that the OB-HMM framework has widespread applicability in genomic research. In conclusion, QuantiSNP is a novel algorithm for high-resolution CNV/aneuploidy detection with application to clinical genetics, cancer and disease association studies.
first_indexed 2024-03-06T22:59:10Z
format Journal article
id oxford-uuid:617962f2-1779-4382-882e-f91a873fbd20
institution University of Oxford
language English
last_indexed 2024-03-06T22:59:10Z
publishDate 2007
publisher Oxford University Press
record_format dspace
spelling oxford-uuid:617962f2-1779-4382-882e-f91a873fbd202022-03-26T18:00:18ZQuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping dataJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:617962f2-1779-4382-882e-f91a873fbd20Statistics (see also social sciences)Medical SciencesGenetics (medical sciences)EnglishOxford University Research Archive - ValetOxford University Press2007Colella, SYau, CTaylor, JMirza, GButler, HClouston, PBassett, ASeller, AHolmes, CRagoussis, JArray-based technologies have been used to detect chromosomal copy number changes (aneuploidies) in the human genome. Recent studies identified numerous copy number variants (CNV) and some are common polymorphisms that may contribute to disease susceptibility. We developed, and experimentally validated, a novel computational framework (QuantiSNP) for detecting regions of copy number variation from BeadArray™ SNP genotyping data using an Objective Bayes Hidden-Markov Model (OB-HMM). Objective Bayes measures are used to set certain hyperparameters in the priors using a novel re-sampling framework to calibrate the model to a fixed Type I (false positive) error rate. Other parameters are set via maximum marginal likelihood to prior training data of known structure. QuantiSNP provides probabilistic quantification of state classifications and significantly improves the accuracy of segmental aneuploidy identification and mapping, relative to existing analytical tools (Beadstudio, Illumina), as demonstrated by validation of breakpoint boundaries. QuantiSNP identified both novel and validated CNVs. QuantiSNP was developed using BeadArray™ SNP data but it can be adapted to other platforms and we believe that the OB-HMM framework has widespread applicability in genomic research. In conclusion, QuantiSNP is a novel algorithm for high-resolution CNV/aneuploidy detection with application to clinical genetics, cancer and disease association studies.
spellingShingle Statistics (see also social sciences)
Medical Sciences
Genetics (medical sciences)
Colella, S
Yau, C
Taylor, J
Mirza, G
Butler, H
Clouston, P
Bassett, A
Seller, A
Holmes, C
Ragoussis, J
QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title_full QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title_fullStr QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title_full_unstemmed QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title_short QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data
title_sort quantisnp an objective bayes hidden markov model to detect and accurately map copy number variation using snp genotyping data
topic Statistics (see also social sciences)
Medical Sciences
Genetics (medical sciences)
work_keys_str_mv AT colellas quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT yauc quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT taylorj quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT mirzag quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT butlerh quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT cloustonp quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT bassetta quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT sellera quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT holmesc quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata
AT ragoussisj quantisnpanobjectivebayeshiddenmarkovmodeltodetectandaccuratelymapcopynumbervariationusingsnpgenotypingdata