Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle

Abstract Background The trading of individual animal genotype information often involves only the exchange of the called genotypes and not necessarily the additional information required to effectively call structural variants. The main aim here was to determine if it is possible to impute copy numb...

Full description

Bibliographic Details
Main Authors: Pierce Rafter, Isobel Claire Gormley, Andrew C. Parnell, John Francis Kearney, Donagh P. Berry
Format: Article
Language:English
Published: BMC 2020-03-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-020-6627-8
_version_ 1818038119038976000
author Pierce Rafter
Isobel Claire Gormley
Andrew C. Parnell
John Francis Kearney
Donagh P. Berry
author_facet Pierce Rafter
Isobel Claire Gormley
Andrew C. Parnell
John Francis Kearney
Donagh P. Berry
author_sort Pierce Rafter
collection DOAJ
description Abstract Background The trading of individual animal genotype information often involves only the exchange of the called genotypes and not necessarily the additional information required to effectively call structural variants. The main aim here was to determine if it is possible to impute copy number variants (CNVs) using the flanking single nucleotide polymorphism (SNP) haplotype structure in cattle. While this objective was achieved using high-density genotype panels (i.e., 713,162 SNPs), a secondary objective investigated the concordance of CNVs called with this high-density genotype panel compared to CNVs called from a medium-density panel (i.e., 45,677 SNPs in the present study). This is the first study to compare CNVs called from high-density and medium-density SNP genotypes from the same animals. High (and medium-density) genotypes were available on 991 Holstein-Friesian, 1015 Charolais, and 1394 Limousin bulls. The concordance between CNVs called from the medium-density and high-density genotypes were calculated separately for each animal. A subset of CNVs which were called from the high-density genotypes was selected for imputation. Imputation was carried out separately for each breed using a set of high-density SNPs flanking the midpoint of each CNV. A CNV was deemed to be imputed correctly when the called copy number matched the imputed copy number. Results For 97.0% of CNVs called from the high-density genotypes, the corresponding genomic position on the medium-density of the animal did not contain a called CNV. The average accuracy of imputation for CNV deletions was 0.281, with a standard deviation of 0.286. The average accuracy of imputation of the CNV normal state, i.e. the absence of a CNV, was 0.982 with a standard deviation of 0.022. Two CNV duplications were imputed in the Charolais, a single CNV duplication in the Limousins, and a single CNV duplication in the Holstein-Friesians; in all cases the CNV duplications were incorrectly imputed. Conclusion The vast majority of CNVs called from the high-density genotypes were not detected using the medium-density genotypes. Furthermore, CNVs cannot be accurately predicted from flanking SNP haplotypes, at least based on the imputation algorithms routinely used in cattle, and using the SNPs currently available on the high-density genotype panel.
first_indexed 2024-12-10T07:37:40Z
format Article
id doaj.art-8ca25926e16b4a07a10181c890d536aa
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-10T07:37:40Z
publishDate 2020-03-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-8ca25926e16b4a07a10181c890d536aa2022-12-22T01:57:22ZengBMCBMC Genomics1471-21642020-03-0121111010.1186/s12864-020-6627-8Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattlePierce Rafter0Isobel Claire Gormley1Andrew C. Parnell2John Francis Kearney3Donagh P. Berry4Animal & Grassland Research and Innovation Centre, TeagascUCD School of Mathematics and Statistics, University College DublinUCD School of Mathematics and Statistics, University College DublinIrish Cattle Breeding Federation, Highfield House, Shinagh, Bandon, Co.Animal & Grassland Research and Innovation Centre, TeagascAbstract Background The trading of individual animal genotype information often involves only the exchange of the called genotypes and not necessarily the additional information required to effectively call structural variants. The main aim here was to determine if it is possible to impute copy number variants (CNVs) using the flanking single nucleotide polymorphism (SNP) haplotype structure in cattle. While this objective was achieved using high-density genotype panels (i.e., 713,162 SNPs), a secondary objective investigated the concordance of CNVs called with this high-density genotype panel compared to CNVs called from a medium-density panel (i.e., 45,677 SNPs in the present study). This is the first study to compare CNVs called from high-density and medium-density SNP genotypes from the same animals. High (and medium-density) genotypes were available on 991 Holstein-Friesian, 1015 Charolais, and 1394 Limousin bulls. The concordance between CNVs called from the medium-density and high-density genotypes were calculated separately for each animal. A subset of CNVs which were called from the high-density genotypes was selected for imputation. Imputation was carried out separately for each breed using a set of high-density SNPs flanking the midpoint of each CNV. A CNV was deemed to be imputed correctly when the called copy number matched the imputed copy number. Results For 97.0% of CNVs called from the high-density genotypes, the corresponding genomic position on the medium-density of the animal did not contain a called CNV. The average accuracy of imputation for CNV deletions was 0.281, with a standard deviation of 0.286. The average accuracy of imputation of the CNV normal state, i.e. the absence of a CNV, was 0.982 with a standard deviation of 0.022. Two CNV duplications were imputed in the Charolais, a single CNV duplication in the Limousins, and a single CNV duplication in the Holstein-Friesians; in all cases the CNV duplications were incorrectly imputed. Conclusion The vast majority of CNVs called from the high-density genotypes were not detected using the medium-density genotypes. Furthermore, CNVs cannot be accurately predicted from flanking SNP haplotypes, at least based on the imputation algorithms routinely used in cattle, and using the SNPs currently available on the high-density genotype panel.http://link.springer.com/article/10.1186/s12864-020-6627-8CNVBovinePennCNVQuantiSNPBeagleFImpute
spellingShingle Pierce Rafter
Isobel Claire Gormley
Andrew C. Parnell
John Francis Kearney
Donagh P. Berry
Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
BMC Genomics
CNV
Bovine
PennCNV
QuantiSNP
Beagle
FImpute
title Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
title_full Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
title_fullStr Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
title_full_unstemmed Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
title_short Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
title_sort concordance rate between copy number variants detected using either high or medium density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle
topic CNV
Bovine
PennCNV
QuantiSNP
Beagle
FImpute
url http://link.springer.com/article/10.1186/s12864-020-6627-8
work_keys_str_mv AT piercerafter concordanceratebetweencopynumbervariantsdetectedusingeitherhighormediumdensitysinglenucleotidepolymorphismgenotypepanelsandthepotentialofimputingcopynumbervariantsfromflankinghighdensitysinglenucleotidepolymorphismhaplotypesincattle
AT isobelclairegormley concordanceratebetweencopynumbervariantsdetectedusingeitherhighormediumdensitysinglenucleotidepolymorphismgenotypepanelsandthepotentialofimputingcopynumbervariantsfromflankinghighdensitysinglenucleotidepolymorphismhaplotypesincattle
AT andrewcparnell concordanceratebetweencopynumbervariantsdetectedusingeitherhighormediumdensitysinglenucleotidepolymorphismgenotypepanelsandthepotentialofimputingcopynumbervariantsfromflankinghighdensitysinglenucleotidepolymorphismhaplotypesincattle
AT johnfranciskearney concordanceratebetweencopynumbervariantsdetectedusingeitherhighormediumdensitysinglenucleotidepolymorphismgenotypepanelsandthepotentialofimputingcopynumbervariantsfromflankinghighdensitysinglenucleotidepolymorphismhaplotypesincattle
AT donaghpberry concordanceratebetweencopynumbervariantsdetectedusingeitherhighormediumdensitysinglenucleotidepolymorphismgenotypepanelsandthepotentialofimputingcopynumbervariantsfromflankinghighdensitysinglenucleotidepolymorphismhaplotypesincattle