Detecting copy number variation in next generation sequencing data from diagnostic gene panels

Abstract Background Detection of copy number variation (CNV) in genes associated with disease is important in genetic diagnostics, and next generation sequencing (NGS) technology provides data that can be used for CNV detection. However, CNV detection based on NGS data is in general not often used i...

Full description

Bibliographic Details
Main Authors: Ashish Kumar Singh, Maren Fridtjofsen Olsen, Liss Anne Solberg Lavik, Trine Vold, Finn Drabløs, Wenche Sjursen
Format: Article
Language:English
Published: BMC 2021-08-01
Series:BMC Medical Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12920-021-01059-x
_version_ 1818670280512372736
author Ashish Kumar Singh
Maren Fridtjofsen Olsen
Liss Anne Solberg Lavik
Trine Vold
Finn Drabløs
Wenche Sjursen
author_facet Ashish Kumar Singh
Maren Fridtjofsen Olsen
Liss Anne Solberg Lavik
Trine Vold
Finn Drabløs
Wenche Sjursen
author_sort Ashish Kumar Singh
collection DOAJ
description Abstract Background Detection of copy number variation (CNV) in genes associated with disease is important in genetic diagnostics, and next generation sequencing (NGS) technology provides data that can be used for CNV detection. However, CNV detection based on NGS data is in general not often used in diagnostic labs as the data analysis is challenging, especially with data from targeted gene panels. Wet lab methods like MLPA (MRC Holland) are widely used, but are expensive, time consuming and have gene-specific limitations. Our aim has been to develop a bioinformatic tool for CNV detection from NGS data in medical genetic diagnostic samples. Results Our computational pipeline for detection of CNVs in NGS data from targeted gene panels utilizes coverage depth of the captured regions and calculates a copy number ratio score for each region. This is computed by comparing the mean coverage of the sample with the mean coverage of the same region in other samples, defined as a pool. The pipeline selects pools for comparison dynamically from previously sequenced samples, using the pool with an average coverage depth that is nearest to the one of the samples. A sliding window-based approach is used to analyze each region, where length of sliding window and sliding distance can be chosen dynamically to increase or decrease the resolution. This helps in detecting CNVs in small or partial exons. With this pipeline we have correctly identified the CNVs in 36 positive control samples, with sensitivity of 100% and specificity of 91%. We have detected whole gene level deletion/duplication, single/multi exonic level deletion/duplication, partial exonic deletion and mosaic deletion. Since its implementation in mid-2018 it has proven its diagnostic value with more than 45 CNV findings in routine tests. Conclusions With this pipeline as part of our diagnostic practices it is now possible to detect partial, single or multi-exonic, and intragenic CNVs in all genes in our target panel. This has helped our diagnostic lab to expand the portfolio of genes where we offer CNV detection, which previously was limited by the availability of MLPA kits.
first_indexed 2024-12-17T07:05:36Z
format Article
id doaj.art-84d201fa49674f48a6b546f71115b19b
institution Directory Open Access Journal
issn 1755-8794
language English
last_indexed 2024-12-17T07:05:36Z
publishDate 2021-08-01
publisher BMC
record_format Article
series BMC Medical Genomics
spelling doaj.art-84d201fa49674f48a6b546f71115b19b2022-12-21T21:59:11ZengBMCBMC Medical Genomics1755-87942021-08-0114111210.1186/s12920-021-01059-xDetecting copy number variation in next generation sequencing data from diagnostic gene panelsAshish Kumar Singh0Maren Fridtjofsen Olsen1Liss Anne Solberg Lavik2Trine Vold3Finn Drabløs4Wenche Sjursen5Department of Medical Genetics, St. Olavs HospitalDepartment of Medical Genetics, St. Olavs HospitalDepartment of Medical Genetics, St. Olavs HospitalDepartment of Medical Genetics, St. Olavs HospitalDepartment of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and TechnologyDepartment of Medical Genetics, St. Olavs HospitalAbstract Background Detection of copy number variation (CNV) in genes associated with disease is important in genetic diagnostics, and next generation sequencing (NGS) technology provides data that can be used for CNV detection. However, CNV detection based on NGS data is in general not often used in diagnostic labs as the data analysis is challenging, especially with data from targeted gene panels. Wet lab methods like MLPA (MRC Holland) are widely used, but are expensive, time consuming and have gene-specific limitations. Our aim has been to develop a bioinformatic tool for CNV detection from NGS data in medical genetic diagnostic samples. Results Our computational pipeline for detection of CNVs in NGS data from targeted gene panels utilizes coverage depth of the captured regions and calculates a copy number ratio score for each region. This is computed by comparing the mean coverage of the sample with the mean coverage of the same region in other samples, defined as a pool. The pipeline selects pools for comparison dynamically from previously sequenced samples, using the pool with an average coverage depth that is nearest to the one of the samples. A sliding window-based approach is used to analyze each region, where length of sliding window and sliding distance can be chosen dynamically to increase or decrease the resolution. This helps in detecting CNVs in small or partial exons. With this pipeline we have correctly identified the CNVs in 36 positive control samples, with sensitivity of 100% and specificity of 91%. We have detected whole gene level deletion/duplication, single/multi exonic level deletion/duplication, partial exonic deletion and mosaic deletion. Since its implementation in mid-2018 it has proven its diagnostic value with more than 45 CNV findings in routine tests. Conclusions With this pipeline as part of our diagnostic practices it is now possible to detect partial, single or multi-exonic, and intragenic CNVs in all genes in our target panel. This has helped our diagnostic lab to expand the portfolio of genes where we offer CNV detection, which previously was limited by the availability of MLPA kits.https://doi.org/10.1186/s12920-021-01059-xNext generation sequencing (NGS)Copy number variation (CNV)Structural variantMultiplex ligation-dependent probe amplification (MLPA)Sliding window
spellingShingle Ashish Kumar Singh
Maren Fridtjofsen Olsen
Liss Anne Solberg Lavik
Trine Vold
Finn Drabløs
Wenche Sjursen
Detecting copy number variation in next generation sequencing data from diagnostic gene panels
BMC Medical Genomics
Next generation sequencing (NGS)
Copy number variation (CNV)
Structural variant
Multiplex ligation-dependent probe amplification (MLPA)
Sliding window
title Detecting copy number variation in next generation sequencing data from diagnostic gene panels
title_full Detecting copy number variation in next generation sequencing data from diagnostic gene panels
title_fullStr Detecting copy number variation in next generation sequencing data from diagnostic gene panels
title_full_unstemmed Detecting copy number variation in next generation sequencing data from diagnostic gene panels
title_short Detecting copy number variation in next generation sequencing data from diagnostic gene panels
title_sort detecting copy number variation in next generation sequencing data from diagnostic gene panels
topic Next generation sequencing (NGS)
Copy number variation (CNV)
Structural variant
Multiplex ligation-dependent probe amplification (MLPA)
Sliding window
url https://doi.org/10.1186/s12920-021-01059-x
work_keys_str_mv AT ashishkumarsingh detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels
AT marenfridtjofsenolsen detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels
AT lissannesolberglavik detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels
AT trinevold detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels
AT finndrabløs detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels
AT wenchesjursen detectingcopynumbervariationinnextgenerationsequencingdatafromdiagnosticgenepanels