Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients

Abstract Inherited retinal degenerations (IRDs) are a heterogeneous group of predominantly monogenic disorders with over 300 causative genes identified. Short-read exome sequencing is commonly used to genotypically diagnose patients with clinical features of IRDs, however, in up to 30% of patients w...

Full description

Bibliographic Details
Main Authors: Kenji Nakamichi, Russell N. Van Gelder, Jennifer R. Chao, Debarshi Mustafi
Format: Article
Language:English
Published: Nature Portfolio 2023-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-35791-4
_version_ 1797817970392563712
author Kenji Nakamichi
Russell N. Van Gelder
Jennifer R. Chao
Debarshi Mustafi
author_facet Kenji Nakamichi
Russell N. Van Gelder
Jennifer R. Chao
Debarshi Mustafi
author_sort Kenji Nakamichi
collection DOAJ
description Abstract Inherited retinal degenerations (IRDs) are a heterogeneous group of predominantly monogenic disorders with over 300 causative genes identified. Short-read exome sequencing is commonly used to genotypically diagnose patients with clinical features of IRDs, however, in up to 30% of patients with autosomal recessive IRDs, one or no disease-causing variants are identified. Furthermore, chromosomal maps cannot be reconstructed for allelic variant discovery with short-reads. Long-read genome sequencing can provide complete coverage of disease loci and a targeted approach can focus sequencing bandwidth to a genomic region of interest to provide increased depth and haplotype reconstruction to uncover cases of missing heritability. We demonstrate that targeted adaptive long-read sequencing on the Oxford Nanopore Technologies (ONT) platform of the USH2A gene from three probands in a family with the most common cause of the syndromic IRD, Usher Syndrome, resulted in greater than 12-fold target gene sequencing enrichment on average. This focused depth of sequencing allowed for haplotype reconstruction and phased variant identification. We further show that variants obtained from the haplotype-aware genotyping pipeline can be heuristically ranked to focus on potential pathogenic candidates without a priori knowledge of the disease-causing variants. Moreover, consideration of the variants unique to targeted long-read sequencing that are not covered by short-read technology demonstrated higher precision and F1 scores for variant discovery by long-read sequencing. This work establishes that targeted adaptive long-read sequencing can generate targeted, chromosome-phased data sets for identification of coding and non-coding disease-causing alleles in IRDs and can be applicable to other Mendelian diseases.
first_indexed 2024-03-13T09:01:22Z
format Article
id doaj.art-2d58e0200ccd4dd5a2cd7df9cc9d1f13
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-13T09:01:22Z
publishDate 2023-05-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-2d58e0200ccd4dd5a2cd7df9cc9d1f132023-05-28T11:16:43ZengNature PortfolioScientific Reports2045-23222023-05-011311910.1038/s41598-023-35791-4Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patientsKenji Nakamichi0Russell N. Van Gelder1Jennifer R. Chao2Debarshi Mustafi3Department of Ophthalmology, Roger and Karalis Johnson Retina Center, University of WashingtonDepartment of Ophthalmology, Roger and Karalis Johnson Retina Center, University of WashingtonDepartment of Ophthalmology, Roger and Karalis Johnson Retina Center, University of WashingtonDepartment of Ophthalmology, Roger and Karalis Johnson Retina Center, University of WashingtonAbstract Inherited retinal degenerations (IRDs) are a heterogeneous group of predominantly monogenic disorders with over 300 causative genes identified. Short-read exome sequencing is commonly used to genotypically diagnose patients with clinical features of IRDs, however, in up to 30% of patients with autosomal recessive IRDs, one or no disease-causing variants are identified. Furthermore, chromosomal maps cannot be reconstructed for allelic variant discovery with short-reads. Long-read genome sequencing can provide complete coverage of disease loci and a targeted approach can focus sequencing bandwidth to a genomic region of interest to provide increased depth and haplotype reconstruction to uncover cases of missing heritability. We demonstrate that targeted adaptive long-read sequencing on the Oxford Nanopore Technologies (ONT) platform of the USH2A gene from three probands in a family with the most common cause of the syndromic IRD, Usher Syndrome, resulted in greater than 12-fold target gene sequencing enrichment on average. This focused depth of sequencing allowed for haplotype reconstruction and phased variant identification. We further show that variants obtained from the haplotype-aware genotyping pipeline can be heuristically ranked to focus on potential pathogenic candidates without a priori knowledge of the disease-causing variants. Moreover, consideration of the variants unique to targeted long-read sequencing that are not covered by short-read technology demonstrated higher precision and F1 scores for variant discovery by long-read sequencing. This work establishes that targeted adaptive long-read sequencing can generate targeted, chromosome-phased data sets for identification of coding and non-coding disease-causing alleles in IRDs and can be applicable to other Mendelian diseases.https://doi.org/10.1038/s41598-023-35791-4
spellingShingle Kenji Nakamichi
Russell N. Van Gelder
Jennifer R. Chao
Debarshi Mustafi
Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
Scientific Reports
title Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
title_full Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
title_fullStr Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
title_full_unstemmed Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
title_short Targeted adaptive long-read sequencing for discovery of complex phased variants in inherited retinal disease patients
title_sort targeted adaptive long read sequencing for discovery of complex phased variants in inherited retinal disease patients
url https://doi.org/10.1038/s41598-023-35791-4
work_keys_str_mv AT kenjinakamichi targetedadaptivelongreadsequencingfordiscoveryofcomplexphasedvariantsininheritedretinaldiseasepatients
AT russellnvangelder targetedadaptivelongreadsequencingfordiscoveryofcomplexphasedvariantsininheritedretinaldiseasepatients
AT jenniferrchao targetedadaptivelongreadsequencingfordiscoveryofcomplexphasedvariantsininheritedretinaldiseasepatients
AT debarshimustafi targetedadaptivelongreadsequencingfordiscoveryofcomplexphasedvariantsininheritedretinaldiseasepatients