Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data

Abstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting e...

Full description

Bibliographic Details
Main Authors: Apurba Shil, Liron Levin, Hava Golan, Gal Meiri, Analya Michaelovski, Yair Sadaka, Adi Aran, Ilan Dinstein, Idan Menashe
Format: Article
Language:English
Published: Nature Portfolio 2023-11-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-46258-x
_version_ 1827770660620861440
author Apurba Shil
Liron Levin
Hava Golan
Gal Meiri
Analya Michaelovski
Yair Sadaka
Adi Aran
Ilan Dinstein
Idan Menashe
author_facet Apurba Shil
Liron Levin
Hava Golan
Gal Meiri
Analya Michaelovski
Yair Sadaka
Adi Aran
Ilan Dinstein
Idan Menashe
author_sort Apurba Shil
collection DOAJ
description Abstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants.
first_indexed 2024-03-11T12:42:45Z
format Article
id doaj.art-72d73c661dda4c34a4daba0b28c8aaba
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-11T12:42:45Z
publishDate 2023-11-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-72d73c661dda4c34a4daba0b28c8aaba2023-11-05T12:12:23ZengNature PortfolioScientific Reports2045-23222023-11-011311910.1038/s41598-023-46258-xComparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing dataApurba Shil0Liron Levin1Hava Golan2Gal Meiri3Analya Michaelovski4Yair Sadaka5Adi Aran6Ilan Dinstein7Idan Menashe8Department of Epidemiology, Biostatistics, and Health Community Sciences, Faculty of Health Sciences, Ben-Gurion University of the NegevBioinformatics Core Facility, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevNeuropediatric Unit, Shaare Zedek Medical CenterAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevDepartment of Epidemiology, Biostatistics, and Health Community Sciences, Faculty of Health Sciences, Ben-Gurion University of the NegevAbstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants.https://doi.org/10.1038/s41598-023-46258-x
spellingShingle Apurba Shil
Liron Levin
Hava Golan
Gal Meiri
Analya Michaelovski
Yair Sadaka
Adi Aran
Ilan Dinstein
Idan Menashe
Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
Scientific Reports
title Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_full Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_fullStr Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_full_unstemmed Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_short Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_sort comparison of three bioinformatics tools in the detection of asd candidate variants from whole exome sequencing data
url https://doi.org/10.1038/s41598-023-46258-x
work_keys_str_mv AT apurbashil comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT lironlevin comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT havagolan comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT galmeiri comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT analyamichaelovski comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT yairsadaka comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT adiaran comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT ilandinstein comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT idanmenashe comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata