Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
Abstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting e...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2023-11-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-023-46258-x |
_version_ | 1827770660620861440 |
---|---|
author | Apurba Shil Liron Levin Hava Golan Gal Meiri Analya Michaelovski Yair Sadaka Adi Aran Ilan Dinstein Idan Menashe |
author_facet | Apurba Shil Liron Levin Hava Golan Gal Meiri Analya Michaelovski Yair Sadaka Adi Aran Ilan Dinstein Idan Menashe |
author_sort | Apurba Shil |
collection | DOAJ |
description | Abstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants. |
first_indexed | 2024-03-11T12:42:45Z |
format | Article |
id | doaj.art-72d73c661dda4c34a4daba0b28c8aaba |
institution | Directory Open Access Journal |
issn | 2045-2322 |
language | English |
last_indexed | 2024-03-11T12:42:45Z |
publishDate | 2023-11-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj.art-72d73c661dda4c34a4daba0b28c8aaba2023-11-05T12:12:23ZengNature PortfolioScientific Reports2045-23222023-11-011311910.1038/s41598-023-46258-xComparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing dataApurba Shil0Liron Levin1Hava Golan2Gal Meiri3Analya Michaelovski4Yair Sadaka5Adi Aran6Ilan Dinstein7Idan Menashe8Department of Epidemiology, Biostatistics, and Health Community Sciences, Faculty of Health Sciences, Ben-Gurion University of the NegevBioinformatics Core Facility, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevNeuropediatric Unit, Shaare Zedek Medical CenterAzrieli National Centre for Autism and Neurodevelopment Research, Ben-Gurion University of the NegevDepartment of Epidemiology, Biostatistics, and Health Community Sciences, Faculty of Health Sciences, Ben-Gurion University of the NegevAbstract Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants.https://doi.org/10.1038/s41598-023-46258-x |
spellingShingle | Apurba Shil Liron Levin Hava Golan Gal Meiri Analya Michaelovski Yair Sadaka Adi Aran Ilan Dinstein Idan Menashe Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data Scientific Reports |
title | Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data |
title_full | Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data |
title_fullStr | Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data |
title_full_unstemmed | Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data |
title_short | Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data |
title_sort | comparison of three bioinformatics tools in the detection of asd candidate variants from whole exome sequencing data |
url | https://doi.org/10.1038/s41598-023-46258-x |
work_keys_str_mv | AT apurbashil comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT lironlevin comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT havagolan comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT galmeiri comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT analyamichaelovski comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT yairsadaka comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT adiaran comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT ilandinstein comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata AT idanmenashe comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata |