Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications

High-throughput DNA sequencing technology has transformed genetic research and is starting to make an impact on clinical practice. However, analyzing high-throughput sequencing data remains challenging, particularly in clinical settings where accuracy and turnaround times are critical. We present a...

Full description

Bibliographic Details
Main Authors: Rimmer, A, Phan, H, Mathieson, I, Iqbal, Z, Twigg, S, Wilkie, A, Mcvean, G, Lunter, G
Format: Journal article
Language:English
Published: Nature Publishing Group 2014
_version_ 1826257305823346688
author Rimmer, A
Phan, H
Mathieson, I
Iqbal, Z
Twigg, S
Wilkie, A
Mcvean, G
Lunter, G
author_facet Rimmer, A
Phan, H
Mathieson, I
Iqbal, Z
Twigg, S
Wilkie, A
Mcvean, G
Lunter, G
author_sort Rimmer, A
collection OXFORD
description High-throughput DNA sequencing technology has transformed genetic research and is starting to make an impact on clinical practice. However, analyzing high-throughput sequencing data remains challenging, particularly in clinical settings where accuracy and turnaround times are critical. We present a new approach to this problem, implemented in a software package called Platypus. Platypus achieves high sensitivity and specificity for SNPs, indels and complex polymorphisms by using local de novo assembly to generate candidate variants, followed by local realignment and probabilistic haplotype estimation. It is an order of magnitude faster than existing tools and generates calls from raw aligned read data without preprocessing. We demonstrate the performance of Platypus in clinically relevant experimental designs by comparing with SAMtools and GATK on whole-genome and exome-capture data, by identifying de novo variation in 15 parent-offspring trios with high sensitivity and specificity, and by estimating human leukocyte antigen genotypes directly from variant calls. © 2014 Nature America, Inc.
first_indexed 2024-03-06T18:16:04Z
format Journal article
id oxford-uuid:04ac0249-b692-437a-bf1b-45a317ec8459
institution University of Oxford
language English
last_indexed 2024-03-06T18:16:04Z
publishDate 2014
publisher Nature Publishing Group
record_format dspace
spelling oxford-uuid:04ac0249-b692-437a-bf1b-45a317ec84592022-03-26T08:52:57ZIntegrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applicationsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:04ac0249-b692-437a-bf1b-45a317ec8459EnglishSymplectic Elements at OxfordNature Publishing Group2014Rimmer, APhan, HMathieson, IIqbal, ZTwigg, SWilkie, AMcvean, GLunter, GHigh-throughput DNA sequencing technology has transformed genetic research and is starting to make an impact on clinical practice. However, analyzing high-throughput sequencing data remains challenging, particularly in clinical settings where accuracy and turnaround times are critical. We present a new approach to this problem, implemented in a software package called Platypus. Platypus achieves high sensitivity and specificity for SNPs, indels and complex polymorphisms by using local de novo assembly to generate candidate variants, followed by local realignment and probabilistic haplotype estimation. It is an order of magnitude faster than existing tools and generates calls from raw aligned read data without preprocessing. We demonstrate the performance of Platypus in clinically relevant experimental designs by comparing with SAMtools and GATK on whole-genome and exome-capture data, by identifying de novo variation in 15 parent-offspring trios with high sensitivity and specificity, and by estimating human leukocyte antigen genotypes directly from variant calls. © 2014 Nature America, Inc.
spellingShingle Rimmer, A
Phan, H
Mathieson, I
Iqbal, Z
Twigg, S
Wilkie, A
Mcvean, G
Lunter, G
Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title_full Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title_fullStr Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title_full_unstemmed Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title_short Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications
title_sort integrating mapping assembly and haplotype based approaches for calling variants in clinical sequencing applications
work_keys_str_mv AT rimmera integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT phanh integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT mathiesoni integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT iqbalz integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT twiggs integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT wilkiea integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT mcveang integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications
AT lunterg integratingmappingassemblyandhaplotypebasedapproachesforcallingvariantsinclinicalsequencingapplications