A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference

<p>We show how positional markers can be used to encode genetic variation within a Burrows-Wheeler Transform (BWT), and use this to construct a generalisation of the traditional "reference genome", incorporating known variation within a species. Our goal is to support the inference o...

Full description

Bibliographic Details
Main Authors: Iqbal, Z, Maciuca, S, del Ojo Elias, C, McVean, G
Format: Conference item
Published: Springer 2016
_version_ 1797088776985509888
author Iqbal, Z
Maciuca, S
del Ojo Elias, C
McVean, G
author_facet Iqbal, Z
Maciuca, S
del Ojo Elias, C
McVean, G
author_sort Iqbal, Z
collection OXFORD
description <p>We show how positional markers can be used to encode genetic variation within a Burrows-Wheeler Transform (BWT), and use this to construct a generalisation of the traditional "reference genome", incorporating known variation within a species. Our goal is to support the inference of the closest mosaic of previously known sequences to the genome(s) under analysis.</p> <br/> <p>Our scheme results in an increased alphabet size, and by using a wavelet tree encoding of the BWT we reduce the performance impact on rank operations. We give a specialised form of the backward search that allows variation-aware exact matching. We implement this, and demonstrate the cost of constructing an index of the whole human genome with 8 million genetic variants is 25GB of RAM. We also show that inferring a closer reference can close large kilobase-scale coverage gaps in P. falciparum.</p>
first_indexed 2024-03-07T02:54:54Z
format Conference item
id oxford-uuid:aeef5bed-d1e4-4298-9dbd-961384b72edf
institution University of Oxford
last_indexed 2024-03-07T02:54:54Z
publishDate 2016
publisher Springer
record_format dspace
spelling oxford-uuid:aeef5bed-d1e4-4298-9dbd-961384b72edf2022-03-27T03:46:12ZA natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inferenceConference itemhttp://purl.org/coar/resource_type/c_5794uuid:aeef5bed-d1e4-4298-9dbd-961384b72edfSymplectic Elements at OxfordSpringer2016Iqbal, ZMaciuca, Sdel Ojo Elias, CMcVean, G<p>We show how positional markers can be used to encode genetic variation within a Burrows-Wheeler Transform (BWT), and use this to construct a generalisation of the traditional "reference genome", incorporating known variation within a species. Our goal is to support the inference of the closest mosaic of previously known sequences to the genome(s) under analysis.</p> <br/> <p>Our scheme results in an increased alphabet size, and by using a wavelet tree encoding of the BWT we reduce the performance impact on rank operations. We give a specialised form of the backward search that allows variation-aware exact matching. We implement this, and demonstrate the cost of constructing an index of the whole human genome with 8 million genetic variants is 25GB of RAM. We also show that inferring a closer reference can close large kilobase-scale coverage gaps in P. falciparum.</p>
spellingShingle Iqbal, Z
Maciuca, S
del Ojo Elias, C
McVean, G
A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title_full A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title_fullStr A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title_full_unstemmed A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title_short A natural encoding of genetic variation in a Burrows-Wheeler Transform to enable mapping and genome inference
title_sort natural encoding of genetic variation in a burrows wheeler transform to enable mapping and genome inference
work_keys_str_mv AT iqbalz anaturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT maciucas anaturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT delojoeliasc anaturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT mcveang anaturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT iqbalz naturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT maciucas naturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT delojoeliasc naturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference
AT mcveang naturalencodingofgeneticvariationinaburrowswheelertransformtoenablemappingandgenomeinference