A multiway analysis for identifying high integrity bovine BACs

<p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of...

Full description

Bibliographic Details
Main Authors: McEwan John C, Brauning Rudiger, McWilliam Sean, Barris Wesley, Ratnakumar Abhirami, Snelling Warren M, Dalrymple Brian P
Format: Article
Language:English
Published: BMC 2009-01-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/10/46
_version_ 1811286769174315008
author McEwan John C
Brauning Rudiger
McWilliam Sean
Barris Wesley
Ratnakumar Abhirami
Snelling Warren M
Dalrymple Brian P
author_facet McEwan John C
Brauning Rudiger
McWilliam Sean
Barris Wesley
Ratnakumar Abhirami
Snelling Warren M
Dalrymple Brian P
author_sort McEwan John C
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined.</p> <p>Results</p> <p>The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects.</p> <p>Conclusion</p> <p>Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.</p>
first_indexed 2024-04-13T03:05:48Z
format Article
id doaj.art-bb02d5fe36f54d4fbdcb07a510d4af11
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-04-13T03:05:48Z
publishDate 2009-01-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-bb02d5fe36f54d4fbdcb07a510d4af112022-12-22T03:05:15ZengBMCBMC Genomics1471-21642009-01-011014610.1186/1471-2164-10-46A multiway analysis for identifying high integrity bovine BACsMcEwan John CBrauning RudigerMcWilliam SeanBarris WesleyRatnakumar AbhiramiSnelling Warren MDalrymple Brian P<p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined.</p> <p>Results</p> <p>The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects.</p> <p>Conclusion</p> <p>Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.</p>http://www.biomedcentral.com/1471-2164/10/46
spellingShingle McEwan John C
Brauning Rudiger
McWilliam Sean
Barris Wesley
Ratnakumar Abhirami
Snelling Warren M
Dalrymple Brian P
A multiway analysis for identifying high integrity bovine BACs
BMC Genomics
title A multiway analysis for identifying high integrity bovine BACs
title_full A multiway analysis for identifying high integrity bovine BACs
title_fullStr A multiway analysis for identifying high integrity bovine BACs
title_full_unstemmed A multiway analysis for identifying high integrity bovine BACs
title_short A multiway analysis for identifying high integrity bovine BACs
title_sort multiway analysis for identifying high integrity bovine bacs
url http://www.biomedcentral.com/1471-2164/10/46
work_keys_str_mv AT mcewanjohnc amultiwayanalysisforidentifyinghighintegritybovinebacs
AT brauningrudiger amultiwayanalysisforidentifyinghighintegritybovinebacs
AT mcwilliamsean amultiwayanalysisforidentifyinghighintegritybovinebacs
AT barriswesley amultiwayanalysisforidentifyinghighintegritybovinebacs
AT ratnakumarabhirami amultiwayanalysisforidentifyinghighintegritybovinebacs
AT snellingwarrenm amultiwayanalysisforidentifyinghighintegritybovinebacs
AT dalrymplebrianp amultiwayanalysisforidentifyinghighintegritybovinebacs
AT mcewanjohnc multiwayanalysisforidentifyinghighintegritybovinebacs
AT brauningrudiger multiwayanalysisforidentifyinghighintegritybovinebacs
AT mcwilliamsean multiwayanalysisforidentifyinghighintegritybovinebacs
AT barriswesley multiwayanalysisforidentifyinghighintegritybovinebacs
AT ratnakumarabhirami multiwayanalysisforidentifyinghighintegritybovinebacs
AT snellingwarrenm multiwayanalysisforidentifyinghighintegritybovinebacs
AT dalrymplebrianp multiwayanalysisforidentifyinghighintegritybovinebacs