A multiway analysis for identifying high integrity bovine BACs
<p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2009-01-01
|
Series: | BMC Genomics |
Online Access: | http://www.biomedcentral.com/1471-2164/10/46 |
_version_ | 1811286769174315008 |
---|---|
author | McEwan John C Brauning Rudiger McWilliam Sean Barris Wesley Ratnakumar Abhirami Snelling Warren M Dalrymple Brian P |
author_facet | McEwan John C Brauning Rudiger McWilliam Sean Barris Wesley Ratnakumar Abhirami Snelling Warren M Dalrymple Brian P |
author_sort | McEwan John C |
collection | DOAJ |
description | <p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined.</p> <p>Results</p> <p>The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects.</p> <p>Conclusion</p> <p>Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.</p> |
first_indexed | 2024-04-13T03:05:48Z |
format | Article |
id | doaj.art-bb02d5fe36f54d4fbdcb07a510d4af11 |
institution | Directory Open Access Journal |
issn | 1471-2164 |
language | English |
last_indexed | 2024-04-13T03:05:48Z |
publishDate | 2009-01-01 |
publisher | BMC |
record_format | Article |
series | BMC Genomics |
spelling | doaj.art-bb02d5fe36f54d4fbdcb07a510d4af112022-12-22T03:05:15ZengBMCBMC Genomics1471-21642009-01-011014610.1186/1471-2164-10-46A multiway analysis for identifying high integrity bovine BACsMcEwan John CBrauning RudigerMcWilliam SeanBarris WesleyRatnakumar AbhiramiSnelling Warren MDalrymple Brian P<p>Abstract</p> <p>Background</p> <p>In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined.</p> <p>Results</p> <p>The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects.</p> <p>Conclusion</p> <p>Our analysis shows that ~95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (~27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.</p>http://www.biomedcentral.com/1471-2164/10/46 |
spellingShingle | McEwan John C Brauning Rudiger McWilliam Sean Barris Wesley Ratnakumar Abhirami Snelling Warren M Dalrymple Brian P A multiway analysis for identifying high integrity bovine BACs BMC Genomics |
title | A multiway analysis for identifying high integrity bovine BACs |
title_full | A multiway analysis for identifying high integrity bovine BACs |
title_fullStr | A multiway analysis for identifying high integrity bovine BACs |
title_full_unstemmed | A multiway analysis for identifying high integrity bovine BACs |
title_short | A multiway analysis for identifying high integrity bovine BACs |
title_sort | multiway analysis for identifying high integrity bovine bacs |
url | http://www.biomedcentral.com/1471-2164/10/46 |
work_keys_str_mv | AT mcewanjohnc amultiwayanalysisforidentifyinghighintegritybovinebacs AT brauningrudiger amultiwayanalysisforidentifyinghighintegritybovinebacs AT mcwilliamsean amultiwayanalysisforidentifyinghighintegritybovinebacs AT barriswesley amultiwayanalysisforidentifyinghighintegritybovinebacs AT ratnakumarabhirami amultiwayanalysisforidentifyinghighintegritybovinebacs AT snellingwarrenm amultiwayanalysisforidentifyinghighintegritybovinebacs AT dalrymplebrianp amultiwayanalysisforidentifyinghighintegritybovinebacs AT mcewanjohnc multiwayanalysisforidentifyinghighintegritybovinebacs AT brauningrudiger multiwayanalysisforidentifyinghighintegritybovinebacs AT mcwilliamsean multiwayanalysisforidentifyinghighintegritybovinebacs AT barriswesley multiwayanalysisforidentifyinghighintegritybovinebacs AT ratnakumarabhirami multiwayanalysisforidentifyinghighintegritybovinebacs AT snellingwarrenm multiwayanalysisforidentifyinghighintegritybovinebacs AT dalrymplebrianp multiwayanalysisforidentifyinghighintegritybovinebacs |