A test for the statistical significance of DNA sequence similarities for application in databank searches.
A method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described t...
Main Authors: | , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
1989
|
_version_ | 1826280141031997440 |
---|---|
author | Mott, R Kirkwood, T Curnow, R |
author_facet | Mott, R Kirkwood, T Curnow, R |
author_sort | Mott, R |
collection | OXFORD |
description | A method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described to calculate the power of the test, i.e. the probability of detecting a given similarity as being statistically significant. The effects on the power of the test of the scoring method, word length, sequence length, and sequence composition are examined. A novel scoring method is shown to be superior to the method currently used in most word-searching algorithms. |
first_indexed | 2024-03-07T00:09:15Z |
format | Journal article |
id | oxford-uuid:78a25f65-545d-425b-841d-620f3d1c6bb0 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T00:09:15Z |
publishDate | 1989 |
record_format | dspace |
spelling | oxford-uuid:78a25f65-545d-425b-841d-620f3d1c6bb02022-03-26T20:32:01ZA test for the statistical significance of DNA sequence similarities for application in databank searches.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:78a25f65-545d-425b-841d-620f3d1c6bb0EnglishSymplectic Elements at Oxford1989Mott, RKirkwood, TCurnow, RA method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described to calculate the power of the test, i.e. the probability of detecting a given similarity as being statistically significant. The effects on the power of the test of the scoring method, word length, sequence length, and sequence composition are examined. A novel scoring method is shown to be superior to the method currently used in most word-searching algorithms. |
spellingShingle | Mott, R Kirkwood, T Curnow, R A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title | A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title_full | A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title_fullStr | A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title_full_unstemmed | A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title_short | A test for the statistical significance of DNA sequence similarities for application in databank searches. |
title_sort | test for the statistical significance of dna sequence similarities for application in databank searches |
work_keys_str_mv | AT mottr atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches AT kirkwoodt atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches AT curnowr atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches AT mottr testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches AT kirkwoodt testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches AT curnowr testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches |