A test for the statistical significance of DNA sequence similarities for application in databank searches.

A method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described t...

Full description

Bibliographic Details
Main Authors: Mott, R, Kirkwood, T, Curnow, R
Format: Journal article
Language:English
Published: 1989
_version_ 1826280141031997440
author Mott, R
Kirkwood, T
Curnow, R
author_facet Mott, R
Kirkwood, T
Curnow, R
author_sort Mott, R
collection OXFORD
description A method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described to calculate the power of the test, i.e. the probability of detecting a given similarity as being statistically significant. The effects on the power of the test of the scoring method, word length, sequence length, and sequence composition are examined. A novel scoring method is shown to be superior to the method currently used in most word-searching algorithms.
first_indexed 2024-03-07T00:09:15Z
format Journal article
id oxford-uuid:78a25f65-545d-425b-841d-620f3d1c6bb0
institution University of Oxford
language English
last_indexed 2024-03-07T00:09:15Z
publishDate 1989
record_format dspace
spelling oxford-uuid:78a25f65-545d-425b-841d-620f3d1c6bb02022-03-26T20:32:01ZA test for the statistical significance of DNA sequence similarities for application in databank searches.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:78a25f65-545d-425b-841d-620f3d1c6bb0EnglishSymplectic Elements at Oxford1989Mott, RKirkwood, TCurnow, RA method is developed, based on word-searching, which provides a rapid test for the statistical significance of DNA sequence similarities for use in databank searching. The method makes allowance for the lengths and dinucleotide compositions of the sequences being compared. A way is also described to calculate the power of the test, i.e. the probability of detecting a given similarity as being statistically significant. The effects on the power of the test of the scoring method, word length, sequence length, and sequence composition are examined. A novel scoring method is shown to be superior to the method currently used in most word-searching algorithms.
spellingShingle Mott, R
Kirkwood, T
Curnow, R
A test for the statistical significance of DNA sequence similarities for application in databank searches.
title A test for the statistical significance of DNA sequence similarities for application in databank searches.
title_full A test for the statistical significance of DNA sequence similarities for application in databank searches.
title_fullStr A test for the statistical significance of DNA sequence similarities for application in databank searches.
title_full_unstemmed A test for the statistical significance of DNA sequence similarities for application in databank searches.
title_short A test for the statistical significance of DNA sequence similarities for application in databank searches.
title_sort test for the statistical significance of dna sequence similarities for application in databank searches
work_keys_str_mv AT mottr atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches
AT kirkwoodt atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches
AT curnowr atestforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches
AT mottr testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches
AT kirkwoodt testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches
AT curnowr testforthestatisticalsignificanceofdnasequencesimilaritiesforapplicationindatabanksearches