Tests for the statistical significance of protein sequence similarities in data-bank searches.
A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino ac...
Main Authors: | , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
1990
|
_version_ | 1797097860820369408 |
---|---|
author | Mott, R Kirkwood, T Curnow, R |
author_facet | Mott, R Kirkwood, T Curnow, R |
author_sort | Mott, R |
collection | OXFORD |
description | A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described. |
first_indexed | 2024-03-07T05:01:20Z |
format | Journal article |
id | oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce955 |
institution | University of Oxford |
language | English |
last_indexed | 2024-03-07T05:01:20Z |
publishDate | 1990 |
record_format | dspace |
spelling | oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce9552022-03-27T08:48:07ZTests for the statistical significance of protein sequence similarities in data-bank searches.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:d86122bb-b155-4bc2-8c59-e78a532ce955EnglishSymplectic Elements at Oxford1990Mott, RKirkwood, TCurnow, RA suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described. |
spellingShingle | Mott, R Kirkwood, T Curnow, R Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title | Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title_full | Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title_fullStr | Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title_full_unstemmed | Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title_short | Tests for the statistical significance of protein sequence similarities in data-bank searches. |
title_sort | tests for the statistical significance of protein sequence similarities in data bank searches |
work_keys_str_mv | AT mottr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches AT kirkwoodt testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches AT curnowr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches |