Tests for the statistical significance of protein sequence similarities in data-bank searches.

A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino ac...

Full description

Bibliographic Details
Main Authors: Mott, R, Kirkwood, T, Curnow, R
Format: Journal article
Language:English
Published: 1990
_version_ 1797097860820369408
author Mott, R
Kirkwood, T
Curnow, R
author_facet Mott, R
Kirkwood, T
Curnow, R
author_sort Mott, R
collection OXFORD
description A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.
first_indexed 2024-03-07T05:01:20Z
format Journal article
id oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce955
institution University of Oxford
language English
last_indexed 2024-03-07T05:01:20Z
publishDate 1990
record_format dspace
spelling oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce9552022-03-27T08:48:07ZTests for the statistical significance of protein sequence similarities in data-bank searches.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:d86122bb-b155-4bc2-8c59-e78a532ce955EnglishSymplectic Elements at Oxford1990Mott, RKirkwood, TCurnow, RA suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.
spellingShingle Mott, R
Kirkwood, T
Curnow, R
Tests for the statistical significance of protein sequence similarities in data-bank searches.
title Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_full Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_fullStr Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_full_unstemmed Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_short Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_sort tests for the statistical significance of protein sequence similarities in data bank searches
work_keys_str_mv AT mottr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches
AT kirkwoodt testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches
AT curnowr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches