Tests for the statistical significance of protein sequence similarities in data-bank searches.

A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino ac...

Full description

Bibliographic Details
Main Authors:	Mott, R, Kirkwood, T, Curnow, R
Format:	Journal article
Language:	English
Published:	1990

_version_	1797097860820369408
author	Mott, R Kirkwood, T Curnow, R
author_facet	Mott, R Kirkwood, T Curnow, R
author_sort	Mott, R
collection	OXFORD
description	A suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.
first_indexed	2024-03-07T05:01:20Z
format	Journal article
id	oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce955
institution	University of Oxford
language	English
last_indexed	2024-03-07T05:01:20Z
publishDate	1990
record_format	dspace
spelling	oxford-uuid:d86122bb-b155-4bc2-8c59-e78a532ce9552022-03-27T08:48:07ZTests for the statistical significance of protein sequence similarities in data-bank searches.Journal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:d86122bb-b155-4bc2-8c59-e78a532ce955EnglishSymplectic Elements at Oxford1990Mott, RKirkwood, TCurnow, RA suite of tests to evaluate the statistical significance of protein sequence similarities is developed for use in data bank searches. The tests are based on the Wilbur-Lipman word-search algorithm, and take into account the sequence lengths and compositions, and optionally the weighting of amino acid matches. The method is extended to allow for the existence of a sequence insertion/deletion within the region of similarity. The accuracy of statistical distributions underlying the tests is validated using randomly generated sequences and real sequences selected at random from the data banks. A computer program to perform the tests is briefly described.
spellingShingle	Mott, R Kirkwood, T Curnow, R Tests for the statistical significance of protein sequence similarities in data-bank searches.
title	Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_full	Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_fullStr	Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_full_unstemmed	Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_short	Tests for the statistical significance of protein sequence similarities in data-bank searches.
title_sort	tests for the statistical significance of protein sequence similarities in data bank searches
work_keys_str_mv	AT mottr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches AT kirkwoodt testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches AT curnowr testsforthestatisticalsignificanceofproteinsequencesimilaritiesindatabanksearches

Tests for the statistical significance of protein sequence similarities in data-bank searches.

Similar Items