Information categorisation in biological sequence alignments

This is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the...

Full description

Bibliographic Details
Main Authors:	Gunewardena, S, Jeavons, P
Format:	Report
Published:	Oxford University Computing Laboratory 2004

_version_	1797084247893213184
author	Gunewardena, S Jeavons, P
author_facet	Gunewardena, S Jeavons, P
author_sort	Gunewardena, S
collection	OXFORD
description	This is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the different sequence alignment strategies reported in the literature used to enhance the standard dynamic programming algorithm for sequence alignment to suit biological sequences. A short discussion on how alignments are scored is given. Finally, some of the existing sequence alignment tools are described.<p>The second part of this report presents a critical analysis of information as it relates to biological sequence alignment. Information relating to the sequences being aligned form the basis on which any alignment is built. In its basic form this information might quantify how individual residues are scored when aligned with each other or how gaps are scored when introduced between two residues. Every biological sequence has if not explicit, at least some form of implicit information relating to its residues that form distinguishing markers along the sequence. There are many ways of extracting this information such as from databases of the relevant sequences, from the literature, prior processing etc. It is reasonable to assume that the more sequence information we use in an alignment, the more confidant we can be of the resulting alignment, and hence make better hypothesis of the unknown sequences. The aim of this part of the report is to build a framework on how to represent this information in such a way as to facilitate the dynamic and flexible incorporation of it to facilitate sequence alignments.</p>
first_indexed	2024-03-07T01:52:46Z
format	Report
id	oxford-uuid:9ab07177-4776-4762-8aeb-dff0fd3498d0
institution	University of Oxford
last_indexed	2024-03-07T01:52:46Z
publishDate	2004
publisher	Oxford University Computing Laboratory
record_format	dspace
spelling	oxford-uuid:9ab07177-4776-4762-8aeb-dff0fd3498d02022-03-27T00:23:04ZInformation categorisation in biological sequence alignmentsReporthttp://purl.org/coar/resource_type/c_93fcuuid:9ab07177-4776-4762-8aeb-dff0fd3498d0Department of Computer ScienceOxford University Computing Laboratory2004Gunewardena, SJeavons, PThis is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the different sequence alignment strategies reported in the literature used to enhance the standard dynamic programming algorithm for sequence alignment to suit biological sequences. A short discussion on how alignments are scored is given. Finally, some of the existing sequence alignment tools are described.<p>The second part of this report presents a critical analysis of information as it relates to biological sequence alignment. Information relating to the sequences being aligned form the basis on which any alignment is built. In its basic form this information might quantify how individual residues are scored when aligned with each other or how gaps are scored when introduced between two residues. Every biological sequence has if not explicit, at least some form of implicit information relating to its residues that form distinguishing markers along the sequence. There are many ways of extracting this information such as from databases of the relevant sequences, from the literature, prior processing etc. It is reasonable to assume that the more sequence information we use in an alignment, the more confidant we can be of the resulting alignment, and hence make better hypothesis of the unknown sequences. The aim of this part of the report is to build a framework on how to represent this information in such a way as to facilitate the dynamic and flexible incorporation of it to facilitate sequence alignments.</p>
spellingShingle	Gunewardena, S Jeavons, P Information categorisation in biological sequence alignments
title	Information categorisation in biological sequence alignments
title_full	Information categorisation in biological sequence alignments
title_fullStr	Information categorisation in biological sequence alignments
title_full_unstemmed	Information categorisation in biological sequence alignments
title_short	Information categorisation in biological sequence alignments
title_sort	information categorisation in biological sequence alignments
work_keys_str_mv	AT gunewardenas informationcategorisationinbiologicalsequencealignments AT jeavonsp informationcategorisationinbiologicalsequencealignments

Information categorisation in biological sequence alignments

Similar Items