Information categorisation in biological sequence alignments

This is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the...

Full description

Bibliographic Details
Main Authors: Gunewardena, S, Jeavons, P
Format: Report
Published: Oxford University Computing Laboratory 2004
_version_ 1797084247893213184
author Gunewardena, S
Jeavons, P
author_facet Gunewardena, S
Jeavons, P
author_sort Gunewardena, S
collection OXFORD
description This is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the different sequence alignment strategies reported in the literature used to enhance the standard dynamic programming algorithm for sequence alignment to suit biological sequences. A short discussion on how alignments are scored is given. Finally, some of the existing sequence alignment tools are described.<p>The second part of this report presents a critical analysis of information as it relates to biological sequence alignment. Information relating to the sequences being aligned form the basis on which any alignment is built. In its basic form this information might quantify how individual residues are scored when aligned with each other or how gaps are scored when introduced between two residues. Every biological sequence has if not explicit, at least some form of implicit information relating to its residues that form distinguishing markers along the sequence. There are many ways of extracting this information such as from databases of the relevant sequences, from the literature, prior processing etc. It is reasonable to assume that the more sequence information we use in an alignment, the more confidant we can be of the resulting alignment, and hence make better hypothesis of the unknown sequences. The aim of this part of the report is to build a framework on how to represent this information in such a way as to facilitate the dynamic and flexible incorporation of it to facilitate sequence alignments.</p>
first_indexed 2024-03-07T01:52:46Z
format Report
id oxford-uuid:9ab07177-4776-4762-8aeb-dff0fd3498d0
institution University of Oxford
last_indexed 2024-03-07T01:52:46Z
publishDate 2004
publisher Oxford University Computing Laboratory
record_format dspace
spelling oxford-uuid:9ab07177-4776-4762-8aeb-dff0fd3498d02022-03-27T00:23:04ZInformation categorisation in biological sequence alignmentsReporthttp://purl.org/coar/resource_type/c_93fcuuid:9ab07177-4776-4762-8aeb-dff0fd3498d0Department of Computer ScienceOxford University Computing Laboratory2004Gunewardena, SJeavons, PThis is a two-part report. In the first part we introduce the reader to biological sequence alignment. We discus dynamic programming as is used in sequence alignment, first in the case of two sequences and later, how it is adopted for multiple sequence alignment. Several references are given to the different sequence alignment strategies reported in the literature used to enhance the standard dynamic programming algorithm for sequence alignment to suit biological sequences. A short discussion on how alignments are scored is given. Finally, some of the existing sequence alignment tools are described.<p>The second part of this report presents a critical analysis of information as it relates to biological sequence alignment. Information relating to the sequences being aligned form the basis on which any alignment is built. In its basic form this information might quantify how individual residues are scored when aligned with each other or how gaps are scored when introduced between two residues. Every biological sequence has if not explicit, at least some form of implicit information relating to its residues that form distinguishing markers along the sequence. There are many ways of extracting this information such as from databases of the relevant sequences, from the literature, prior processing etc. It is reasonable to assume that the more sequence information we use in an alignment, the more confidant we can be of the resulting alignment, and hence make better hypothesis of the unknown sequences. The aim of this part of the report is to build a framework on how to represent this information in such a way as to facilitate the dynamic and flexible incorporation of it to facilitate sequence alignments.</p>
spellingShingle Gunewardena, S
Jeavons, P
Information categorisation in biological sequence alignments
title Information categorisation in biological sequence alignments
title_full Information categorisation in biological sequence alignments
title_fullStr Information categorisation in biological sequence alignments
title_full_unstemmed Information categorisation in biological sequence alignments
title_short Information categorisation in biological sequence alignments
title_sort information categorisation in biological sequence alignments
work_keys_str_mv AT gunewardenas informationcategorisationinbiologicalsequencealignments
AT jeavonsp informationcategorisationinbiologicalsequencealignments