Detecting microsatellites within genomes: significant variation among algorithms

Abstract Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algor...

Full description

Bibliographic Details
Main Authors:	Rivals Eric, Leclercq Sébastien, Jarne Philippe
Format:	Article
Language:	English
Published:	BMC 2007-04-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/8/125

_version_	1818050429712334848
author	Rivals Eric Leclercq Sébastien Jarne Philippe
author_facet	Rivals Eric Leclercq Sébastien Jarne Philippe
author_sort	Rivals Eric
collection	DOAJ
description	<p>Abstract</p> <p>Background</p> <p>Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker).</p> <p>Results</p> <p>Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (<it>Saccharomyces cerevisiae</it>, <it>Neurospora crassa </it>and <it>Drosophila melanogaster</it>) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif.</p> <p>Conclusion</p> <p>Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.</p>
first_indexed	2024-12-10T10:53:20Z
format	Article
id	doaj.art-18a589f67a6a49b09f3255c957cac5cf
institution	Directory Open Access Journal
issn	1471-2105
language	English
last_indexed	2024-12-10T10:53:20Z
publishDate	2007-04-01
publisher	BMC
record_format	Article
series	BMC Bioinformatics
spelling	doaj.art-18a589f67a6a49b09f3255c957cac5cf2022-12-22T01:51:56ZengBMCBMC Bioinformatics1471-21052007-04-018112510.1186/1471-2105-8-125Detecting microsatellites within genomes: significant variation among algorithmsRivals EricLeclercq SébastienJarne Philippe<p>Abstract</p> <p>Background</p> <p>Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker).</p> <p>Results</p> <p>Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (<it>Saccharomyces cerevisiae</it>, <it>Neurospora crassa </it>and <it>Drosophila melanogaster</it>) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif.</p> <p>Conclusion</p> <p>Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.</p>http://www.biomedcentral.com/1471-2105/8/125
spellingShingle	Rivals Eric Leclercq Sébastien Jarne Philippe Detecting microsatellites within genomes: significant variation among algorithms BMC Bioinformatics
title	Detecting microsatellites within genomes: significant variation among algorithms
title_full	Detecting microsatellites within genomes: significant variation among algorithms
title_fullStr	Detecting microsatellites within genomes: significant variation among algorithms
title_full_unstemmed	Detecting microsatellites within genomes: significant variation among algorithms
title_short	Detecting microsatellites within genomes: significant variation among algorithms
title_sort	detecting microsatellites within genomes significant variation among algorithms
url	http://www.biomedcentral.com/1471-2105/8/125
work_keys_str_mv	AT rivalseric detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms AT leclercqsebastien detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms AT jarnephilippe detectingmicrosatelliteswithingenomessignificantvariationamongalgorithms

Detecting microsatellites within genomes: significant variation among algorithms

Similar Items