MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences

Microsatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown h...

Full description

Bibliographic Details
Main Author: Carlos M. Martínez Ortiz
Format: Article
Language:English
Published: ECIMED 2019-01-01
Series:Revista Cubana de Informática Médica
Online Access:http://revinformatica.sld.cu/index.php/rcim/article/view/302
_version_ 1798044629828894720
author Carlos M. Martínez Ortiz
author_facet Carlos M. Martínez Ortiz
author_sort Carlos M. Martínez Ortiz
collection DOAJ
description Microsatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown how pathogens can evade the immune response by simply altering the composition of repeat sequences in their genes. There are numerous computer applications for the detection of these sequences, but they do not meet all expectations due to the divergence of criteria and approaches applied to solving the problem of their detection. MIDAS implements a non-heuristic solution based on two combinatorial algorithms in series: the first one detects exact microsatellites, and the second one, if the model parameters allow it, extends the sequences to their optimal inaccurate version. The application has as input the genomic sequence in GBFF or FASTA format and its output provides the microsatellite positions in the genomic sequence, as well as sizes, alignments, flanks and other statistics. The algorithm is highly efficient and comprehensive, detecting all possible repeat sequences regardless of their nucleotide composition.<br /><strong>Keywords:</strong> SSR; microsatellite; molecular marker; data mining; algorithms
first_indexed 2024-04-11T23:07:18Z
format Article
id doaj.art-3252ff71c0364112afacc1d3be8d5c06
institution Directory Open Access Journal
issn 1684-1859
language English
last_indexed 2024-04-11T23:07:18Z
publishDate 2019-01-01
publisher ECIMED
record_format Article
series Revista Cubana de Informática Médica
spelling doaj.art-3252ff71c0364112afacc1d3be8d5c062022-12-22T03:57:58ZengECIMEDRevista Cubana de Informática Médica1684-18592019-01-01102174MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequencesCarlos M. Martínez OrtizMicrosatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown how pathogens can evade the immune response by simply altering the composition of repeat sequences in their genes. There are numerous computer applications for the detection of these sequences, but they do not meet all expectations due to the divergence of criteria and approaches applied to solving the problem of their detection. MIDAS implements a non-heuristic solution based on two combinatorial algorithms in series: the first one detects exact microsatellites, and the second one, if the model parameters allow it, extends the sequences to their optimal inaccurate version. The application has as input the genomic sequence in GBFF or FASTA format and its output provides the microsatellite positions in the genomic sequence, as well as sizes, alignments, flanks and other statistics. The algorithm is highly efficient and comprehensive, detecting all possible repeat sequences regardless of their nucleotide composition.<br /><strong>Keywords:</strong> SSR; microsatellite; molecular marker; data mining; algorithmshttp://revinformatica.sld.cu/index.php/rcim/article/view/302
spellingShingle Carlos M. Martínez Ortiz
MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
Revista Cubana de Informática Médica
title MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
title_full MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
title_fullStr MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
title_full_unstemmed MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
title_short MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
title_sort midas computer application for the identification of exact and inaccurate microsatellites in genomic sequences
url http://revinformatica.sld.cu/index.php/rcim/article/view/302
work_keys_str_mv AT carlosmmartinezortiz midascomputerapplicationfortheidentificationofexactandinaccuratemicrosatellitesingenomicsequences