MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences
Microsatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown h...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
ECIMED
2019-01-01
|
Series: | Revista Cubana de Informática Médica |
Online Access: | http://revinformatica.sld.cu/index.php/rcim/article/view/302 |
_version_ | 1798044629828894720 |
---|---|
author | Carlos M. Martínez Ortiz |
author_facet | Carlos M. Martínez Ortiz |
author_sort | Carlos M. Martínez Ortiz |
collection | DOAJ |
description | Microsatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown how pathogens can evade the immune response by simply altering the composition of repeat sequences in their genes. There are numerous computer applications for the detection of these sequences, but they do not meet all expectations due to the divergence of criteria and approaches applied to solving the problem of their detection. MIDAS implements a non-heuristic solution based on two combinatorial algorithms in series: the first one detects exact microsatellites, and the second one, if the model parameters allow it, extends the sequences to their optimal inaccurate version. The application has as input the genomic sequence in GBFF or FASTA format and its output provides the microsatellite positions in the genomic sequence, as well as sizes, alignments, flanks and other statistics. The algorithm is highly efficient and comprehensive, detecting all possible repeat sequences regardless of their nucleotide composition.<br /><strong>Keywords:</strong> SSR; microsatellite; molecular marker; data mining; algorithms |
first_indexed | 2024-04-11T23:07:18Z |
format | Article |
id | doaj.art-3252ff71c0364112afacc1d3be8d5c06 |
institution | Directory Open Access Journal |
issn | 1684-1859 |
language | English |
last_indexed | 2024-04-11T23:07:18Z |
publishDate | 2019-01-01 |
publisher | ECIMED |
record_format | Article |
series | Revista Cubana de Informática Médica |
spelling | doaj.art-3252ff71c0364112afacc1d3be8d5c062022-12-22T03:57:58ZengECIMEDRevista Cubana de Informática Médica1684-18592019-01-01102174MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequencesCarlos M. Martínez OrtizMicrosatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown how pathogens can evade the immune response by simply altering the composition of repeat sequences in their genes. There are numerous computer applications for the detection of these sequences, but they do not meet all expectations due to the divergence of criteria and approaches applied to solving the problem of their detection. MIDAS implements a non-heuristic solution based on two combinatorial algorithms in series: the first one detects exact microsatellites, and the second one, if the model parameters allow it, extends the sequences to their optimal inaccurate version. The application has as input the genomic sequence in GBFF or FASTA format and its output provides the microsatellite positions in the genomic sequence, as well as sizes, alignments, flanks and other statistics. The algorithm is highly efficient and comprehensive, detecting all possible repeat sequences regardless of their nucleotide composition.<br /><strong>Keywords:</strong> SSR; microsatellite; molecular marker; data mining; algorithmshttp://revinformatica.sld.cu/index.php/rcim/article/view/302 |
spellingShingle | Carlos M. Martínez Ortiz MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences Revista Cubana de Informática Médica |
title | MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
title_full | MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
title_fullStr | MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
title_full_unstemmed | MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
title_short | MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
title_sort | midas computer application for the identification of exact and inaccurate microsatellites in genomic sequences |
url | http://revinformatica.sld.cu/index.php/rcim/article/view/302 |
work_keys_str_mv | AT carlosmmartinezortiz midascomputerapplicationfortheidentificationofexactandinaccuratemicrosatellitesingenomicsequences |