New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s)
G-quadruplexes (G4s) are non-canonical DNA and RNA secondary structures that control gene regulation. A single nucleotide polymorphism (SNP) is a small genetic variation occurring within a DNA sequence and accounting for the variabilities between individuals. While the majority of SNPs, especially t...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2022-01-01
|
Series: | MethodsX |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2215016122002540 |
_version_ | 1797980852090568704 |
---|---|
author | Mona SAAD Marc Shebaby Cybel Mehawej Wissam Faour |
author_facet | Mona SAAD Marc Shebaby Cybel Mehawej Wissam Faour |
author_sort | Mona SAAD |
collection | DOAJ |
description | G-quadruplexes (G4s) are non-canonical DNA and RNA secondary structures that control gene regulation. A single nucleotide polymorphism (SNP) is a small genetic variation occurring within a DNA sequence and accounting for the variabilities between individuals. While the majority of SNPs, especially those frequent in the population, are considered as benign genetic variations, few others can lead to diseases. SNPs occurring in G4 sequences were reported to modulate gene regulation. In order to find overlaps between predicted G4 sequences and SNPs located in the genomic regions, we developed two complementary computational python codes (SNP-locator and G4-overlap). The codes map a mutation to the overlapping/closest G4 sequences, based on the genetic variant name and the FASTA format of the corresponding gene. We validated these two codes on a set of 31 SNP variants occurring in cytochromes P450 genes and podocytes-marker genes. Out of 31 SNPs, 28 were accurately located using the mentioned codes. • SNP-locator code locates any SNP in promoters, upstream regulatory regions, exons and introns. • The SNP-locator code requires the FASTA genomic sequence of the studied gene and the genetic variant nomenclature at the cDNA level. • G4-overlap code maps the SNP to the overlapping or the closest G4 sequence. |
first_indexed | 2024-04-11T06:01:24Z |
format | Article |
id | doaj.art-eff607ffdace4e1185fc9af2fc75786c |
institution | Directory Open Access Journal |
issn | 2215-0161 |
language | English |
last_indexed | 2024-04-11T06:01:24Z |
publishDate | 2022-01-01 |
publisher | Elsevier |
record_format | Article |
series | MethodsX |
spelling | doaj.art-eff607ffdace4e1185fc9af2fc75786c2022-12-22T04:41:40ZengElsevierMethodsX2215-01612022-01-019101875New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s)Mona SAAD0Marc Shebaby1Cybel Mehawej2Wissam Faour3Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Byblos, LebanonSchool of Arts and Sciences, Lebanese American University, Byblos, LebanonDepartment of Human Genetics, Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Byblos, LebanonGilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Byblos, Lebanon; Corresponding author.G-quadruplexes (G4s) are non-canonical DNA and RNA secondary structures that control gene regulation. A single nucleotide polymorphism (SNP) is a small genetic variation occurring within a DNA sequence and accounting for the variabilities between individuals. While the majority of SNPs, especially those frequent in the population, are considered as benign genetic variations, few others can lead to diseases. SNPs occurring in G4 sequences were reported to modulate gene regulation. In order to find overlaps between predicted G4 sequences and SNPs located in the genomic regions, we developed two complementary computational python codes (SNP-locator and G4-overlap). The codes map a mutation to the overlapping/closest G4 sequences, based on the genetic variant name and the FASTA format of the corresponding gene. We validated these two codes on a set of 31 SNP variants occurring in cytochromes P450 genes and podocytes-marker genes. Out of 31 SNPs, 28 were accurately located using the mentioned codes. • SNP-locator code locates any SNP in promoters, upstream regulatory regions, exons and introns. • The SNP-locator code requires the FASTA genomic sequence of the studied gene and the genetic variant nomenclature at the cDNA level. • G4-overlap code maps the SNP to the overlapping or the closest G4 sequence.http://www.sciencedirect.com/science/article/pii/S2215016122002540New complementary python codes to locate Single nucleotide polymorphism SNP and overlapping G-quadruplexes |
spellingShingle | Mona SAAD Marc Shebaby Cybel Mehawej Wissam Faour New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) MethodsX New complementary python codes to locate Single nucleotide polymorphism SNP and overlapping G-quadruplexes |
title | New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) |
title_full | New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) |
title_fullStr | New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) |
title_full_unstemmed | New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) |
title_short | New complementary python codes to locate Single Nucleotide Polymorphisms (SNPs) and Overlapping G-Quadruplex Sequences (G4s) |
title_sort | new complementary python codes to locate single nucleotide polymorphisms snps and overlapping g quadruplex sequences g4s |
topic | New complementary python codes to locate Single nucleotide polymorphism SNP and overlapping G-quadruplexes |
url | http://www.sciencedirect.com/science/article/pii/S2215016122002540 |
work_keys_str_mv | AT monasaad newcomplementarypythoncodestolocatesinglenucleotidepolymorphismssnpsandoverlappinggquadruplexsequencesg4s AT marcshebaby newcomplementarypythoncodestolocatesinglenucleotidepolymorphismssnpsandoverlappinggquadruplexsequencesg4s AT cybelmehawej newcomplementarypythoncodestolocatesinglenucleotidepolymorphismssnpsandoverlappinggquadruplexsequencesg4s AT wissamfaour newcomplementarypythoncodestolocatesinglenucleotidepolymorphismssnpsandoverlappinggquadruplexsequencesg4s |