Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA

Abstract Background Directed DNA methylation on N6-adenine (6mA), N4-cytosine (4mC), and C5-cytosine (5mC) can potentially increase DNA coding capacity and regulate a variety of biological functions. These modifications are relatively abundant in bacteria, occurring in about a percent of all bases o...

Full description

Bibliographic Details
Main Authors: Zach K. O’Brown, Konstantinos Boulias, Jie Wang, Simon Yuan Wang, Natasha M. O’Brown, Ziyang Hao, Hiroki Shibuya, Paul-Enguerrand Fady, Yang Shi, Chuan He, Sean G. Megason, Tao Liu, Eric L. Greer
Format: Article
Language:English
Published: BMC 2019-06-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-019-5754-6
_version_ 1818083764932182016
author Zach K. O’Brown
Konstantinos Boulias
Jie Wang
Simon Yuan Wang
Natasha M. O’Brown
Ziyang Hao
Hiroki Shibuya
Paul-Enguerrand Fady
Yang Shi
Chuan He
Sean G. Megason
Tao Liu
Eric L. Greer
author_facet Zach K. O’Brown
Konstantinos Boulias
Jie Wang
Simon Yuan Wang
Natasha M. O’Brown
Ziyang Hao
Hiroki Shibuya
Paul-Enguerrand Fady
Yang Shi
Chuan He
Sean G. Megason
Tao Liu
Eric L. Greer
author_sort Zach K. O’Brown
collection DOAJ
description Abstract Background Directed DNA methylation on N6-adenine (6mA), N4-cytosine (4mC), and C5-cytosine (5mC) can potentially increase DNA coding capacity and regulate a variety of biological functions. These modifications are relatively abundant in bacteria, occurring in about a percent of all bases of most bacteria. Until recently, 5mC and its oxidized derivatives were thought to be the only directed DNA methylation events in metazoa. New and more sensitive detection techniques (ultra-high performance liquid chromatography coupled with mass spectrometry (UHPLC-ms/ms) and single molecule real-time sequencing (SMRTseq)) have suggested that 6mA and 4mC modifications could be present in a variety of metazoa. Results Here, we find that both of these techniques are prone to inaccuracies, which overestimate DNA methylation concentrations in metazoan genomic DNA. Artifacts can arise from methylated bacterial DNA contamination of enzyme preparations used to digest DNA and contaminating bacterial DNA in eukaryotic DNA preparations. Moreover, DNA sonication introduces a novel modified base from 5mC that has a retention time near 4mC that can be confused with 4mC. Our analyses also suggest that SMRTseq systematically overestimates 4mC in prokaryotic and eukaryotic DNA and 6mA in DNA samples in which it is rare. Using UHPLC-ms/ms designed to minimize and subtract artifacts, we find low to undetectable levels of 4mC and 6mA in genomes of representative worms, insects, amphibians, birds, rodents and primates under normal growth conditions. We also find that mammalian cells incorporate exogenous methylated nucleosides into their genome, suggesting that a portion of 6mA modifications could derive from incorporation of nucleosides from bacteria in food or microbiota. However, gDNA samples from gnotobiotic mouse tissues found rare (0.9–3.7 ppm) 6mA modifications above background. Conclusions Altogether these data demonstrate that 6mA and 4mC are rarer in metazoa than previously reported, and highlight the importance of careful sample preparation and measurement, and need for more accurate sequencing techniques.
first_indexed 2024-12-10T19:43:11Z
format Article
id doaj.art-5f5c2596b8554fddb265cdf456fcd9f1
institution Directory Open Access Journal
issn 1471-2164
language English
last_indexed 2024-12-10T19:43:11Z
publishDate 2019-06-01
publisher BMC
record_format Article
series BMC Genomics
spelling doaj.art-5f5c2596b8554fddb265cdf456fcd9f12022-12-22T01:35:58ZengBMCBMC Genomics1471-21642019-06-0120111510.1186/s12864-019-5754-6Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNAZach K. O’Brown0Konstantinos Boulias1Jie Wang2Simon Yuan Wang3Natasha M. O’Brown4Ziyang Hao5Hiroki Shibuya6Paul-Enguerrand Fady7Yang Shi8Chuan He9Sean G. Megason10Tao Liu11Eric L. Greer12Division of Newborn Medicine, Boston Children’s HospitalDivision of Newborn Medicine, Boston Children’s HospitalDepartment of Biochemistry and Biostatistics, University at Buffalo Center of Excellence in Bioinformatics & Life SciencesDivision of Newborn Medicine, Boston Children’s HospitalDepartment of Neurobiology, Harvard Medical SchoolDepartment of Chemistry and Howard Hughes Medical Institute, The University of ChicagoDivision of Newborn Medicine, Boston Children’s HospitalDivision of Newborn Medicine, Boston Children’s HospitalDivision of Newborn Medicine, Boston Children’s HospitalDepartment of Chemistry and Howard Hughes Medical Institute, The University of ChicagoDepartment of Systems Biology, Harvard Medical SchoolDepartment of Biochemistry and Biostatistics, University at Buffalo Center of Excellence in Bioinformatics & Life SciencesDivision of Newborn Medicine, Boston Children’s HospitalAbstract Background Directed DNA methylation on N6-adenine (6mA), N4-cytosine (4mC), and C5-cytosine (5mC) can potentially increase DNA coding capacity and regulate a variety of biological functions. These modifications are relatively abundant in bacteria, occurring in about a percent of all bases of most bacteria. Until recently, 5mC and its oxidized derivatives were thought to be the only directed DNA methylation events in metazoa. New and more sensitive detection techniques (ultra-high performance liquid chromatography coupled with mass spectrometry (UHPLC-ms/ms) and single molecule real-time sequencing (SMRTseq)) have suggested that 6mA and 4mC modifications could be present in a variety of metazoa. Results Here, we find that both of these techniques are prone to inaccuracies, which overestimate DNA methylation concentrations in metazoan genomic DNA. Artifacts can arise from methylated bacterial DNA contamination of enzyme preparations used to digest DNA and contaminating bacterial DNA in eukaryotic DNA preparations. Moreover, DNA sonication introduces a novel modified base from 5mC that has a retention time near 4mC that can be confused with 4mC. Our analyses also suggest that SMRTseq systematically overestimates 4mC in prokaryotic and eukaryotic DNA and 6mA in DNA samples in which it is rare. Using UHPLC-ms/ms designed to minimize and subtract artifacts, we find low to undetectable levels of 4mC and 6mA in genomes of representative worms, insects, amphibians, birds, rodents and primates under normal growth conditions. We also find that mammalian cells incorporate exogenous methylated nucleosides into their genome, suggesting that a portion of 6mA modifications could derive from incorporation of nucleosides from bacteria in food or microbiota. However, gDNA samples from gnotobiotic mouse tissues found rare (0.9–3.7 ppm) 6mA modifications above background. Conclusions Altogether these data demonstrate that 6mA and 4mC are rarer in metazoa than previously reported, and highlight the importance of careful sample preparation and measurement, and need for more accurate sequencing techniques.http://link.springer.com/article/10.1186/s12864-019-5754-6DNA epigenomeDNA N6-methyladenosine6 mADNA N4-methylcytosine4mC
spellingShingle Zach K. O’Brown
Konstantinos Boulias
Jie Wang
Simon Yuan Wang
Natasha M. O’Brown
Ziyang Hao
Hiroki Shibuya
Paul-Enguerrand Fady
Yang Shi
Chuan He
Sean G. Megason
Tao Liu
Eric L. Greer
Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
BMC Genomics
DNA epigenome
DNA N6-methyladenosine
6 mA
DNA N4-methylcytosine
4mC
title Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
title_full Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
title_fullStr Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
title_full_unstemmed Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
title_short Sources of artifact in measurements of 6mA and 4mC abundance in eukaryotic genomic DNA
title_sort sources of artifact in measurements of 6ma and 4mc abundance in eukaryotic genomic dna
topic DNA epigenome
DNA N6-methyladenosine
6 mA
DNA N4-methylcytosine
4mC
url http://link.springer.com/article/10.1186/s12864-019-5754-6
work_keys_str_mv AT zachkobrown sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT konstantinosboulias sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT jiewang sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT simonyuanwang sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT natashamobrown sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT ziyanghao sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT hirokishibuya sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT paulenguerrandfady sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT yangshi sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT chuanhe sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT seangmegason sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT taoliu sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna
AT ericlgreer sourcesofartifactinmeasurementsof6maand4mcabundanceineukaryoticgenomicdna