Characterization of the Illumina EPIC array for optimal applications in epigenetic research targeting diverse human populations

Bibliographic Details
Main Authors: Zhou Zhang, Chang Zeng, Wei Zhang
Format: Article
Language: English
Published: BMC 2022-12-01
Series: Epigenetics Communications
Subjects:
Online Access: https://doi.org/10.1186/s43682-022-00015-9
Description
Summary: The Illumina EPIC array is widely used for high-throughput profiling of DNA cytosine modifications in human samples, covering more than 850,000 modification sites across various genomic features. The application of this platform is expected to provide novel insights into the epigenetic contribution to human complex traits and diseases. Considering the diverse inter-population genetic and epigenetic variation, the research community would benefit from a comprehensive characterization of this platform's applicability to major global populations. Specifically, we mapped 866,836 CpG probes from the EPIC array to the human genome reference. We detected 91,034 CpG probes that did not align reliably to the human genome reference. In addition, 21,256 CpG probes were found to map ambiguously to multiple loci in the human genome, and 448 probes had inaccurate genomic information in the original Illumina annotations. We further characterized the uniquely mapped CpG probes in terms of whether they contained common genetic variants, i.e., single nucleotide polymorphisms (SNPs), in major global populations, using data from the 1000 Genomes Project. A list of optimal CpG probes on the EPIC array was generated for major global populations, with the aim of providing a resource to facilitate future studies of diverse human populations. In conclusion, our analysis indicated that studies of diverse human populations using the EPIC array would benefit from taking into account the technical features of this platform.
ISSN:2730-7034
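
The probe-selection logic outlined in the summary (drop probes that do not align, drop probes that map to multiple loci, then drop uniquely mapped probes that overlap a common SNP in the population of interest) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `Hit` record, the function names, the toy probe IDs, and the 0.05 allele-frequency cutoff are all assumptions made for illustration, and the check for inaccurate Illumina annotations is omitted.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Hit:
    """One alignment of a probe sequence (hypothetical record layout)."""
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # 0-based, exclusive


# A SNP is (chromosome, 0-based position, {population: minor allele frequency}).
Snp = Tuple[str, int, Dict[str, float]]


def classify_probe(hits: List[Hit]) -> str:
    """Label a probe by how many genomic loci it aligned to."""
    if not hits:
        return "unaligned"
    if len(hits) > 1:
        return "multi_mapped"
    return "unique"


def overlaps_common_snp(hit: Hit, snps: List[Snp], population: str,
                        maf_cutoff: float = 0.05) -> bool:
    """True if any SNP inside the probe is common in the given population.

    The 0.05 cutoff is an assumed value, not one taken from the article.
    """
    for chrom, pos, maf_by_pop in snps:
        inside = chrom == hit.chrom and hit.start <= pos < hit.end
        if inside and maf_by_pop.get(population, 0.0) >= maf_cutoff:
            return True
    return False


def optimal_probes(alignments: Dict[str, List[Hit]], snps: List[Snp],
                   population: str, maf_cutoff: float = 0.05) -> List[str]:
    """Keep uniquely mapped probes with no common SNP in the population."""
    keep = []
    for probe_id, hits in alignments.items():
        if classify_probe(hits) != "unique":
            continue  # unaligned or ambiguous probes are excluded
        if overlaps_common_snp(hits[0], snps, population, maf_cutoff):
            continue  # a common variant may interfere with probe binding
        keep.append(probe_id)
    return sorted(keep)


# Toy data with made-up probe IDs, coordinates, and frequencies.
toy_alignments = {
    "probe_A": [Hit("chr1", 100, 150)],
    "probe_B": [],
    "probe_C": [Hit("chr1", 500, 550), Hit("chr7", 900, 950)],
    "probe_D": [Hit("chr2", 200, 250)],
}
toy_snps = [("chr2", 210, {"EUR": 0.20, "AFR": 0.01})]

if __name__ == "__main__":
    for pop in ("EUR", "AFR"):
        print(pop, optimal_probes(toy_alignments, toy_snps, pop))
```

Because the SNP filter is applied per population, the same probe can be retained for one population and excluded for another, which is why a separate list per population is useful.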