A graph neural network-based interpretable framework reveals a novel DNA fragility–associated chromatin structural unit

Abstract Background DNA double-strand breaks (DSBs) are among the most deleterious DNA lesions, and they can cause cancer if improperly repaired. Recent chromosome conformation capture techniques, such as Hi-C, have enabled the identification of relationships between the 3D chromatin structure and D...

Full description

Bibliographic Details
Main Authors: Yu Sun, Xiang Xu, Lin Lin, Kang Xu, Yang Zheng, Chao Ren, Huan Tao, Xu Wang, Huan Zhao, Weiwei Tu, Xuemei Bai, Junting Wang, Qiya Huang, Yaru Li, Hebing Chen, Hao Li, Xiaochen Bo
Format: Article
Language:English
Published: BMC 2023-04-01
Series:Genome Biology
Subjects:
Online Access:https://doi.org/10.1186/s13059-023-02916-x
Description
Summary:Abstract Background DNA double-strand breaks (DSBs) are among the most deleterious DNA lesions, and they can cause cancer if improperly repaired. Recent chromosome conformation capture techniques, such as Hi-C, have enabled the identification of relationships between the 3D chromatin structure and DSBs, but little is known about how to explain these relationships, especially from global contact maps, or their contributions to DSB formation. Results Here, we propose a framework that integrates graph neural network (GNN) to unravel the relationship between 3D chromatin structure and DSBs using an advanced interpretable technique GNNExplainer. We identify a new chromatin structural unit named the DNA fragility–associated chromatin interaction network (FaCIN). FaCIN is a bottleneck-like structure, and it helps to reveal a universal form of how the fragility of a piece of DNA might be affected by the whole genome through chromatin interactions. Moreover, we demonstrate that neck interactions in FaCIN can serve as chromatin structural determinants of DSB formation. Conclusions Our study provides a more systematic and refined view enabling a better understanding of the mechanisms of DSB formation under the context of the 3D genome.
ISSN:1474-760X