Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks

The crucial aspect of extractive document summarization lies in understanding the interrelations between sentences. Documents inherently comprise a multitude of sentences, and sentence-level models frequently fail to consider the relationships between distantly-placed sentences, resulting in the omi...

Full description

Bibliographic Details
Main Authors:	Wu Su, Jin Jiang, Kaihui Huang
Format:	Article
Language:	English
Published:	PeerJ Inc. 2023-12-01
Series:	PeerJ Computer Science
Subjects:	Extractive summarization Graph neural networks Adaptive Graph attention network
Online Access:	https://peerj.com/articles/cs-1737.pdf

_version_	1797389887418138624
author	Wu Su Jin Jiang Kaihui Huang
author_facet	Wu Su Jin Jiang Kaihui Huang
author_sort	Wu Su
collection	DOAJ
description	The crucial aspect of extractive document summarization lies in understanding the interrelations between sentences. Documents inherently comprise a multitude of sentences, and sentence-level models frequently fail to consider the relationships between distantly-placed sentences, resulting in the omission of significant information in the summary. Moreover, information within documents tends to be distributed sparsely, challenging the efficacy of sentence-level models. In the realm of heterogeneous graph neural networks, it has been observed that semantic nodes with varying levels of granularity encapsulate distinct semantic connections. Initially, the incorporation of edge features into the computation of dynamic graph attention networks is performed to account for node relationships. Subsequently, given the multiplicity of topics in a document or a set of documents, a topic model is employed to extract topic-specific features and the probability distribution linking these topics with sentence nodes. Last but not least, the model defines nodes with different levels of granularity—ranging from documents and topics to sentences—and these various nodes necessitate different propagation widths and depths for capturing intricate relationships in the information being disseminated. Adaptive measures are taken to learn the importance and correlation between nodes of different granularities in terms of both width and depth. Experimental evidence from two benchmark datasets highlights the superior performance of the proposed model, as assessed by ROUGE metrics, in comparison to existing approaches, even in the absence of pre-trained language models. Additionally, an ablation study confirms the positive impact of each individual module on the model's ROUGE scores.
first_indexed	2024-03-08T23:02:36Z
format	Article
id	doaj.art-82d69208f841443eb4b88c6fbd78aa34
institution	Directory Open Access Journal
issn	2376-5992
language	English
last_indexed	2024-03-08T23:02:36Z
publishDate	2023-12-01
publisher	PeerJ Inc.
record_format	Article
series	PeerJ Computer Science
spelling	doaj.art-82d69208f841443eb4b88c6fbd78aa342023-12-15T15:05:10ZengPeerJ Inc.PeerJ Computer Science2376-59922023-12-019e173710.7717/peerj-cs.1737Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networksWu SuJin JiangKaihui HuangThe crucial aspect of extractive document summarization lies in understanding the interrelations between sentences. Documents inherently comprise a multitude of sentences, and sentence-level models frequently fail to consider the relationships between distantly-placed sentences, resulting in the omission of significant information in the summary. Moreover, information within documents tends to be distributed sparsely, challenging the efficacy of sentence-level models. In the realm of heterogeneous graph neural networks, it has been observed that semantic nodes with varying levels of granularity encapsulate distinct semantic connections. Initially, the incorporation of edge features into the computation of dynamic graph attention networks is performed to account for node relationships. Subsequently, given the multiplicity of topics in a document or a set of documents, a topic model is employed to extract topic-specific features and the probability distribution linking these topics with sentence nodes. Last but not least, the model defines nodes with different levels of granularity—ranging from documents and topics to sentences—and these various nodes necessitate different propagation widths and depths for capturing intricate relationships in the information being disseminated. Adaptive measures are taken to learn the importance and correlation between nodes of different granularities in terms of both width and depth. Experimental evidence from two benchmark datasets highlights the superior performance of the proposed model, as assessed by ROUGE metrics, in comparison to existing approaches, even in the absence of pre-trained language models. Additionally, an ablation study confirms the positive impact of each individual module on the model's ROUGE scores.https://peerj.com/articles/cs-1737.pdfExtractive summarizationGraph neural networksAdaptiveGraph attention network
spellingShingle	Wu Su Jin Jiang Kaihui Huang Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks PeerJ Computer Science Extractive summarization Graph neural networks Adaptive Graph attention network
title	Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
title_full	Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
title_fullStr	Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
title_full_unstemmed	Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
title_short	Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks
title_sort	multi granularity adaptive extractive document summarization with heterogeneous graph neural networks
topic	Extractive summarization Graph neural networks Adaptive Graph attention network
url	https://peerj.com/articles/cs-1737.pdf
work_keys_str_mv	AT wusu multigranularityadaptiveextractivedocumentsummarizationwithheterogeneousgraphneuralnetworks AT jinjiang multigranularityadaptiveextractivedocumentsummarizationwithheterogeneousgraphneuralnetworks AT kaihuihuang multigranularityadaptiveextractivedocumentsummarizationwithheterogeneousgraphneuralnetworks

Multi-granularity adaptive extractive document summarization with heterogeneous graph neural networks

Similar Items