Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering

The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online tex...

Full description

Bibliographic Details
Main Authors: Rana Alqaisi, Wasel Ghanem, Aziz Qaroush
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9303358/
_version_ 1828112041187999744
author Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
author_facet Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
author_sort Rana Alqaisi
collection DOAJ
description The increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.
first_indexed 2024-04-11T11:44:27Z
format Article
id doaj.art-68f2698fd810493fa3b87a3993d9f18a
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-11T11:44:27Z
publishDate 2020-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-68f2698fd810493fa3b87a3993d9f18a2022-12-22T04:25:40ZengIEEEIEEE Access2169-35362020-01-01822820622822410.1109/ACCESS.2020.30464949303358Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid ClusteringRana Alqaisi0https://orcid.org/0000-0003-4298-803XWasel Ghanem1https://orcid.org/0000-0003-2409-4729Aziz Qaroush2https://orcid.org/0000-0002-0274-4121Department of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineDepartment of Electrical and Computer Engineering, Birzeit University, Birzeit, PalestineThe increasing usage of the Internet and social networks has produced a significant amount of online textual data. These online textual data led to information overload and redundancy. It is important to eliminate the information redundancy and preserve the time required for reading these online textual data. Thus, there is a persistent need for an automatic text summarization system, which extract the relevant and salient information from a collection of documents, that sharing the same or related topics. Then, presenting this extracted information in a condensed form to preserve the main topics. This paper proposes an automatic, generic, and extractive Arabic multi-document summarization system. The proposed system employs the clustering-based and evolutionary multi-objective optimization methods. The clustering-based method discovers the main topics in the text, while the evolutionary multi-objective optimization method optimizes three objectives based on coverage, diversity/redundancy, and relevancy. The performance of the proposed system is evaluated using TAC 2011 and DUC 2002 datasets. The experimental results are compared using ROUGE evaluation measure. The obtained results showed the effectiveness of the proposed system compared to other peer systems. The proposed system outperformed other peer systems for all ROUGE metrics using TAC 2011. We achieved an F-measure of 38.9%, 17.7%, 35.4%, and 15.8% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively. In addition, the proposed system with DUC 2002 dataset achieved an F-measure of 47.1%, 23.7%, 47.1%, 20.4% for Rouge-1, Rouge-2, Rouge-L, and Rouge-SU4, respectively.https://ieeexplore.ieee.org/document/9303358/Natural language processingextractive text summarizationmulti-objective optimizationmaximum coverage and relevancyless redundancy
spellingShingle Rana Alqaisi
Wasel Ghanem
Aziz Qaroush
Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
IEEE Access
Natural language processing
extractive text summarization
multi-objective optimization
maximum coverage and relevancy
less redundancy
title Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_full Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_fullStr Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_full_unstemmed Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_short Extractive Multi-Document Arabic Text Summarization Using Evolutionary Multi-Objective Optimization With K-Medoid Clustering
title_sort extractive multi document arabic text summarization using evolutionary multi objective optimization with k medoid clustering
topic Natural language processing
extractive text summarization
multi-objective optimization
maximum coverage and relevancy
less redundancy
url https://ieeexplore.ieee.org/document/9303358/
work_keys_str_mv AT ranaalqaisi extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering
AT waselghanem extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering
AT azizqaroush extractivemultidocumentarabictextsummarizationusingevolutionarymultiobjectiveoptimizationwithkmedoidclustering