Evaluating a typology of signals for automatic detection of complementarity

In a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identif...

Full description

Bibliographic Details
Main Authors: Jackson Wilke da Cruz Souza, Ariani Di Felippo
Format: Article
Language:English
Published: Programa de Pós-Graduação em Estudos Linguísticos 2022-09-01
Series:Domínios de Lingu@gem
Subjects:
Online Access:https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776
_version_ 1797759558907592704
author Jackson Wilke da Cruz Souza
Ariani Di Felippo
author_facet Jackson Wilke da Cruz Souza
Ariani Di Felippo
author_sort Jackson Wilke da Cruz Souza
collection DOAJ
description In a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identification of the multi-document phenomena and their correspondent CST relations is definitely handy for Automatic Multi-Document Summarization since it helps computers understand text meaning. In this paper, we evaluated a typology of (textual) signals for the automatic detection of the CST relations of complementarity (i.e., Historical background, Follow-up and Elaboration) in a multi-document corpus of news texts in Brazilian Portuguese. Using algorithms from different machine-learning paradigms, we obtained classifiers that achieved high general accuracy (higher than 90%), indicating the potential of the signals.
first_indexed 2024-03-12T18:46:09Z
format Article
id doaj.art-db03c93c888942298e28fb70bbb344ee
institution Directory Open Access Journal
issn 1980-5799
language English
last_indexed 2024-03-12T18:46:09Z
publishDate 2022-09-01
publisher Programa de Pós-Graduação em Estudos Linguísticos
record_format Article
series Domínios de Lingu@gem
spelling doaj.art-db03c93c888942298e28fb70bbb344ee2023-08-02T07:37:59ZengPrograma de Pós-Graduação em Estudos LinguísticosDomínios de Lingu@gem1980-57992022-09-011641517154310.14393/DL52-v16n4a2022-1035920Evaluating a typology of signals for automatic detection of complementarityJackson Wilke da Cruz Souza0https://orcid.org/0000-0003-1881-6780Ariani Di Felippo1https://orcid.org/0000-0002-4566-9352UNIFAL-MGUFSCarIn a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identification of the multi-document phenomena and their correspondent CST relations is definitely handy for Automatic Multi-Document Summarization since it helps computers understand text meaning. In this paper, we evaluated a typology of (textual) signals for the automatic detection of the CST relations of complementarity (i.e., Historical background, Follow-up and Elaboration) in a multi-document corpus of news texts in Brazilian Portuguese. Using algorithms from different machine-learning paradigms, we obtained classifiers that achieved high general accuracy (higher than 90%), indicating the potential of the signals.https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776cross-document structure theoryautomatic summarizationmulti-document corpuscomplementaritytextual signal
spellingShingle Jackson Wilke da Cruz Souza
Ariani Di Felippo
Evaluating a typology of signals for automatic detection of complementarity
Domínios de Lingu@gem
cross-document structure theory
automatic summarization
multi-document corpus
complementarity
textual signal
title Evaluating a typology of signals for automatic detection of complementarity
title_full Evaluating a typology of signals for automatic detection of complementarity
title_fullStr Evaluating a typology of signals for automatic detection of complementarity
title_full_unstemmed Evaluating a typology of signals for automatic detection of complementarity
title_short Evaluating a typology of signals for automatic detection of complementarity
title_sort evaluating a typology of signals for automatic detection of complementarity
topic cross-document structure theory
automatic summarization
multi-document corpus
complementarity
textual signal
url https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776
work_keys_str_mv AT jacksonwilkedacruzsouza evaluatingatypologyofsignalsforautomaticdetectionofcomplementarity
AT arianidifelippo evaluatingatypologyofsignalsforautomaticdetectionofcomplementarity