Evaluating a typology of signals for automatic detection of complementarity
In a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identif...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Programa de Pós-Graduação em Estudos Linguísticos
2022-09-01
|
Series: | Domínios de Lingu@gem |
Subjects: | |
Online Access: | https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776 |
_version_ | 1797759558907592704 |
---|---|
author | Jackson Wilke da Cruz Souza Ariani Di Felippo |
author_facet | Jackson Wilke da Cruz Souza Ariani Di Felippo |
author_sort | Jackson Wilke da Cruz Souza |
collection | DOAJ |
description | In a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identification of the multi-document phenomena and their correspondent CST relations is definitely handy for Automatic Multi-Document Summarization since it helps computers understand text meaning. In this paper, we evaluated a typology of (textual) signals for the automatic detection of the CST relations of complementarity (i.e., Historical background, Follow-up and Elaboration) in a multi-document corpus of news texts in Brazilian Portuguese. Using algorithms from different machine-learning paradigms, we obtained classifiers that achieved high general accuracy (higher than 90%), indicating the potential of the signals. |
first_indexed | 2024-03-12T18:46:09Z |
format | Article |
id | doaj.art-db03c93c888942298e28fb70bbb344ee |
institution | Directory Open Access Journal |
issn | 1980-5799 |
language | English |
last_indexed | 2024-03-12T18:46:09Z |
publishDate | 2022-09-01 |
publisher | Programa de Pós-Graduação em Estudos Linguísticos |
record_format | Article |
series | Domínios de Lingu@gem |
spelling | doaj.art-db03c93c888942298e28fb70bbb344ee2023-08-02T07:37:59ZengPrograma de Pós-Graduação em Estudos LinguísticosDomínios de Lingu@gem1980-57992022-09-011641517154310.14393/DL52-v16n4a2022-1035920Evaluating a typology of signals for automatic detection of complementarityJackson Wilke da Cruz Souza0https://orcid.org/0000-0003-1881-6780Ariani Di Felippo1https://orcid.org/0000-0002-4566-9352UNIFAL-MGUFSCarIn a cluster of news texts on the same event, two sentences from different documents might express different multi-document phenomena (redundancy, complementarity, and contradiction). Cross-Document Structure Theory (CST) provides labels to explicitly represent these phenomena. The automatic identification of the multi-document phenomena and their correspondent CST relations is definitely handy for Automatic Multi-Document Summarization since it helps computers understand text meaning. In this paper, we evaluated a typology of (textual) signals for the automatic detection of the CST relations of complementarity (i.e., Historical background, Follow-up and Elaboration) in a multi-document corpus of news texts in Brazilian Portuguese. Using algorithms from different machine-learning paradigms, we obtained classifiers that achieved high general accuracy (higher than 90%), indicating the potential of the signals.https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776cross-document structure theoryautomatic summarizationmulti-document corpuscomplementaritytextual signal |
spellingShingle | Jackson Wilke da Cruz Souza Ariani Di Felippo Evaluating a typology of signals for automatic detection of complementarity Domínios de Lingu@gem cross-document structure theory automatic summarization multi-document corpus complementarity textual signal |
title | Evaluating a typology of signals for automatic detection of complementarity |
title_full | Evaluating a typology of signals for automatic detection of complementarity |
title_fullStr | Evaluating a typology of signals for automatic detection of complementarity |
title_full_unstemmed | Evaluating a typology of signals for automatic detection of complementarity |
title_short | Evaluating a typology of signals for automatic detection of complementarity |
title_sort | evaluating a typology of signals for automatic detection of complementarity |
topic | cross-document structure theory automatic summarization multi-document corpus complementarity textual signal |
url | https://seer.ufu.br/index.php/dominiosdelinguagem/article/view/63776 |
work_keys_str_mv | AT jacksonwilkedacruzsouza evaluatingatypologyofsignalsforautomaticdetectionofcomplementarity AT arianidifelippo evaluatingatypologyofsignalsforautomaticdetectionofcomplementarity |