Corpora Annotated with Negation: An Overview
Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presen...
Main Authors: | Jiménez-Zafra, Salud María; Morante, Roser; Teresa Martín-Valdivia, María; Ureña-López, L. Alfonso |
---|---|
Format: | Article |
Language: | English |
Published: | The MIT Press, 2020-03-01 |
Series: | Computational Linguistics |
Online Access: | https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371 |
_version_ | 1811284400646651904 |
---|---|
author | Jiménez-Zafra, Salud María; Morante, Roser; Teresa Martín-Valdivia, María; Ureña-López, L. Alfonso |
author_facet | Jiménez-Zafra, Salud María; Morante, Roser; Teresa Martín-Valdivia, María; Ureña-López, L. Alfonso |
author_sort | Jiménez-Zafra, Salud María |
collection | DOAJ |
description | Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presence of languages other than English on the Internet, such as Chinese or Spanish, is greater every day. In this study, we present a review of the corpora annotated with negation information in several languages with the goal of evaluating what aspects of negation have been annotated and how compatible the corpora are. We conclude that it is very difficult to merge the existing corpora because we found differences in the annotation schemes used, and most importantly, in the annotation guidelines: the way in which each corpus was tokenized and the negation elements that have been annotated. Differently than for other well established tasks like semantic role labeling or parsing, for negation there is no standard annotation scheme nor guidelines, which hampers progress in its treatment. |
first_indexed | 2024-04-13T02:28:05Z |
format | Article |
id | doaj.art-fdd5021215764924b448200864eccb8d |
institution | Directory Open Access Journal |
issn | 0891-2017 1530-9312 |
language | English |
last_indexed | 2024-04-13T02:28:05Z |
publishDate | 2020-03-01 |
publisher | The MIT Press |
record_format | Article |
series | Computational Linguistics |
spelling | doaj.art-fdd5021215764924b448200864eccb8d; 2022-12-22T03:06:43Z; eng; The MIT Press; Computational Linguistics; 0891-2017; 1530-9312; 2020-03-01; 46; 1; 1; 52; 10.1162/coli_a_00371; Corpora Annotated with Negation: An Overview; Jiménez-Zafra, Salud María; Morante, Roser; Teresa Martín-Valdivia, María; Ureña-López, L. Alfonso; Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presence of languages other than English on the Internet, such as Chinese or Spanish, is greater every day. In this study, we present a review of the corpora annotated with negation information in several languages with the goal of evaluating what aspects of negation have been annotated and how compatible the corpora are. We conclude that it is very difficult to merge the existing corpora because we found differences in the annotation schemes used, and most importantly, in the annotation guidelines: the way in which each corpus was tokenized and the negation elements that have been annotated. Differently than for other well established tasks like semantic role labeling or parsing, for negation there is no standard annotation scheme nor guidelines, which hampers progress in its treatment.; https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371 |
spellingShingle | Jiménez-Zafra, Salud María; Morante, Roser; Teresa Martín-Valdivia, María; Ureña-López, L. Alfonso; Corpora Annotated with Negation: An Overview; Computational Linguistics |
title | Corpora Annotated with Negation: An Overview |
title_sort | corpora annotated with negation an overview |
url | https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371 |