Corpora Annotated with Negation: An Overview

Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presen...

Full description

Bibliographic Details
Main Authors: Jiménez-Zafra, Salud María, Morante, Roser, Teresa Martín-Valdivia, María, Ureña-López, L. Alfonso
Format: Article
Language:English
Published: The MIT Press 2020-03-01
Series:Computational Linguistics
Online Access:https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371
_version_ 1811284400646651904
author Jiménez-Zafra, Salud María
Morante, Roser
Teresa Martín-Valdivia, María
Ureña-López, L. Alfonso
author_facet Jiménez-Zafra, Salud María
Morante, Roser
Teresa Martín-Valdivia, María
Ureña-López, L. Alfonso
author_sort Jiménez-Zafra, Salud María
collection DOAJ
description Negation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presence of languages other than English on the Internet, such as Chinese or Spanish, is greater every day. In this study, we present a review of the corpora annotated with negation information in several languages with the goal of evaluating what aspects of negation have been annotated and how compatible the corpora are. We conclude that it is very difficult to merge the existing corpora because we found differences in the annotation schemes used, and most importantly, in the annotation guidelines: the way in which each corpus was tokenized and the negation elements that have been annotated. Differently than for other well established tasks like semantic role labeling or parsing, for negation there is no standard annotation scheme nor guidelines, which hampers progress in its treatment.
first_indexed 2024-04-13T02:28:05Z
format Article
id doaj.art-fdd5021215764924b448200864eccb8d
institution Directory Open Access Journal
issn 0891-2017
1530-9312
language English
last_indexed 2024-04-13T02:28:05Z
publishDate 2020-03-01
publisher The MIT Press
record_format Article
series Computational Linguistics
spelling doaj.art-fdd5021215764924b448200864eccb8d2022-12-22T03:06:43ZengThe MIT PressComputational Linguistics0891-20171530-93122020-03-0146115210.1162/coli_a_00371Corpora Annotated with Negation: An OverviewJiménez-Zafra, Salud MaríaMorante, RoserTeresa Martín-Valdivia, MaríaUreña-López, L. AlfonsoNegation is a universal linguistic phenomenon with a great qualitative impact on natural language processing applications. The availability of corpora annotated with negation is essential to training negation processing systems. Currently, most corpora have been annotated for English, but the presence of languages other than English on the Internet, such as Chinese or Spanish, is greater every day. In this study, we present a review of the corpora annotated with negation information in several languages with the goal of evaluating what aspects of negation have been annotated and how compatible the corpora are. We conclude that it is very difficult to merge the existing corpora because we found differences in the annotation schemes used, and most importantly, in the annotation guidelines: the way in which each corpus was tokenized and the negation elements that have been annotated. Differently than for other well established tasks like semantic role labeling or parsing, for negation there is no standard annotation scheme nor guidelines, which hampers progress in its treatment.https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371
spellingShingle Jiménez-Zafra, Salud María
Morante, Roser
Teresa Martín-Valdivia, María
Ureña-López, L. Alfonso
Corpora Annotated with Negation: An Overview
Computational Linguistics
title Corpora Annotated with Negation: An Overview
title_full Corpora Annotated with Negation: An Overview
title_fullStr Corpora Annotated with Negation: An Overview
title_full_unstemmed Corpora Annotated with Negation: An Overview
title_short Corpora Annotated with Negation: An Overview
title_sort corpora annotated with negation an overview
url https://www.mitpressjournals.org/doi/abs/10.1162/coli_a_00371
work_keys_str_mv AT jimenezzafrasaludmaria corporaannotatedwithnegationanoverview
AT moranteroser corporaannotatedwithnegationanoverview
AT teresamartinvaldiviamaria corporaannotatedwithnegationanoverview
AT urenalopezlalfonso corporaannotatedwithnegationanoverview