<i>VivesDebate</i>: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament

The application of the latest Natural Language Processing breakthroughs in computational argumentation has shown promising results, which have raised the interest in this area of research. However, the available corpora with argumentative annotations are often limited to a very specific purpose or a...

Full description

Bibliographic Details
Main Authors: Ramon Ruiz-Dolz, Montserrat Nofre, Mariona Taulé, Stella Heras, Ana García-Fornes
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/15/7160
Description
Summary:The application of the latest Natural Language Processing breakthroughs in computational argumentation has shown promising results, which have raised the interest in this area of research. However, the available corpora with argumentative annotations are often limited to a very specific purpose or are not of adequate size to take advantage of state-of-the-art deep learning techniques (e.g., deep neural networks). In this paper, we present <i>VivesDebate</i>, a large, richly annotated and versatile professional debate corpus for computational argumentation research. The corpus has been created from 29 transcripts of a debate tournament in Catalan and has been machine-translated into Spanish and English. The annotation contains argumentative propositions, argumentative relations, debate interactions and professional evaluations of the arguments and argumentation. The presented corpus can be useful for research on a heterogeneous set of computational argumentation underlying tasks such as Argument Mining, Argument Analysis, Argument Evaluation or Argument Generation, among others. All this makes <i>VivesDebate</i> a valuable resource for computational argumentation research within the context of massive corpora aimed at Natural Language Processing tasks.
ISSN:2076-3417