A Corpus-Based Study of Linguistic Deception in Spanish

In the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emergin...

Full description

Bibliographic Details
Main Author: Ángela Almela
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/11/19/8817
_version_ 1797516872943403008
author Ángela Almela
author_facet Ángela Almela
author_sort Ángela Almela
collection DOAJ
description In the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emerging research area is focusing on computer-assisted deception detection using linguistics, with promising results. Accordingly, in the present article, the reader is firstly provided with an overall review of the state of the art of corpus-based research exploring linguistic cues to deception as well as an overview on several approaches to the study of deception and on previous research into its linguistic detection. In an effort to promote corpus-based research in this context, this study explores linguistic cues to deception in the Spanish written language with the aid of an automatic text classification tool, by means of an ad hoc corpus containing ground truth data. Interestingly, the key findings reveal that, although there is a set of linguistic cues which contributes to the global statistical classification model, there are some discursive differences across the subcorpora, yielding better classification results on the analysis conducted on the subcorpus containing emotionally loaded language.
first_indexed 2024-03-10T07:06:58Z
format Article
id doaj.art-ac7cc2d2ffec43959586c433d2900ceb
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-10T07:06:58Z
publishDate 2021-09-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-ac7cc2d2ffec43959586c433d2900ceb2023-11-22T15:43:26ZengMDPI AGApplied Sciences2076-34172021-09-011119881710.3390/app11198817A Corpus-Based Study of Linguistic Deception in SpanishÁngela Almela0School of Arts, Universidad de Murcia, 30001 Murcia, SpainIn the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emerging research area is focusing on computer-assisted deception detection using linguistics, with promising results. Accordingly, in the present article, the reader is firstly provided with an overall review of the state of the art of corpus-based research exploring linguistic cues to deception as well as an overview on several approaches to the study of deception and on previous research into its linguistic detection. In an effort to promote corpus-based research in this context, this study explores linguistic cues to deception in the Spanish written language with the aid of an automatic text classification tool, by means of an ad hoc corpus containing ground truth data. Interestingly, the key findings reveal that, although there is a set of linguistic cues which contributes to the global statistical classification model, there are some discursive differences across the subcorpora, yielding better classification results on the analysis conducted on the subcorpus containing emotionally loaded language.https://www.mdpi.com/2076-3417/11/19/8817text classificationlinguistic corpusdeceptionlinguistic cuesstatistical analysisdiscriminant function analysis
spellingShingle Ángela Almela
A Corpus-Based Study of Linguistic Deception in Spanish
Applied Sciences
text classification
linguistic corpus
deception
linguistic cues
statistical analysis
discriminant function analysis
title A Corpus-Based Study of Linguistic Deception in Spanish
title_full A Corpus-Based Study of Linguistic Deception in Spanish
title_fullStr A Corpus-Based Study of Linguistic Deception in Spanish
title_full_unstemmed A Corpus-Based Study of Linguistic Deception in Spanish
title_short A Corpus-Based Study of Linguistic Deception in Spanish
title_sort corpus based study of linguistic deception in spanish
topic text classification
linguistic corpus
deception
linguistic cues
statistical analysis
discriminant function analysis
url https://www.mdpi.com/2076-3417/11/19/8817
work_keys_str_mv AT angelaalmela acorpusbasedstudyoflinguisticdeceptioninspanish
AT angelaalmela corpusbasedstudyoflinguisticdeceptioninspanish