A Corpus-Based Study of Linguistic Deception in Spanish
In the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emergin...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-09-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/11/19/8817 |
_version_ | 1797516872943403008 |
---|---|
author | Ángela Almela |
author_facet | Ángela Almela |
author_sort | Ángela Almela |
collection | DOAJ |
description | In the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emerging research area is focusing on computer-assisted deception detection using linguistics, with promising results. Accordingly, in the present article, the reader is firstly provided with an overall review of the state of the art of corpus-based research exploring linguistic cues to deception as well as an overview on several approaches to the study of deception and on previous research into its linguistic detection. In an effort to promote corpus-based research in this context, this study explores linguistic cues to deception in the Spanish written language with the aid of an automatic text classification tool, by means of an ad hoc corpus containing ground truth data. Interestingly, the key findings reveal that, although there is a set of linguistic cues which contributes to the global statistical classification model, there are some discursive differences across the subcorpora, yielding better classification results on the analysis conducted on the subcorpus containing emotionally loaded language. |
first_indexed | 2024-03-10T07:06:58Z |
format | Article |
id | doaj.art-ac7cc2d2ffec43959586c433d2900ceb |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-10T07:06:58Z |
publishDate | 2021-09-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-ac7cc2d2ffec43959586c433d2900ceb2023-11-22T15:43:26ZengMDPI AGApplied Sciences2076-34172021-09-011119881710.3390/app11198817A Corpus-Based Study of Linguistic Deception in SpanishÁngela Almela0School of Arts, Universidad de Murcia, 30001 Murcia, SpainIn the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automatization of the process of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emerging research area is focusing on computer-assisted deception detection using linguistics, with promising results. Accordingly, in the present article, the reader is firstly provided with an overall review of the state of the art of corpus-based research exploring linguistic cues to deception as well as an overview on several approaches to the study of deception and on previous research into its linguistic detection. In an effort to promote corpus-based research in this context, this study explores linguistic cues to deception in the Spanish written language with the aid of an automatic text classification tool, by means of an ad hoc corpus containing ground truth data. Interestingly, the key findings reveal that, although there is a set of linguistic cues which contributes to the global statistical classification model, there are some discursive differences across the subcorpora, yielding better classification results on the analysis conducted on the subcorpus containing emotionally loaded language.https://www.mdpi.com/2076-3417/11/19/8817text classificationlinguistic corpusdeceptionlinguistic cuesstatistical analysisdiscriminant function analysis |
spellingShingle | Ángela Almela A Corpus-Based Study of Linguistic Deception in Spanish Applied Sciences text classification linguistic corpus deception linguistic cues statistical analysis discriminant function analysis |
title | A Corpus-Based Study of Linguistic Deception in Spanish |
title_full | A Corpus-Based Study of Linguistic Deception in Spanish |
title_fullStr | A Corpus-Based Study of Linguistic Deception in Spanish |
title_full_unstemmed | A Corpus-Based Study of Linguistic Deception in Spanish |
title_short | A Corpus-Based Study of Linguistic Deception in Spanish |
title_sort | corpus based study of linguistic deception in spanish |
topic | text classification linguistic corpus deception linguistic cues statistical analysis discriminant function analysis |
url | https://www.mdpi.com/2076-3417/11/19/8817 |
work_keys_str_mv | AT angelaalmela acorpusbasedstudyoflinguisticdeceptioninspanish AT angelaalmela corpusbasedstudyoflinguisticdeceptioninspanish |