A Corpus-Based Study of Linguistic Deception in Spanish

Bibliographic Details
Main Author:	Ángela Almela
Format:	Article
Language:	English
Published:	MDPI AG 2021-09-01
Series:	Applied Sciences
Subjects:	text classification; linguistic corpus; deception; linguistic cues; statistical analysis; discriminant function analysis
Online Access:	https://www.mdpi.com/2076-3417/11/19/8817

Author Affiliation:	School of Arts, Universidad de Murcia, 30001 Murcia, Spain
ISSN:	2076-3417
DOI:	10.3390/app11198817
Volume/Issue:	11(19), article 8817
Collection:	DOAJ (Directory of Open Access Journals)
Record ID:	doaj.art-ac7cc2d2ffec43959586c433d2900ceb
Description:	In the last decade, fields such as psychology and natural language processing have devoted considerable attention to the automation of deception detection, developing and employing a wide array of automated and computer-assisted methods for this purpose. Similarly, another emerging research area is focusing on computer-assisted deception detection using linguistics, with promising results. Accordingly, in the present article, the reader is first provided with an overall review of the state of the art of corpus-based research exploring linguistic cues to deception, as well as an overview of several approaches to the study of deception and of previous research into its linguistic detection. In an effort to promote corpus-based research in this context, this study explores linguistic cues to deception in the Spanish written language with the aid of an automatic text classification tool, by means of an ad hoc corpus containing ground truth data. Interestingly, the key findings reveal that, although there is a set of linguistic cues which contributes to the global statistical classification model, there are some discursive differences across the subcorpora, yielding better classification results in the analysis conducted on the subcorpus containing emotionally loaded language.
