Early dementia detection with speech analysis and machine learning techniques

Abstract This in-depth study journey explores the context of natural language processing and text analysis in dementia detection, revealing their importance in a variety of fields. Beginning with an examination of the widespread and influence of text data. The dataset utilised in this study is from...

Full description

Bibliographic Details
Main Authors: Zerin Jahan, Surbhi Bhatia Khan, Mo Saraee
Format: Article
Language:English
Published: Springer 2024-04-01
Series:Discover Sustainability
Subjects:
Online Access:https://doi.org/10.1007/s43621-024-00217-2
_version_ 1797209574040666112
author Zerin Jahan
Surbhi Bhatia Khan
Mo Saraee
author_facet Zerin Jahan
Surbhi Bhatia Khan
Mo Saraee
author_sort Zerin Jahan
collection DOAJ
description Abstract This in-depth study journey explores the context of natural language processing and text analysis in dementia detection, revealing their importance in a variety of fields. Beginning with an examination of the widespread and influence of text data. The dataset utilised in this study is from TalkBank's DementiaBank, which is basically a vast database of multimedia interactions built with the goal of examining communication patterns in the context of dementia. The various communication styles dementia patients exhibit when communicating with others are seen from a unique perspective by this specific dataset. Thorough data preprocessing procedures, including cleansing, tokenization, and structuring, are undertaken, with a focus on improving prediction capabilities through the combination of textual and non-textual information in the field of feature engineering. In the subsequent phase, the precision, recall, and F1-score metrics of Support Vector Machines (SVM), K-Nearest Neighbours (KNN), Random Forest, and Artificial Neural Networks (ANN) are assessed. Empirical facts are synthesized using text analysis methods and models to formulate a coherent conclusion. The significance of text data analysis, the revolutionary potential of natural language processing, and the direction for future research are highlighted in this synthesis. Throughout this paper, readers are encouraged to leverage text data to embark on their own adventures in the evolving, data-centric world of dementia detection.
first_indexed 2024-04-24T09:56:51Z
format Article
id doaj.art-97f665c80d114d7f91d99e2d03e7d940
institution Directory Open Access Journal
issn 2662-9984
language English
last_indexed 2024-04-24T09:56:51Z
publishDate 2024-04-01
publisher Springer
record_format Article
series Discover Sustainability
spelling doaj.art-97f665c80d114d7f91d99e2d03e7d9402024-04-14T11:06:03ZengSpringerDiscover Sustainability2662-99842024-04-015111810.1007/s43621-024-00217-2Early dementia detection with speech analysis and machine learning techniquesZerin Jahan0Surbhi Bhatia Khan1Mo Saraee2School of Science, Engineering and Environment, University of SalfordSchool of Science, Engineering and Environment, University of SalfordSchool of Science, Engineering and Environment, University of SalfordAbstract This in-depth study journey explores the context of natural language processing and text analysis in dementia detection, revealing their importance in a variety of fields. Beginning with an examination of the widespread and influence of text data. The dataset utilised in this study is from TalkBank's DementiaBank, which is basically a vast database of multimedia interactions built with the goal of examining communication patterns in the context of dementia. The various communication styles dementia patients exhibit when communicating with others are seen from a unique perspective by this specific dataset. Thorough data preprocessing procedures, including cleansing, tokenization, and structuring, are undertaken, with a focus on improving prediction capabilities through the combination of textual and non-textual information in the field of feature engineering. In the subsequent phase, the precision, recall, and F1-score metrics of Support Vector Machines (SVM), K-Nearest Neighbours (KNN), Random Forest, and Artificial Neural Networks (ANN) are assessed. Empirical facts are synthesized using text analysis methods and models to formulate a coherent conclusion. The significance of text data analysis, the revolutionary potential of natural language processing, and the direction for future research are highlighted in this synthesis. Throughout this paper, readers are encouraged to leverage text data to embark on their own adventures in the evolving, data-centric world of dementia detection.https://doi.org/10.1007/s43621-024-00217-2DementiaSpeech transcript analysisText miningFeature extractionWord embeddingMachine learning
spellingShingle Zerin Jahan
Surbhi Bhatia Khan
Mo Saraee
Early dementia detection with speech analysis and machine learning techniques
Discover Sustainability
Dementia
Speech transcript analysis
Text mining
Feature extraction
Word embedding
Machine learning
title Early dementia detection with speech analysis and machine learning techniques
title_full Early dementia detection with speech analysis and machine learning techniques
title_fullStr Early dementia detection with speech analysis and machine learning techniques
title_full_unstemmed Early dementia detection with speech analysis and machine learning techniques
title_short Early dementia detection with speech analysis and machine learning techniques
title_sort early dementia detection with speech analysis and machine learning techniques
topic Dementia
Speech transcript analysis
Text mining
Feature extraction
Word embedding
Machine learning
url https://doi.org/10.1007/s43621-024-00217-2
work_keys_str_mv AT zerinjahan earlydementiadetectionwithspeechanalysisandmachinelearningtechniques
AT surbhibhatiakhan earlydementiadetectionwithspeechanalysisandmachinelearningtechniques
AT mosaraee earlydementiadetectionwithspeechanalysisandmachinelearningtechniques