Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia

Abstract Background Language impairment is an important marker of neurodegenerative disorders. Despite this, there is no universal system of terminology used to describe these impairments and large inter-rater variability can exist between clinicians assessing language. The use of natural language p...

Full description

Bibliographic Details
Main Authors: Anthony Yeung, Andrea Iaboni, Elizabeth Rochon, Monica Lavoie, Calvin Santiago, Maria Yancheva, Jekaterina Novikova, Mengdan Xu, Jessica Robin, Liam D. Kaufman, Fariya Mostafa
Format: Article
Language:English
Published: BMC 2021-06-01
Series:Alzheimer’s Research & Therapy
Subjects:
Online Access:https://doi.org/10.1186/s13195-021-00848-x
_version_ 1818928881776721920
author Anthony Yeung
Andrea Iaboni
Elizabeth Rochon
Monica Lavoie
Calvin Santiago
Maria Yancheva
Jekaterina Novikova
Mengdan Xu
Jessica Robin
Liam D. Kaufman
Fariya Mostafa
author_facet Anthony Yeung
Andrea Iaboni
Elizabeth Rochon
Monica Lavoie
Calvin Santiago
Maria Yancheva
Jekaterina Novikova
Mengdan Xu
Jessica Robin
Liam D. Kaufman
Fariya Mostafa
author_sort Anthony Yeung
collection DOAJ
description Abstract Background Language impairment is an important marker of neurodegenerative disorders. Despite this, there is no universal system of terminology used to describe these impairments and large inter-rater variability can exist between clinicians assessing language. The use of natural language processing (NLP) and automated speech analysis (ASA) is emerging as a novel and potentially more objective method to assess language in individuals with mild cognitive impairment (MCI) and Alzheimer’s dementia (AD). No studies have analyzed how variables extracted through NLP and ASA might also be correlated to language impairments identified by a clinician. Methods Audio recordings (n=30) from participants with AD, MCI, and controls were rated by clinicians for word-finding difficulty, incoherence, perseveration, and errors in speech. Speech recordings were also transcribed, and linguistic and acoustic variables were extracted through NLP and ASA. Correlations between clinician-rated speech characteristics and the variables were compared using Spearman’s correlation. Exploratory factor analysis was applied to find common factors between variables for each speech characteristic. Results Clinician agreement was high in three of the four speech characteristics: word-finding difficulty (ICC = 0.92, p<0.001), incoherence (ICC = 0.91, p<0.001), and perseveration (ICC = 0.88, p<0.001). Word-finding difficulty and incoherence were useful constructs at distinguishing MCI and AD from controls, while perseveration and speech errors were less relevant. Word-finding difficulty as a construct was explained by three factors, including number and duration of pauses, word duration, and syntactic complexity. Incoherence was explained by two factors, including increased average word duration, use of past tense, and changes in age of acquisition, and more negative valence. Conclusions Variables extracted through automated acoustic and linguistic analysis of MCI and AD speech were significantly correlated with clinician ratings of speech and language characteristics. Our results suggest that correlating NLP and ASA with clinician observations is an objective and novel approach to measuring speech and language changes in neurodegenerative disorders.
first_indexed 2024-12-20T03:35:57Z
format Article
id doaj.art-6384e3a3b5984972a0d5d4dd2b5d6b7d
institution Directory Open Access Journal
issn 1758-9193
language English
last_indexed 2024-12-20T03:35:57Z
publishDate 2021-06-01
publisher BMC
record_format Article
series Alzheimer’s Research & Therapy
spelling doaj.art-6384e3a3b5984972a0d5d4dd2b5d6b7d2022-12-21T19:54:52ZengBMCAlzheimer’s Research & Therapy1758-91932021-06-0113111010.1186/s13195-021-00848-xCorrelating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementiaAnthony Yeung0Andrea Iaboni1Elizabeth Rochon2Monica Lavoie3Calvin Santiago4Maria Yancheva5Jekaterina Novikova6Mengdan Xu7Jessica Robin8Liam D. Kaufman9Fariya Mostafa10Department of Psychiatry, University of TorontoDepartment of Psychiatry, University of TorontoKITE Research Institute, Toronto Rehab, University Health NetworkDepartment of Speech-Language Pathology and Rehabilitation Sciences Institute, University of TorontoDivision of Neurology, Department of Medicine, University of TorontoWinterlight LabsWinterlight LabsWinterlight LabsWinterlight LabsWinterlight LabsWinterlight LabsAbstract Background Language impairment is an important marker of neurodegenerative disorders. Despite this, there is no universal system of terminology used to describe these impairments and large inter-rater variability can exist between clinicians assessing language. The use of natural language processing (NLP) and automated speech analysis (ASA) is emerging as a novel and potentially more objective method to assess language in individuals with mild cognitive impairment (MCI) and Alzheimer’s dementia (AD). No studies have analyzed how variables extracted through NLP and ASA might also be correlated to language impairments identified by a clinician. Methods Audio recordings (n=30) from participants with AD, MCI, and controls were rated by clinicians for word-finding difficulty, incoherence, perseveration, and errors in speech. Speech recordings were also transcribed, and linguistic and acoustic variables were extracted through NLP and ASA. Correlations between clinician-rated speech characteristics and the variables were compared using Spearman’s correlation. Exploratory factor analysis was applied to find common factors between variables for each speech characteristic. Results Clinician agreement was high in three of the four speech characteristics: word-finding difficulty (ICC = 0.92, p<0.001), incoherence (ICC = 0.91, p<0.001), and perseveration (ICC = 0.88, p<0.001). Word-finding difficulty and incoherence were useful constructs at distinguishing MCI and AD from controls, while perseveration and speech errors were less relevant. Word-finding difficulty as a construct was explained by three factors, including number and duration of pauses, word duration, and syntactic complexity. Incoherence was explained by two factors, including increased average word duration, use of past tense, and changes in age of acquisition, and more negative valence. Conclusions Variables extracted through automated acoustic and linguistic analysis of MCI and AD speech were significantly correlated with clinician ratings of speech and language characteristics. Our results suggest that correlating NLP and ASA with clinician observations is an objective and novel approach to measuring speech and language changes in neurodegenerative disorders.https://doi.org/10.1186/s13195-021-00848-xNatural language processingAutomated speech analysisMarkersMachine learningAlzheimer’sDementia
spellingShingle Anthony Yeung
Andrea Iaboni
Elizabeth Rochon
Monica Lavoie
Calvin Santiago
Maria Yancheva
Jekaterina Novikova
Mengdan Xu
Jessica Robin
Liam D. Kaufman
Fariya Mostafa
Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
Alzheimer’s Research & Therapy
Natural language processing
Automated speech analysis
Markers
Machine learning
Alzheimer’s
Dementia
title Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
title_full Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
title_fullStr Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
title_full_unstemmed Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
title_short Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia
title_sort correlating natural language processing and automated speech analysis with clinician assessment to quantify speech language changes in mild cognitive impairment and alzheimer s dementia
topic Natural language processing
Automated speech analysis
Markers
Machine learning
Alzheimer’s
Dementia
url https://doi.org/10.1186/s13195-021-00848-x
work_keys_str_mv AT anthonyyeung correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT andreaiaboni correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT elizabethrochon correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT monicalavoie correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT calvinsantiago correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT mariayancheva correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT jekaterinanovikova correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT mengdanxu correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT jessicarobin correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT liamdkaufman correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia
AT fariyamostafa correlatingnaturallanguageprocessingandautomatedspeechanalysiswithclinicianassessmenttoquantifyspeechlanguagechangesinmildcognitiveimpairmentandalzheimersdementia