Evaluating the predictability of medical conditions from social media posts.

We studied whether medical conditions across 21 broad categories were predictable from social media content across approximately 20 million words written by 999 consenting patients. Facebook language significantly improved upon the prediction accuracy of demographic variables for 18 of the 21 diseas...

Full description

Bibliographic Details
Main Authors: Raina M Merchant, David A Asch, Patrick Crutchley, Lyle H Ungar, Sharath C Guntuku, Johannes C Eichstaedt, Shawndra Hill, Kevin Padrez, Robert J Smith, H Andrew Schwartz
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0215476
_version_ 1818725059932454912
author Raina M Merchant
David A Asch
Patrick Crutchley
Lyle H Ungar
Sharath C Guntuku
Johannes C Eichstaedt
Shawndra Hill
Kevin Padrez
Robert J Smith
H Andrew Schwartz
author_facet Raina M Merchant
David A Asch
Patrick Crutchley
Lyle H Ungar
Sharath C Guntuku
Johannes C Eichstaedt
Shawndra Hill
Kevin Padrez
Robert J Smith
H Andrew Schwartz
author_sort Raina M Merchant
collection DOAJ
description We studied whether medical conditions across 21 broad categories were predictable from social media content across approximately 20 million words written by 999 consenting patients. Facebook language significantly improved upon the prediction accuracy of demographic variables for 18 of the 21 disease categories; it was particularly effective at predicting diabetes and mental health conditions including anxiety, depression and psychoses. Social media data are a quantifiable link into the otherwise elusive daily lives of patients, providing an avenue for study and assessment of behavioral and environmental disease risk factors. Analogous to the genome, social media data linked to medical diagnoses can be banked with patients' consent, and an encoding of social media language can be used as markers of disease risk, serve as a screening tool, and elucidate disease epidemiology. In what we believe to be the first report linking electronic medical record data with social media data from consenting patients, we identified that patients' Facebook status updates can predict many health conditions, suggesting opportunities to use social media data to determine disease onset or exacerbation and to conduct social media-based health interventions.
first_indexed 2024-12-17T21:36:18Z
format Article
id doaj.art-3d1528f4d9d342ab88d5b8f67adf08af
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-17T21:36:18Z
publishDate 2019-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-3d1528f4d9d342ab88d5b8f67adf08af2022-12-21T21:31:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01146e021547610.1371/journal.pone.0215476Evaluating the predictability of medical conditions from social media posts.Raina M MerchantDavid A AschPatrick CrutchleyLyle H UngarSharath C GuntukuJohannes C EichstaedtShawndra HillKevin PadrezRobert J SmithH Andrew SchwartzWe studied whether medical conditions across 21 broad categories were predictable from social media content across approximately 20 million words written by 999 consenting patients. Facebook language significantly improved upon the prediction accuracy of demographic variables for 18 of the 21 disease categories; it was particularly effective at predicting diabetes and mental health conditions including anxiety, depression and psychoses. Social media data are a quantifiable link into the otherwise elusive daily lives of patients, providing an avenue for study and assessment of behavioral and environmental disease risk factors. Analogous to the genome, social media data linked to medical diagnoses can be banked with patients' consent, and an encoding of social media language can be used as markers of disease risk, serve as a screening tool, and elucidate disease epidemiology. In what we believe to be the first report linking electronic medical record data with social media data from consenting patients, we identified that patients' Facebook status updates can predict many health conditions, suggesting opportunities to use social media data to determine disease onset or exacerbation and to conduct social media-based health interventions.https://doi.org/10.1371/journal.pone.0215476
spellingShingle Raina M Merchant
David A Asch
Patrick Crutchley
Lyle H Ungar
Sharath C Guntuku
Johannes C Eichstaedt
Shawndra Hill
Kevin Padrez
Robert J Smith
H Andrew Schwartz
Evaluating the predictability of medical conditions from social media posts.
PLoS ONE
title Evaluating the predictability of medical conditions from social media posts.
title_full Evaluating the predictability of medical conditions from social media posts.
title_fullStr Evaluating the predictability of medical conditions from social media posts.
title_full_unstemmed Evaluating the predictability of medical conditions from social media posts.
title_short Evaluating the predictability of medical conditions from social media posts.
title_sort evaluating the predictability of medical conditions from social media posts
url https://doi.org/10.1371/journal.pone.0215476
work_keys_str_mv AT rainammerchant evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT davidaasch evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT patrickcrutchley evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT lylehungar evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT sharathcguntuku evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT johannesceichstaedt evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT shawndrahill evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT kevinpadrez evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT robertjsmith evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts
AT handrewschwartz evaluatingthepredictabilityofmedicalconditionsfromsocialmediaposts