From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models

Passively collected behavioral health data from ubiquitous sensors could provide mental health professionals valuable insights into patients' daily lives, but such efforts are impeded by disparate metrics, lack of interoperability, and unclear correlations between the measured signals and an in...

Bibliographic Details
Main Authors:	Englhardt, Zachary; Ma, Chengqian; Morris, Margaret E.; Chang, Chun-Cheng; Xu, Xuhai "Orson"; Qin, Lianhui; McDuff, Daniel; Liu, Xin; Patel, Shwetak; Iyer, Vikram
Format:	Article
Language:	English
Published:	Association for Computing Machinery 2024
Online Access:	https://hdl.handle.net/1721.1/155207

_version_	1811088485004607488
author	Englhardt, Zachary; Ma, Chengqian; Morris, Margaret E.; Chang, Chun-Cheng; Xu, Xuhai "Orson"; Qin, Lianhui; McDuff, Daniel; Liu, Xin; Patel, Shwetak; Iyer, Vikram
author_sort	Englhardt, Zachary
collection	MIT
description	Passively collected behavioral health data from ubiquitous sensors could provide mental health professionals valuable insights into patients' daily lives, but such efforts are impeded by disparate metrics, lack of interoperability, and unclear correlations between the measured signals and an individual's mental health. To address these challenges, we pioneer the exploration of large language models (LLMs) to synthesize clinically relevant insights from multi-sensor data. We develop chain-of-thought prompting methods to generate LLM reasoning on how data pertaining to activity, sleep, and social interaction relate to conditions such as depression and anxiety. We then prompt the LLM to perform binary classification, achieving an accuracy of 61.1%, which exceeds the state of the art. We find that models like GPT-4 correctly reference numerical data 75% of the time. While we began our investigation by developing methods to use LLMs to output binary classifications for conditions like depression, we find instead that their greatest potential value to clinicians lies not in diagnostic classification, but rather in rigorous analysis of diverse self-tracking data to generate natural language summaries that synthesize multiple data streams and identify potential concerns. Clinicians envisioned using these insights in a variety of ways, principally for fostering collaborative investigation with patients to strengthen the therapeutic alliance and guide treatment. We describe this collaborative engagement, additional envisioned uses, and associated concerns that must be addressed before adoption in real-world contexts.
first_indexed	2024-09-23T14:02:54Z
format	Article
id	mit-1721.1/155207
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T14:02:54Z
publishDate	2024
publisher	Association for Computing Machinery
record_format	dspace
spelling	mit-1721.1/155207 2024-09-20T04:30:30Z
dates	2024-06-06T16:40:01Z 2024-06-06T16:40:01Z 2024-05-13 2024-06-01T07:58:35Z
type	Article http://purl.org/eprint/type/JournalArticle
issn	2474-9567
citation	Englhardt, Zachary, Ma, Chengqian, Morris, Margaret E., Chang, Chun-Cheng, Xu, Xuhai "Orson" et al. 2024. "From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models." Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 8 (2).
doi	10.1145/3659604
journal	Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
rights	PUBLISHER_CC Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/ The author(s)
file_format	application/pdf
title	From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models
url	https://hdl.handle.net/1721.1/155207

Similar Items