Using generative AI to investigate medical imagery models and datasetsResearch in context

Summary: Background: AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI-based models, especially in domains where A...

Full description

Bibliographic Details
Main Authors: Oran Lang, Doron Yaya-Stupp, Ilana Traynis, Heather Cole-Lewis, Chloe R. Bennett, Courtney R. Lyles, Charles Lau, Michal Irani, Christopher Semturs, Dale R. Webster, Greg S. Corrado, Avinatan Hassidim, Yossi Matias, Yun Liu, Naama Hammel, Boris Babenko
Format: Article
Language:English
Published: Elsevier 2024-04-01
Series:EBioMedicine
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352396424001105
_version_ 1797214887994195968
author Oran Lang
Doron Yaya-Stupp
Ilana Traynis
Heather Cole-Lewis
Chloe R. Bennett
Courtney R. Lyles
Charles Lau
Michal Irani
Christopher Semturs
Dale R. Webster
Greg S. Corrado
Avinatan Hassidim
Yossi Matias
Yun Liu
Naama Hammel
Boris Babenko
author_facet Oran Lang
Doron Yaya-Stupp
Ilana Traynis
Heather Cole-Lewis
Chloe R. Bennett
Courtney R. Lyles
Charles Lau
Michal Irani
Christopher Semturs
Dale R. Webster
Greg S. Corrado
Avinatan Hassidim
Yossi Matias
Yun Liu
Naama Hammel
Boris Babenko
author_sort Oran Lang
collection DOAJ
description Summary: Background: AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI-based models, especially in domains where AI prediction capabilities surpass those of humans. Moreover, such explanations could enable novel scientific discovery by uncovering signals in the data that aren’t yet known to experts. Methods: In this paper, we present a workflow for generating hypotheses to understand which visual signals in images are correlated with a classification model’s predictions for a given task. This approach leverages an automatic visual explanation algorithm followed by interdisciplinary expert review. We propose the following 4 steps: (i) Train a classifier to perform a given task to assess whether the imagery indeed contains signals relevant to the task; (ii) Train a StyleGAN-based image generator with an architecture that enables guidance by the classifier (“StylEx”); (iii) Automatically detect, extract, and visualize the top visual attributes that the classifier is sensitive towards. For visualization, we independently modify each of these attributes to generate counterfactual visualizations for a set of images (i.e., what the image would look like with the attribute increased or decreased); (iv) Formulate hypotheses for the underlying mechanisms, to stimulate future research. Specifically, present the discovered attributes and corresponding counterfactual visualizations to an interdisciplinary panel of experts so that hypotheses can account for social and structural determinants of health (e.g., whether the attributes correspond to known patho-physiological or socio-cultural phenomena, or could be novel discoveries). Findings: To demonstrate the broad applicability of our approach, we present results on eight prediction tasks across three medical imaging modalities—retinal fundus photographs, external eye photographs, and chest radiographs. We showcase examples where many of the automatically-learned attributes clearly capture clinically known features (e.g., types of cataract, enlarged heart), and demonstrate automatically-learned confounders that arise from factors beyond physiological mechanisms (e.g., chest X-ray underexposure is correlated with the classifier predicting abnormality, and eye makeup is correlated with the classifier predicting low hemoglobin levels). We further show that our method reveals a number of physiologically plausible, previously-unknown attributes based on the literature (e.g., differences in the fundus associated with self-reported sex, which were previously unknown). Interpretation: Our approach enables hypotheses generation via attribute visualizations and has the potential to enable researchers to better understand, improve their assessment, and extract new knowledge from AI-based models, as well as debug and design better datasets. Though not designed to infer causality, importantly, we highlight that attributes generated by our framework can capture phenomena beyond physiology or pathophysiology, reflecting the real world nature of healthcare delivery and socio-cultural factors, and hence interdisciplinary perspectives are critical in these investigations. Finally, we will release code to help researchers train their own StylEx models and analyze their predictive tasks of interest, and use the methodology presented in this paper for responsible interpretation of the revealed attributes. Funding: Google.
first_indexed 2024-04-24T11:21:19Z
format Article
id doaj.art-500f978dbec44cde8b3efb2076472b5c
institution Directory Open Access Journal
issn 2352-3964
language English
last_indexed 2024-04-24T11:21:19Z
publishDate 2024-04-01
publisher Elsevier
record_format Article
series EBioMedicine
spelling doaj.art-500f978dbec44cde8b3efb2076472b5c2024-04-11T04:41:29ZengElsevierEBioMedicine2352-39642024-04-01102105075Using generative AI to investigate medical imagery models and datasetsResearch in contextOran Lang0Doron Yaya-Stupp1Ilana Traynis2Heather Cole-Lewis3Chloe R. Bennett4Courtney R. Lyles5Charles Lau6Michal Irani7Christopher Semturs8Dale R. Webster9Greg S. Corrado10Avinatan Hassidim11Yossi Matias12Yun Liu13Naama Hammel14Boris Babenko15Google, Mountain View, CA, USAGoogle, Mountain View, CA, USAWork Done at Google Via Advanced Clinical, Deerfield, IL, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USA; Work Done at Google Via Advanced Clinical, Deerfield, IL, USA; Work Done at Google via Pro Unlimited, Folsom, CA, USA; University of California San Francisco, Department of Medicine, San Francisco, CA, USA; Weizmann Institute of Science, IsraelGoogle, Mountain View, CA, USA; University of California San Francisco, Department of Medicine, San Francisco, CA, USAGoogle, Mountain View, CA, USAWeizmann Institute of Science, IsraelGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USAGoogle, Mountain View, CA, USA; Corresponding author.Summary: Background: AI models have shown promise in performing many medical imaging tasks. However, our ability to explain what signals these models have learned is severely lacking. Explanations are needed in order to increase the trust of doctors in AI-based models, especially in domains where AI prediction capabilities surpass those of humans. Moreover, such explanations could enable novel scientific discovery by uncovering signals in the data that aren’t yet known to experts. Methods: In this paper, we present a workflow for generating hypotheses to understand which visual signals in images are correlated with a classification model’s predictions for a given task. This approach leverages an automatic visual explanation algorithm followed by interdisciplinary expert review. We propose the following 4 steps: (i) Train a classifier to perform a given task to assess whether the imagery indeed contains signals relevant to the task; (ii) Train a StyleGAN-based image generator with an architecture that enables guidance by the classifier (“StylEx”); (iii) Automatically detect, extract, and visualize the top visual attributes that the classifier is sensitive towards. For visualization, we independently modify each of these attributes to generate counterfactual visualizations for a set of images (i.e., what the image would look like with the attribute increased or decreased); (iv) Formulate hypotheses for the underlying mechanisms, to stimulate future research. Specifically, present the discovered attributes and corresponding counterfactual visualizations to an interdisciplinary panel of experts so that hypotheses can account for social and structural determinants of health (e.g., whether the attributes correspond to known patho-physiological or socio-cultural phenomena, or could be novel discoveries). Findings: To demonstrate the broad applicability of our approach, we present results on eight prediction tasks across three medical imaging modalities—retinal fundus photographs, external eye photographs, and chest radiographs. We showcase examples where many of the automatically-learned attributes clearly capture clinically known features (e.g., types of cataract, enlarged heart), and demonstrate automatically-learned confounders that arise from factors beyond physiological mechanisms (e.g., chest X-ray underexposure is correlated with the classifier predicting abnormality, and eye makeup is correlated with the classifier predicting low hemoglobin levels). We further show that our method reveals a number of physiologically plausible, previously-unknown attributes based on the literature (e.g., differences in the fundus associated with self-reported sex, which were previously unknown). Interpretation: Our approach enables hypotheses generation via attribute visualizations and has the potential to enable researchers to better understand, improve their assessment, and extract new knowledge from AI-based models, as well as debug and design better datasets. Though not designed to infer causality, importantly, we highlight that attributes generated by our framework can capture phenomena beyond physiology or pathophysiology, reflecting the real world nature of healthcare delivery and socio-cultural factors, and hence interdisciplinary perspectives are critical in these investigations. Finally, we will release code to help researchers train their own StylEx models and analyze their predictive tasks of interest, and use the methodology presented in this paper for responsible interpretation of the revealed attributes. Funding: Google.http://www.sciencedirect.com/science/article/pii/S2352396424001105Artificial intelligenceMedical imageryExplainabilityInterpretabilityDeep learningGenerative AI
spellingShingle Oran Lang
Doron Yaya-Stupp
Ilana Traynis
Heather Cole-Lewis
Chloe R. Bennett
Courtney R. Lyles
Charles Lau
Michal Irani
Christopher Semturs
Dale R. Webster
Greg S. Corrado
Avinatan Hassidim
Yossi Matias
Yun Liu
Naama Hammel
Boris Babenko
Using generative AI to investigate medical imagery models and datasetsResearch in context
EBioMedicine
Artificial intelligence
Medical imagery
Explainability
Interpretability
Deep learning
Generative AI
title Using generative AI to investigate medical imagery models and datasetsResearch in context
title_full Using generative AI to investigate medical imagery models and datasetsResearch in context
title_fullStr Using generative AI to investigate medical imagery models and datasetsResearch in context
title_full_unstemmed Using generative AI to investigate medical imagery models and datasetsResearch in context
title_short Using generative AI to investigate medical imagery models and datasetsResearch in context
title_sort using generative ai to investigate medical imagery models and datasetsresearch in context
topic Artificial intelligence
Medical imagery
Explainability
Interpretability
Deep learning
Generative AI
url http://www.sciencedirect.com/science/article/pii/S2352396424001105
work_keys_str_mv AT oranlang usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT doronyayastupp usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT ilanatraynis usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT heathercolelewis usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT chloerbennett usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT courtneyrlyles usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT charleslau usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT michalirani usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT christophersemturs usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT dalerwebster usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT gregscorrado usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT avinatanhassidim usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT yossimatias usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT yunliu usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT naamahammel usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext
AT borisbabenko usinggenerativeaitoinvestigatemedicalimagerymodelsanddatasetsresearchincontext