Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images

Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augm...

Full description

Bibliographic Details
Main Authors:	Alexander Partin, Thomas Brettin, Yitan Zhu, James M. Dolezal, Sara Kochanny, Alexander T. Pearson, Maulik Shukla, Yvonne A. Evrard, James H. Doroshow, Rick L. Stevens
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2023-03-01
Series:	Frontiers in Medicine
Subjects:	drug response prediction histology whole-slide images gene expression multimodal deep learning preclinical drug studies data augmentation
Online Access:	https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/full

_version_	1811159012112072704
author	Alexander Partin Thomas Brettin Yitan Zhu James M. Dolezal Sara Kochanny Alexander T. Pearson Maulik Shukla Yvonne A. Evrard James H. Doroshow Rick L. Stevens Rick L. Stevens
author_facet	Alexander Partin Thomas Brettin Yitan Zhu James M. Dolezal Sara Kochanny Alexander T. Pearson Maulik Shukla Yvonne A. Evrard James H. Doroshow Rick L. Stevens Rick L. Stevens
author_sort	Alexander Partin
collection	DOAJ
description	Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augmentation for DRP in PDXs. The MM-Net learns to predict response using drug descriptors, gene expressions (GE), and histology whole-slide images (WSIs). We explore whether combining WSIs with GE improves predictions as compared with models that use GE alone. We propose two data augmentation methods which allow us training multimodal and unimodal NNs without changing architectures with a single larger dataset: 1) combine single-drug and drug-pair treatments by homogenizing drug representations, and 2) augment drug-pairs which doubles the sample size of all drug-pair samples. Unimodal NNs which use GE are compared to assess the contribution of data augmentation. The NN that uses the original and the augmented drug-pair treatments as well as single-drug treatments outperforms NNs that ignore either the augmented drug-pairs or the single-drug treatments. In assessing the multimodal learning based on the MCC metric, MM-Net outperforms all the baselines. Our results show that data augmentation and integration of histology images with GE can improve prediction performance of drug response in PDXs.
first_indexed	2024-04-10T05:33:57Z
format	Article
id	doaj.art-e0d0f63467c2490f8c2c2db626e08d8c
institution	Directory Open Access Journal
issn	2296-858X
language	English
last_indexed	2024-04-10T05:33:57Z
publishDate	2023-03-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Medicine
spelling	doaj.art-e0d0f63467c2490f8c2c2db626e08d8c2023-03-07T04:44:10ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2023-03-011010.3389/fmed.2023.10589191058919Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology imagesAlexander Partin0Thomas Brettin1Yitan Zhu2James M. Dolezal3Sara Kochanny4Alexander T. Pearson5Maulik Shukla6Yvonne A. Evrard7James H. Doroshow8Rick L. Stevens9Rick L. Stevens10Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesFrederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., Frederick, MD, United StatesDivision of Cancer Therapeutics and Diagnosis, National Cancer Institute, Bethesda, MD, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDepartment of Computer Science, The University of Chicago, Chicago, IL, United StatesPatient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augmentation for DRP in PDXs. The MM-Net learns to predict response using drug descriptors, gene expressions (GE), and histology whole-slide images (WSIs). We explore whether combining WSIs with GE improves predictions as compared with models that use GE alone. We propose two data augmentation methods which allow us training multimodal and unimodal NNs without changing architectures with a single larger dataset: 1) combine single-drug and drug-pair treatments by homogenizing drug representations, and 2) augment drug-pairs which doubles the sample size of all drug-pair samples. Unimodal NNs which use GE are compared to assess the contribution of data augmentation. The NN that uses the original and the augmented drug-pair treatments as well as single-drug treatments outperforms NNs that ignore either the augmented drug-pairs or the single-drug treatments. In assessing the multimodal learning based on the MCC metric, MM-Net outperforms all the baselines. Our results show that data augmentation and integration of histology images with GE can improve prediction performance of drug response in PDXs.https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/fulldrug response predictionhistology whole-slide imagesgene expressionmultimodal deep learningpreclinical drug studiesdata augmentation
spellingShingle	Alexander Partin Thomas Brettin Yitan Zhu James M. Dolezal Sara Kochanny Alexander T. Pearson Maulik Shukla Yvonne A. Evrard James H. Doroshow Rick L. Stevens Rick L. Stevens Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images Frontiers in Medicine drug response prediction histology whole-slide images gene expression multimodal deep learning preclinical drug studies data augmentation
title	Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_full	Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_fullStr	Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_full_unstemmed	Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_short	Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_sort	data augmentation and multimodal learning for predicting drug response in patient derived xenografts from gene expressions and histology images
topic	drug response prediction histology whole-slide images gene expression multimodal deep learning preclinical drug studies data augmentation
url	https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/full
work_keys_str_mv	AT alexanderpartin dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT thomasbrettin dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT yitanzhu dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT jamesmdolezal dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT sarakochanny dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT alexandertpearson dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT maulikshukla dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT yvonneaevrard dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT jameshdoroshow dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT ricklstevens dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages AT ricklstevens dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages

Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images

Similar Items