Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images

Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augm...

Full description

Bibliographic Details
Main Authors: Alexander Partin, Thomas Brettin, Yitan Zhu, James M. Dolezal, Sara Kochanny, Alexander T. Pearson, Maulik Shukla, Yvonne A. Evrard, James H. Doroshow, Rick L. Stevens
Format: Article
Language:English
Published: Frontiers Media S.A. 2023-03-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/full
_version_ 1811159012112072704
author Alexander Partin
Thomas Brettin
Yitan Zhu
James M. Dolezal
Sara Kochanny
Alexander T. Pearson
Maulik Shukla
Yvonne A. Evrard
James H. Doroshow
Rick L. Stevens
Rick L. Stevens
author_facet Alexander Partin
Thomas Brettin
Yitan Zhu
James M. Dolezal
Sara Kochanny
Alexander T. Pearson
Maulik Shukla
Yvonne A. Evrard
James H. Doroshow
Rick L. Stevens
Rick L. Stevens
author_sort Alexander Partin
collection DOAJ
description Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augmentation for DRP in PDXs. The MM-Net learns to predict response using drug descriptors, gene expressions (GE), and histology whole-slide images (WSIs). We explore whether combining WSIs with GE improves predictions as compared with models that use GE alone. We propose two data augmentation methods which allow us training multimodal and unimodal NNs without changing architectures with a single larger dataset: 1) combine single-drug and drug-pair treatments by homogenizing drug representations, and 2) augment drug-pairs which doubles the sample size of all drug-pair samples. Unimodal NNs which use GE are compared to assess the contribution of data augmentation. The NN that uses the original and the augmented drug-pair treatments as well as single-drug treatments outperforms NNs that ignore either the augmented drug-pairs or the single-drug treatments. In assessing the multimodal learning based on the MCC metric, MM-Net outperforms all the baselines. Our results show that data augmentation and integration of histology images with GE can improve prediction performance of drug response in PDXs.
first_indexed 2024-04-10T05:33:57Z
format Article
id doaj.art-e0d0f63467c2490f8c2c2db626e08d8c
institution Directory Open Access Journal
issn 2296-858X
language English
last_indexed 2024-04-10T05:33:57Z
publishDate 2023-03-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Medicine
spelling doaj.art-e0d0f63467c2490f8c2c2db626e08d8c2023-03-07T04:44:10ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2023-03-011010.3389/fmed.2023.10589191058919Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology imagesAlexander Partin0Thomas Brettin1Yitan Zhu2James M. Dolezal3Sara Kochanny4Alexander T. Pearson5Maulik Shukla6Yvonne A. Evrard7James H. Doroshow8Rick L. Stevens9Rick L. Stevens10Division of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesSection of Hematology/Oncology, Department of Medicine, University of Chicago Medical Center, Chicago, IL, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesFrederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., Frederick, MD, United StatesDivision of Cancer Therapeutics and Diagnosis, National Cancer Institute, Bethesda, MD, United StatesDivision of Data Science and Learning, Argonne National Laboratory, Lemont, IL, United StatesDepartment of Computer Science, The University of Chicago, Chicago, IL, United StatesPatient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies. A primary challenge in modeling drug response prediction (DRP) with PDXs and neural networks (NNs) is the limited number of drug response samples. We investigate multimodal neural network (MM-Net) and data augmentation for DRP in PDXs. The MM-Net learns to predict response using drug descriptors, gene expressions (GE), and histology whole-slide images (WSIs). We explore whether combining WSIs with GE improves predictions as compared with models that use GE alone. We propose two data augmentation methods which allow us training multimodal and unimodal NNs without changing architectures with a single larger dataset: 1) combine single-drug and drug-pair treatments by homogenizing drug representations, and 2) augment drug-pairs which doubles the sample size of all drug-pair samples. Unimodal NNs which use GE are compared to assess the contribution of data augmentation. The NN that uses the original and the augmented drug-pair treatments as well as single-drug treatments outperforms NNs that ignore either the augmented drug-pairs or the single-drug treatments. In assessing the multimodal learning based on the MCC metric, MM-Net outperforms all the baselines. Our results show that data augmentation and integration of histology images with GE can improve prediction performance of drug response in PDXs.https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/fulldrug response predictionhistology whole-slide imagesgene expressionmultimodal deep learningpreclinical drug studiesdata augmentation
spellingShingle Alexander Partin
Thomas Brettin
Yitan Zhu
James M. Dolezal
Sara Kochanny
Alexander T. Pearson
Maulik Shukla
Yvonne A. Evrard
James H. Doroshow
Rick L. Stevens
Rick L. Stevens
Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
Frontiers in Medicine
drug response prediction
histology whole-slide images
gene expression
multimodal deep learning
preclinical drug studies
data augmentation
title Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_full Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_fullStr Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_full_unstemmed Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_short Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
title_sort data augmentation and multimodal learning for predicting drug response in patient derived xenografts from gene expressions and histology images
topic drug response prediction
histology whole-slide images
gene expression
multimodal deep learning
preclinical drug studies
data augmentation
url https://www.frontiersin.org/articles/10.3389/fmed.2023.1058919/full
work_keys_str_mv AT alexanderpartin dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT thomasbrettin dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT yitanzhu dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT jamesmdolezal dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT sarakochanny dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT alexandertpearson dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT maulikshukla dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT yvonneaevrard dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT jameshdoroshow dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT ricklstevens dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages
AT ricklstevens dataaugmentationandmultimodallearningforpredictingdrugresponseinpatientderivedxenograftsfromgeneexpressionsandhistologyimages