Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images

Abstract Programmed death-ligand 1 (PD-L1) expression is currently used in the clinic to assess eligibility for immune-checkpoint inhibitors via the tumor proportion score (TPS), but its efficacy is limited by high interobserver variability. Multiple papers have presented systems for the automatic q...

Full description

Bibliographic Details
Main Authors: Leander van Eekelen, Joey Spronck, Monika Looijen-Salamon, Shoko Vos, Enrico Munari, Ilaria Girolami, Albino Eccher, Balazs Acs, Ceren Boyaci, Gabriel Silva de Souza, Muradije Demirel-Andishmand, Luca Dulce Meesters, Daan Zegers, Lieke van der Woude, Willemijn Theelen, Michel van den Heuvel, Katrien Grünberg, Bram van Ginneken, Jeroen van der Laak, Francesco Ciompi
Format: Article
Language:English
Published: Nature Portfolio 2024-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-024-57067-1
_version_ 1827301224276295680
author Leander van Eekelen
Joey Spronck
Monika Looijen-Salamon
Shoko Vos
Enrico Munari
Ilaria Girolami
Albino Eccher
Balazs Acs
Ceren Boyaci
Gabriel Silva de Souza
Muradije Demirel-Andishmand
Luca Dulce Meesters
Daan Zegers
Lieke van der Woude
Willemijn Theelen
Michel van den Heuvel
Katrien Grünberg
Bram van Ginneken
Jeroen van der Laak
Francesco Ciompi
author_facet Leander van Eekelen
Joey Spronck
Monika Looijen-Salamon
Shoko Vos
Enrico Munari
Ilaria Girolami
Albino Eccher
Balazs Acs
Ceren Boyaci
Gabriel Silva de Souza
Muradije Demirel-Andishmand
Luca Dulce Meesters
Daan Zegers
Lieke van der Woude
Willemijn Theelen
Michel van den Heuvel
Katrien Grünberg
Bram van Ginneken
Jeroen van der Laak
Francesco Ciompi
author_sort Leander van Eekelen
collection DOAJ
description Abstract Programmed death-ligand 1 (PD-L1) expression is currently used in the clinic to assess eligibility for immune-checkpoint inhibitors via the tumor proportion score (TPS), but its efficacy is limited by high interobserver variability. Multiple papers have presented systems for the automatic quantification of TPS, but none report on the task of determining cell-level PD-L1 expression and often reserve their evaluation to a single PD-L1 monoclonal antibody or clinical center. In this paper, we report on a deep learning algorithm for detecting PD-L1 negative and positive tumor cells at a cellular level and evaluate it on a cell-level reference standard established by six readers on a multi-centric, multi PD-L1 assay dataset. This reference standard also provides for the first time a benchmark for computer vision algorithms. In addition, in line with other papers, we also evaluate our algorithm at slide-level by measuring the agreement between the algorithm and six pathologists on TPS quantification. We find a moderately low interobserver agreement at cell-level level (mean reader-reader F1 score = 0.68) which our algorithm sits slightly under (mean reader-AI F1 score = 0.55), especially for cases from the clinical center not included in the training set. Despite this, we find good AI-pathologist agreement on quantifying TPS compared to the interobserver agreement (mean reader-reader Cohen’s kappa = 0.54, 95% CI 0.26–0.81, mean reader-AI kappa = 0.49, 95% CI 0.27—0.72). In conclusion, our deep learning algorithm demonstrates promise in detecting PD-L1 expression at a cellular level and exhibits favorable agreement with pathologists in quantifying the tumor proportion score (TPS). We publicly release our models for use via the Grand-Challenge platform.
first_indexed 2024-04-24T16:19:21Z
format Article
id doaj.art-001f7eaf59b84b3c94b65d25972e094d
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-04-24T16:19:21Z
publishDate 2024-03-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-001f7eaf59b84b3c94b65d25972e094d2024-03-31T11:17:14ZengNature PortfolioScientific Reports2045-23222024-03-0114111010.1038/s41598-024-57067-1Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide imagesLeander van Eekelen0Joey Spronck1Monika Looijen-Salamon2Shoko Vos3Enrico Munari4Ilaria Girolami5Albino Eccher6Balazs Acs7Ceren Boyaci8Gabriel Silva de Souza9Muradije Demirel-Andishmand10Luca Dulce Meesters11Daan Zegers12Lieke van der Woude13Willemijn Theelen14Michel van den Heuvel15Katrien Grünberg16Bram van Ginneken17Jeroen van der Laak18Francesco Ciompi19Department of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterPathology Unit, Department of Molecular and Translational Medicine, University of BresciaDepartment of Pathology, Provincial Hospital of Bolzano (SABES-ASDAA)Department of Pathology and Diagnostics, University and Hospital Trust of VeronaDepartment of Clinical Pathology and Cancer Diagnostics, Karolinska University HospitalDepartment of Clinical Pathology and Cancer Diagnostics, Karolinska University HospitalDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Thoracic Oncology, Netherlands Cancer InstituteRespiratory Diseases Department, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Radiology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterDepartment of Pathology, Radboud University Medical CenterAbstract Programmed death-ligand 1 (PD-L1) expression is currently used in the clinic to assess eligibility for immune-checkpoint inhibitors via the tumor proportion score (TPS), but its efficacy is limited by high interobserver variability. Multiple papers have presented systems for the automatic quantification of TPS, but none report on the task of determining cell-level PD-L1 expression and often reserve their evaluation to a single PD-L1 monoclonal antibody or clinical center. In this paper, we report on a deep learning algorithm for detecting PD-L1 negative and positive tumor cells at a cellular level and evaluate it on a cell-level reference standard established by six readers on a multi-centric, multi PD-L1 assay dataset. This reference standard also provides for the first time a benchmark for computer vision algorithms. In addition, in line with other papers, we also evaluate our algorithm at slide-level by measuring the agreement between the algorithm and six pathologists on TPS quantification. We find a moderately low interobserver agreement at cell-level level (mean reader-reader F1 score = 0.68) which our algorithm sits slightly under (mean reader-AI F1 score = 0.55), especially for cases from the clinical center not included in the training set. Despite this, we find good AI-pathologist agreement on quantifying TPS compared to the interobserver agreement (mean reader-reader Cohen’s kappa = 0.54, 95% CI 0.26–0.81, mean reader-AI kappa = 0.49, 95% CI 0.27—0.72). In conclusion, our deep learning algorithm demonstrates promise in detecting PD-L1 expression at a cellular level and exhibits favorable agreement with pathologists in quantifying the tumor proportion score (TPS). We publicly release our models for use via the Grand-Challenge platform.https://doi.org/10.1038/s41598-024-57067-1
spellingShingle Leander van Eekelen
Joey Spronck
Monika Looijen-Salamon
Shoko Vos
Enrico Munari
Ilaria Girolami
Albino Eccher
Balazs Acs
Ceren Boyaci
Gabriel Silva de Souza
Muradije Demirel-Andishmand
Luca Dulce Meesters
Daan Zegers
Lieke van der Woude
Willemijn Theelen
Michel van den Heuvel
Katrien Grünberg
Bram van Ginneken
Jeroen van der Laak
Francesco Ciompi
Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
Scientific Reports
title Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
title_full Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
title_fullStr Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
title_full_unstemmed Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
title_short Comparing deep learning and pathologist quantification of cell-level PD-L1 expression in non-small cell lung cancer whole-slide images
title_sort comparing deep learning and pathologist quantification of cell level pd l1 expression in non small cell lung cancer whole slide images
url https://doi.org/10.1038/s41598-024-57067-1
work_keys_str_mv AT leandervaneekelen comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT joeyspronck comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT monikalooijensalamon comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT shokovos comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT enricomunari comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT ilariagirolami comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT albinoeccher comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT balazsacs comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT cerenboyaci comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT gabrielsilvadesouza comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT muradijedemirelandishmand comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT lucadulcemeesters comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT daanzegers comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT liekevanderwoude comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT willemijntheelen comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT michelvandenheuvel comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT katriengrunberg comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT bramvanginneken comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT jeroenvanderlaak comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages
AT francescociompi comparingdeeplearningandpathologistquantificationofcelllevelpdl1expressioninnonsmallcelllungcancerwholeslideimages