Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data
Abstract Background Protein biomarkers of cancer progression and response to therapy are increasingly important for improving personalized medicine. Advanced quantitative pathology platforms enable measurement of protein expression in tissues at the single-cell level. However, this rich quantitative...
Main Authors: | , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2023-07-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-023-05408-8 |
_version_ | 1827894814427840512 |
---|---|
author | Misung Yi Tingting Zhan Amy R. Peck Jeffrey A. Hooke Albert J. Kovatich Craig D. Shriver Hai Hu Yunguang Sun Hallgeir Rui Inna Chervoneva |
author_facet | Misung Yi Tingting Zhan Amy R. Peck Jeffrey A. Hooke Albert J. Kovatich Craig D. Shriver Hai Hu Yunguang Sun Hallgeir Rui Inna Chervoneva |
author_sort | Misung Yi |
collection | DOAJ |
description | Abstract Background Protein biomarkers of cancer progression and response to therapy are increasingly important for improving personalized medicine. Advanced quantitative pathology platforms enable measurement of protein expression in tissues at the single-cell level. However, this rich quantitative cell-by-cell biomarker information is most often not exploited. Instead, it is reduced to a single mean across the cells of interest or converted into a simple proportion of binary biomarker-positive or -negative cells. Results We investigated the utility of retaining all quantitative information at the single-cell level by considering the values of the quantile function (inverse of the cumulative distribution function) estimated from a sample of cell signal intensity levels in a tumor tissue. An algorithm was developed for selecting optimal cutoffs for dichotomizing cell signal intensity distribution quantiles as predictors of continuous, categorical or survival outcomes. The proposed algorithm was used to select optimal quantile biomarkers of breast cancer progression based on cancer cells’ cell signal intensity levels of nuclear protein Ki-67, Proliferating cell nuclear antigen, Programmed cell death 1 ligand 2, and Progesterone receptor. The performance of the resulting optimal quantile biomarkers was validated and compared to the standard cancer compartment mean signal intensity markers using an independent external validation cohort. For Ki-67, the optimal quantile biomarker was also compared to established biomarkers based on percentages of Ki67-positive cells. For proteins significantly associated with PFS in the external validation cohort, the optimal quantile biomarkers yielded either larger or similar effect size (hazard ratio for progression-free survival) as compared to cancer compartment mean signal intensity biomarkers. Conclusion The optimal quantile protein biomarkers yield generally improved prognostic value as compared to the standard protein expression markers. The proposed methodology has a broad application to single-cell data from genomics, transcriptomics, proteomics, or metabolomics studies at the single cell level. |
first_indexed | 2024-03-12T22:12:55Z |
format | Article |
id | doaj.art-557a8ed992154aa281c4dfc3aec8358d |
institution | Directory Open Access Journal |
issn | 1471-2105 |
language | English |
last_indexed | 2024-03-12T22:12:55Z |
publishDate | 2023-07-01 |
publisher | BMC |
record_format | Article |
series | BMC Bioinformatics |
spelling | doaj.art-557a8ed992154aa281c4dfc3aec8358d2023-07-23T11:28:10ZengBMCBMC Bioinformatics1471-21052023-07-0124111510.1186/s12859-023-05408-8Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry dataMisung Yi0Tingting Zhan1Amy R. Peck2Jeffrey A. Hooke3Albert J. Kovatich4Craig D. Shriver5Hai Hu6Yunguang Sun7Hallgeir Rui8Inna Chervoneva9Division of Biostatistics, Department of Pharmacology and Experimental Therapeutics, Sidney Kimmel Medical College, Thomas Jefferson UniversityDivision of Biostatistics, Department of Pharmacology and Experimental Therapeutics, Sidney Kimmel Medical College, Thomas Jefferson UniversityDepartment of Pathology, Medical College of WisconsinJohn P. Murtha Cancer Center, Uniformed Services University and Walter Reed National Military Medical CenterJohn P. Murtha Cancer Center, Uniformed Services University and Walter Reed National Military Medical CenterJohn P. Murtha Cancer Center, Uniformed Services University and Walter Reed National Military Medical CenterChan Soon-Shiong Institute of Molecular Medicine at WindberDepartment of Pathology, Medical College of WisconsinDepartment of Pathology, Medical College of WisconsinDivision of Biostatistics, Department of Pharmacology and Experimental Therapeutics, Sidney Kimmel Medical College, Thomas Jefferson UniversityAbstract Background Protein biomarkers of cancer progression and response to therapy are increasingly important for improving personalized medicine. Advanced quantitative pathology platforms enable measurement of protein expression in tissues at the single-cell level. However, this rich quantitative cell-by-cell biomarker information is most often not exploited. Instead, it is reduced to a single mean across the cells of interest or converted into a simple proportion of binary biomarker-positive or -negative cells. Results We investigated the utility of retaining all quantitative information at the single-cell level by considering the values of the quantile function (inverse of the cumulative distribution function) estimated from a sample of cell signal intensity levels in a tumor tissue. An algorithm was developed for selecting optimal cutoffs for dichotomizing cell signal intensity distribution quantiles as predictors of continuous, categorical or survival outcomes. The proposed algorithm was used to select optimal quantile biomarkers of breast cancer progression based on cancer cells’ cell signal intensity levels of nuclear protein Ki-67, Proliferating cell nuclear antigen, Programmed cell death 1 ligand 2, and Progesterone receptor. The performance of the resulting optimal quantile biomarkers was validated and compared to the standard cancer compartment mean signal intensity markers using an independent external validation cohort. For Ki-67, the optimal quantile biomarker was also compared to established biomarkers based on percentages of Ki67-positive cells. For proteins significantly associated with PFS in the external validation cohort, the optimal quantile biomarkers yielded either larger or similar effect size (hazard ratio for progression-free survival) as compared to cancer compartment mean signal intensity biomarkers. Conclusion The optimal quantile protein biomarkers yield generally improved prognostic value as compared to the standard protein expression markers. The proposed methodology has a broad application to single-cell data from genomics, transcriptomics, proteomics, or metabolomics studies at the single cell level.https://doi.org/10.1186/s12859-023-05408-8Cellular protein expressionDistribution quantilesCancer biomarkersTissue microarraysBreast cancer |
spellingShingle | Misung Yi Tingting Zhan Amy R. Peck Jeffrey A. Hooke Albert J. Kovatich Craig D. Shriver Hai Hu Yunguang Sun Hallgeir Rui Inna Chervoneva Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data BMC Bioinformatics Cellular protein expression Distribution quantiles Cancer biomarkers Tissue microarrays Breast cancer |
title | Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data |
title_full | Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data |
title_fullStr | Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data |
title_full_unstemmed | Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data |
title_short | Selection of optimal quantile protein biomarkers based on cell-level immunohistochemistry data |
title_sort | selection of optimal quantile protein biomarkers based on cell level immunohistochemistry data |
topic | Cellular protein expression Distribution quantiles Cancer biomarkers Tissue microarrays Breast cancer |
url | https://doi.org/10.1186/s12859-023-05408-8 |
work_keys_str_mv | AT misungyi selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT tingtingzhan selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT amyrpeck selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT jeffreyahooke selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT albertjkovatich selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT craigdshriver selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT haihu selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT yunguangsun selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT hallgeirrui selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata AT innachervoneva selectionofoptimalquantileproteinbiomarkersbasedoncelllevelimmunohistochemistrydata |