Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data

Background: Homologous recombination deficiency (HRD) is the hallmark of breast cancer gene 1/2 ( BRCA1/2 )-mutated tumors and the unique biomarker for predicting response to double-strand break (DSB)–inducing drugs. The demonstration of HRD in tumors with mutations in genes other than BRCA1/2 is co...

Full description

Bibliographic Details
Main Authors: Maher Albitar, Hong Zhang, Andrew Pecora, Stanley Waintraub, Deena Graham, Mira Hellmann, Donna McNamara, Ahmad Charifa, Ivan De Dios, Wanlong Ma, Andre Goy
Format: Article
Language:English
Published: SAGE Publishing 2023-09-01
Series:Breast Cancer: Basic and Clinical Research
Online Access:https://doi.org/10.1177/11782234231198979
_version_ 1797670250829840384
author Maher Albitar
Hong Zhang
Andrew Pecora
Stanley Waintraub
Deena Graham
Mira Hellmann
Donna McNamara
Ahmad Charifa
Ivan De Dios
Wanlong Ma
Andre Goy
author_facet Maher Albitar
Hong Zhang
Andrew Pecora
Stanley Waintraub
Deena Graham
Mira Hellmann
Donna McNamara
Ahmad Charifa
Ivan De Dios
Wanlong Ma
Andre Goy
author_sort Maher Albitar
collection DOAJ
description Background: Homologous recombination deficiency (HRD) is the hallmark of breast cancer gene 1/2 ( BRCA1/2 )-mutated tumors and the unique biomarker for predicting response to double-strand break (DSB)–inducing drugs. The demonstration of HRD in tumors with mutations in genes other than BRCA1/2 is considered the best biomarker of potential response to these DSB-inducer drugs. Objectives: We explored the potential of developing a practical approach to predict in any tumor the presence of HRD that is similar to that seen in tumors with BRCA1/2 mutations using next-generation sequencing (NGS) along with machine learning (ML). Design: We use copy number alteration (CNA) generated from routine-targeted NGS data along with a modified naïve Bayesian model for the prediction of the presence of HRD. Methods: The CNA from NGS of 434 targeted genes was analyzed using CNVkit software to calculate the log2 of CNA changes. The log2 values of various sequencing reads (bins) were used in ML to train the system on predicting tumors with BRCA1/2 mutations and tumors with abnormalities similar to those detected in BRCA1/2 mutations. Results: Using 31 breast or ovarian cancers with BRCA1/2 mutations and 84 tumors without mutations in any of 12 homologous recombination repair (HRR) genes, the ML demonstrated high sensitivity (90%, 95% confidence interval [CI] = 73%-97.5%) and specificity (98%, 95% CI = 90%-100%). Testing of 114 tumors with mutations in HRR genes other than BRCA1/2 showed 39% positivity for HRD similar to that seen in BRCA1/2 . Testing 213 additional wild-type (WT) cancers showed HRD positivity similar to BRCA1/2 in 32% of cases. Correlation with proportional loss of heterozygosity (LOH) as determined using whole exome sequencing of 51 samples showed 90% (95% CI = 72%-97%) concordance. The approach was also validated in an independent set of 1312 consecutive tumor samples. Conclusions: These data demonstrate that CNA when combined with ML can reliably predict the presence of BRCA1/2 level HRD with high specificity. Using BRCA1/2 mutant cases as gold standard, this ML can be used to predict HRD in cancers with mutations in other HRR genes as well as in WT tumors.
first_indexed 2024-03-11T20:57:00Z
format Article
id doaj.art-a4b6c138eca440cbb2517f8a68962e5c
institution Directory Open Access Journal
issn 1178-2234
language English
last_indexed 2024-03-11T20:57:00Z
publishDate 2023-09-01
publisher SAGE Publishing
record_format Article
series Breast Cancer: Basic and Clinical Research
spelling doaj.art-a4b6c138eca440cbb2517f8a68962e5c2023-09-30T09:03:31ZengSAGE PublishingBreast Cancer: Basic and Clinical Research1178-22342023-09-011710.1177/11782234231198979Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing DataMaher Albitar0Hong Zhang1Andrew Pecora2Stanley Waintraub3Deena Graham4Mira Hellmann5Donna McNamara6Ahmad Charifa7Ivan De Dios8Wanlong Ma9Andre Goy10Genomic Testing Cooperative, Irvine, CA, USAGenomic Testing Cooperative, Irvine, CA, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USAGenomic Testing Cooperative, Irvine, CA, USAGenomic Testing Cooperative, Irvine, CA, USAGenomic Testing Cooperative, Irvine, CA, USAJohn Theurer Cancer Center at Hackensack University Medical Center, Hackensack, NJ, USABackground: Homologous recombination deficiency (HRD) is the hallmark of breast cancer gene 1/2 ( BRCA1/2 )-mutated tumors and the unique biomarker for predicting response to double-strand break (DSB)–inducing drugs. The demonstration of HRD in tumors with mutations in genes other than BRCA1/2 is considered the best biomarker of potential response to these DSB-inducer drugs. Objectives: We explored the potential of developing a practical approach to predict in any tumor the presence of HRD that is similar to that seen in tumors with BRCA1/2 mutations using next-generation sequencing (NGS) along with machine learning (ML). Design: We use copy number alteration (CNA) generated from routine-targeted NGS data along with a modified naïve Bayesian model for the prediction of the presence of HRD. Methods: The CNA from NGS of 434 targeted genes was analyzed using CNVkit software to calculate the log2 of CNA changes. The log2 values of various sequencing reads (bins) were used in ML to train the system on predicting tumors with BRCA1/2 mutations and tumors with abnormalities similar to those detected in BRCA1/2 mutations. Results: Using 31 breast or ovarian cancers with BRCA1/2 mutations and 84 tumors without mutations in any of 12 homologous recombination repair (HRR) genes, the ML demonstrated high sensitivity (90%, 95% confidence interval [CI] = 73%-97.5%) and specificity (98%, 95% CI = 90%-100%). Testing of 114 tumors with mutations in HRR genes other than BRCA1/2 showed 39% positivity for HRD similar to that seen in BRCA1/2 . Testing 213 additional wild-type (WT) cancers showed HRD positivity similar to BRCA1/2 in 32% of cases. Correlation with proportional loss of heterozygosity (LOH) as determined using whole exome sequencing of 51 samples showed 90% (95% CI = 72%-97%) concordance. The approach was also validated in an independent set of 1312 consecutive tumor samples. Conclusions: These data demonstrate that CNA when combined with ML can reliably predict the presence of BRCA1/2 level HRD with high specificity. Using BRCA1/2 mutant cases as gold standard, this ML can be used to predict HRD in cancers with mutations in other HRR genes as well as in WT tumors.https://doi.org/10.1177/11782234231198979
spellingShingle Maher Albitar
Hong Zhang
Andrew Pecora
Stanley Waintraub
Deena Graham
Mira Hellmann
Donna McNamara
Ahmad Charifa
Ivan De Dios
Wanlong Ma
Andre Goy
Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
Breast Cancer: Basic and Clinical Research
title Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
title_full Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
title_fullStr Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
title_full_unstemmed Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
title_short Homologous Recombination Abnormalities Associated With Mutations as Predicted by Machine Learning of Targeted Next-Generation Sequencing Data
title_sort homologous recombination abnormalities associated with mutations as predicted by machine learning of targeted next generation sequencing data
url https://doi.org/10.1177/11782234231198979
work_keys_str_mv AT maheralbitar homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT hongzhang homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT andrewpecora homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT stanleywaintraub homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT deenagraham homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT mirahellmann homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT donnamcnamara homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT ahmadcharifa homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT ivandedios homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT wanlongma homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata
AT andregoy homologousrecombinationabnormalitiesassociatedwithmutationsaspredictedbymachinelearningoftargetednextgenerationsequencingdata