Artificial Neural Networks Predicted the Overall Survival and Molecular Subtypes of Diffuse Large B-Cell Lymphoma Using a Pancancer Immune-Oncology Panel

Diffuse large B-cell lymphoma (DLBCL) is one of the most frequent subtypes of non-Hodgkin lymphomas. We used artificial neural networks (multilayer perceptron and radial basis function), machine learning, and conventional bioinformatics to predict the overall survival and molecular subtypes of DLBCL...

Full description

Bibliographic Details
Main Authors: Joaquim Carreras, Shinichiro Hiraiwa, Yara Yukie Kikuti, Masashi Miyaoka, Sakura Tomita, Haruka Ikoma, Atsushi Ito, Yusuke Kondo, Giovanna Roncador, Juan F. Garcia, Kiyoshi Ando, Rifat Hamoudi, Naoya Nakamura
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Cancers
Subjects:
Online Access:https://www.mdpi.com/2072-6694/13/24/6384
Description
Summary:Diffuse large B-cell lymphoma (DLBCL) is one of the most frequent subtypes of non-Hodgkin lymphomas. We used artificial neural networks (multilayer perceptron and radial basis function), machine learning, and conventional bioinformatics to predict the overall survival and molecular subtypes of DLBCL. The series included 106 cases and 730 genes of a pancancer immune-oncology panel (nCounter) as predictors. The multilayer perceptron predicted the outcome with high accuracy, with an area under the curve (AUC) of 0.98, and ranked all the genes according to their importance. In a multivariate analysis, <i>ARG1</i>, <i>TNFSF12</i>, <i>REL</i>, and <i>NRP1</i> correlated with favorable survival (hazard risks: 0.3–0.5), and <i>IFNA8</i>, <i>CASP1</i>, and <i>CTSG</i>, with poor survival (hazard risks = 1.0–2.1). Gene set enrichment analysis (GSEA) showed enrichment toward poor prognosis. These high-risk genes were also associated with the gene expression of M2-like tumor-associated macrophages (<i>CD163</i>), and <i>MYD88</i> expression. The prognostic relevance of this set of 7 genes was also confirmed within the IPI and <i>MYC</i> translocation strata, the EBER-negative cases, the DLBCL not-otherwise specified (NOS) (High-grade B-cell lymphoma with <i>MYC</i> and <i>BCL2</i> and/or <i>BCL6</i> rearrangements excluded), and an independent series of 414 cases of DLBCL in Europe and North America (GSE10846). The perceptron analysis also predicted molecular subtypes (based on the Lymph2Cx assay) with high accuracy (AUC = 1). <i>STAT6</i>, <i>TREM2</i>, and <i>REL</i> were associated with the germinal center B-cell (GCB) subtype, and <i>CD37</i>, <i>GNLY</i>, <i>CD46</i>, and <i>IL17B</i> were associated with the activated B-cell (ABC)/unspecified subtype. The GSEA had a sinusoidal-like plot with association to both molecular subtypes, and immunohistochemistry analysis confirmed the correlation of <i>MAPK3</i> with the GCB subtype in another series of 96 cases (notably, MAPK3 also correlated with LMO2, but not with M2-like tumor-associated macrophage markers CD163, CSF1R, TNFAIP8, CASP8, PD-L1, PTX3, and IL-10). Finally, survival and molecular subtypes were successfully modeled using other machine learning techniques including logistic regression, discriminant analysis, SVM, CHAID, C5, C&R trees, KNN algorithm, and Bayesian network. In conclusion, prognoses and molecular subtypes were predicted with high accuracy using neural networks, and relevant genes were highlighted.
ISSN:2072-6694