An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer

Motivation: The precise diagnosis of the major subtypes, lung adenocarcinoma and lung squamous cell carcinoma, of non-small-cell lung cancer is of practical importance as some treatments are subtype-specific. However, in some cases diagnosis via the commonly-used method, that is staining the specime...

Full description

Bibliographic Details
Main Authors: Mehdi Hamaneh, Yi-Kuo Yu
Format: Article
Language:English
Published: SAGE Publishing 2022-06-01
Series:Cancer Informatics
Online Access:https://doi.org/10.1177/11769351221100718
_version_ 1828793460303331328
author Mehdi Hamaneh
Yi-Kuo Yu
author_facet Mehdi Hamaneh
Yi-Kuo Yu
author_sort Mehdi Hamaneh
collection DOAJ
description Motivation: The precise diagnosis of the major subtypes, lung adenocarcinoma and lung squamous cell carcinoma, of non-small-cell lung cancer is of practical importance as some treatments are subtype-specific. However, in some cases diagnosis via the commonly-used method, that is staining the specimen using immunohistochemical markers, may be challenging. Hence, having a computational method that complements the diagnosis is desirable. In this paper, we propose a gene signature for this purpose. Results: We developed an expression-based method that systematically suggests a huge set of candidate gene signatures and finds the best candidate. By applying this method to a training set, the optimal gene signature was found by considering close to 765 billion candidate signatures. The 8-gene signature found for classifying the 2 aforementioned subtypes comprises TP63, CALML3, KRT5, PKP1, TESC, SPINK1, C9orf152, and KRT7. The signature achieved a high overall prediction accuracy of 0.936 when tested using 34 independent gene expression datasets obtained using different technologies and comprising 2556 adenocarcinoma and 1630 squamous cell carcinoma samples. Additionally, the signature performed well in clinically challenging cases, that is poorly differentiated tumors and specimens obtained from biopsies. In comparison with 2 previously reported signatures, our signature performed better in terms of overall accuracy and especially accuracy of classifying lung squamous cell carcinoma. Conclusions: Our signature is easy to use and accurate regardless of the technology used to obtain the gene expression profiles. It performs well even in clinically challenging cases and thus can assist pathologists in diagnosis of the ambiguous cases.
first_indexed 2024-12-12T03:26:52Z
format Article
id doaj.art-c26448a5c33f48b59d7ccd5a25ac0780
institution Directory Open Access Journal
issn 1176-9351
language English
last_indexed 2024-12-12T03:26:52Z
publishDate 2022-06-01
publisher SAGE Publishing
record_format Article
series Cancer Informatics
spelling doaj.art-c26448a5c33f48b59d7ccd5a25ac07802022-12-22T00:40:00ZengSAGE PublishingCancer Informatics1176-93512022-06-012110.1177/11769351221100718An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung CancerMehdi HamanehYi-Kuo YuMotivation: The precise diagnosis of the major subtypes, lung adenocarcinoma and lung squamous cell carcinoma, of non-small-cell lung cancer is of practical importance as some treatments are subtype-specific. However, in some cases diagnosis via the commonly-used method, that is staining the specimen using immunohistochemical markers, may be challenging. Hence, having a computational method that complements the diagnosis is desirable. In this paper, we propose a gene signature for this purpose. Results: We developed an expression-based method that systematically suggests a huge set of candidate gene signatures and finds the best candidate. By applying this method to a training set, the optimal gene signature was found by considering close to 765 billion candidate signatures. The 8-gene signature found for classifying the 2 aforementioned subtypes comprises TP63, CALML3, KRT5, PKP1, TESC, SPINK1, C9orf152, and KRT7. The signature achieved a high overall prediction accuracy of 0.936 when tested using 34 independent gene expression datasets obtained using different technologies and comprising 2556 adenocarcinoma and 1630 squamous cell carcinoma samples. Additionally, the signature performed well in clinically challenging cases, that is poorly differentiated tumors and specimens obtained from biopsies. In comparison with 2 previously reported signatures, our signature performed better in terms of overall accuracy and especially accuracy of classifying lung squamous cell carcinoma. Conclusions: Our signature is easy to use and accurate regardless of the technology used to obtain the gene expression profiles. It performs well even in clinically challenging cases and thus can assist pathologists in diagnosis of the ambiguous cases.https://doi.org/10.1177/11769351221100718
spellingShingle Mehdi Hamaneh
Yi-Kuo Yu
An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
Cancer Informatics
title An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
title_full An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
title_fullStr An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
title_full_unstemmed An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
title_short An 8-Gene Signature for Classifying Major Subtypes of Non-Small-Cell Lung Cancer
title_sort 8 gene signature for classifying major subtypes of non small cell lung cancer
url https://doi.org/10.1177/11769351221100718
work_keys_str_mv AT mehdihamaneh an8genesignatureforclassifyingmajorsubtypesofnonsmallcelllungcancer
AT yikuoyu an8genesignatureforclassifyingmajorsubtypesofnonsmallcelllungcancer
AT mehdihamaneh 8genesignatureforclassifyingmajorsubtypesofnonsmallcelllungcancer
AT yikuoyu 8genesignatureforclassifyingmajorsubtypesofnonsmallcelllungcancer