The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines

Background: Gene expression signatures are clusters of genes discriminating different statuses of the cells and their definition is critical for understanding the molecular bases of diseases. The identification of a gene signature is complicated by the high dimensional nature of the data and by the...

Full description

Bibliographic Details
Main Authors: Varesio, Luigi, Verri, Alessandro, Mosci, Sofia, Barla, Annalisa, Fardin, Paolo, Rosasco, Lorenzo Andrea
Other Authors: Massachusetts Institute of Technology. Center for Biological & Computational Learning
Format: Article
Language:en_US
Published: BioMed Central 2010
Online Access:http://hdl.handle.net/1721.1/52504
https://orcid.org/0000-0001-6376-4786
_version_ 1826215351179804672
author Varesio, Luigi
Verri, Alessandro
Mosci, Sofia
Barla, Annalisa
Fardin, Paolo
Rosasco, Lorenzo Andrea
author2 Massachusetts Institute of Technology. Center for Biological & Computational Learning
author_facet Massachusetts Institute of Technology. Center for Biological & Computational Learning
Varesio, Luigi
Verri, Alessandro
Mosci, Sofia
Barla, Annalisa
Fardin, Paolo
Rosasco, Lorenzo Andrea
author_sort Varesio, Luigi
collection MIT
description Background: Gene expression signatures are clusters of genes discriminating different statuses of the cells and their definition is critical for understanding the molecular bases of diseases. The identification of a gene signature is complicated by the high dimensional nature of the data and by the genetic heterogeneity of the responding cells. The l[subscript 1]-l[subscript 2] regularization is an embedded feature selection technique that fulfills all the desirable properties of a variable selection algorithm and has the potential to generate a specific signature even in biologically complex settings. We studied the application of this algorithm to detect the signature characterizing the transcriptional response of neuroblastoma tumor cell lines to hypoxia, a condition of low oxygen tension that occurs in the tumor microenvironment. Results: We determined the gene expression profile of 9 neuroblastoma cell lines cultured under normoxic and hypoxic conditions. We studied a heterogeneous set of neuroblastoma cell lines to mimic the in vivo situation and to test the robustness and validity of the l[subscript 1]-l[subscript 2] regularization with double optimization. Analysis by hierarchical, spectral, and k-means clustering or supervised approach based on t-test analysis divided the cell lines on the bases of genetic differences. However, the disturbance of this strong transcriptional response completely masked the detection of the more subtle response to hypoxia. Different results were obtained when we applied the l[subscript 1]-l[subscript 2] regularization framework. The algorithm distinguished the normoxic and hypoxic statuses defining signatures comprising 3 to 38 probesets, with a leave-one-out error of 17%. A consensus hypoxia signature was established setting the frequency score at 50% and the correlation parameter ε equal to 100. This signature is composed by 11 probesets representing 8 well characterized genes known to be modulated by hypoxia. Conclusion: We demonstrate that l[subscript 1]-l[subscript 2] regularization outperforms more conventional approaches allowing the identification and definition of a gene expression signature under complex experimental conditions. The l[subscript 1]-l[subscript 2] regularization and the cross validation generates an unbiased and objective output with a low classification error. We feel that the application of this algorithm to tumor biology will be instrumental to analyze gene expression signatures hidden in the transcriptome that, like hypoxia, may be major determinant of the course of the disease.
first_indexed 2024-09-23T16:25:40Z
format Article
id mit-1721.1/52504
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T16:25:40Z
publishDate 2010
publisher BioMed Central
record_format dspace
spelling mit-1721.1/525042022-09-29T19:47:38Z The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines The l1-l2 regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines Varesio, Luigi Verri, Alessandro Mosci, Sofia Barla, Annalisa Fardin, Paolo Rosasco, Lorenzo Andrea Massachusetts Institute of Technology. Center for Biological & Computational Learning McGovern Institute for Brain Research at MIT Rosasco, Lorenzo Andrea Rosasco, Lorenzo Andrea Background: Gene expression signatures are clusters of genes discriminating different statuses of the cells and their definition is critical for understanding the molecular bases of diseases. The identification of a gene signature is complicated by the high dimensional nature of the data and by the genetic heterogeneity of the responding cells. The l[subscript 1]-l[subscript 2] regularization is an embedded feature selection technique that fulfills all the desirable properties of a variable selection algorithm and has the potential to generate a specific signature even in biologically complex settings. We studied the application of this algorithm to detect the signature characterizing the transcriptional response of neuroblastoma tumor cell lines to hypoxia, a condition of low oxygen tension that occurs in the tumor microenvironment. Results: We determined the gene expression profile of 9 neuroblastoma cell lines cultured under normoxic and hypoxic conditions. We studied a heterogeneous set of neuroblastoma cell lines to mimic the in vivo situation and to test the robustness and validity of the l[subscript 1]-l[subscript 2] regularization with double optimization. Analysis by hierarchical, spectral, and k-means clustering or supervised approach based on t-test analysis divided the cell lines on the bases of genetic differences. However, the disturbance of this strong transcriptional response completely masked the detection of the more subtle response to hypoxia. Different results were obtained when we applied the l[subscript 1]-l[subscript 2] regularization framework. The algorithm distinguished the normoxic and hypoxic statuses defining signatures comprising 3 to 38 probesets, with a leave-one-out error of 17%. A consensus hypoxia signature was established setting the frequency score at 50% and the correlation parameter ε equal to 100. This signature is composed by 11 probesets representing 8 well characterized genes known to be modulated by hypoxia. Conclusion: We demonstrate that l[subscript 1]-l[subscript 2] regularization outperforms more conventional approaches allowing the identification and definition of a gene expression signature under complex experimental conditions. The l[subscript 1]-l[subscript 2] regularization and the cross validation generates an unbiased and objective output with a low classification error. We feel that the application of this algorithm to tumor biology will be instrumental to analyze gene expression signatures hidden in the transcriptome that, like hypoxia, may be major determinant of the course of the disease. EU Integrated Project Health-e-Child Fondazione Italiana per la Lotta al Neuroblastoma Foreign Investment Review Board (project RBIN04PARL) 2010-03-11T16:33:39Z 2010-03-11T16:33:39Z 2009-10 2009-05 Article http://purl.org/eprint/type/JournalArticle 1471-2164 http://hdl.handle.net/1721.1/52504 Fardin, Paolo et al. “The l1-l2 regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines.” BMC Genomics 10.1 (2009): 474. https://orcid.org/0000-0001-6376-4786 en_US http://dx.doi.org/10.1186/1471-2164-10-474 BMC Genomics Creative Commons Attribution http://creativecommons.org/licenses/by/2.0/ application/pdf BioMed Central BioMed Central
spellingShingle Varesio, Luigi
Verri, Alessandro
Mosci, Sofia
Barla, Annalisa
Fardin, Paolo
Rosasco, Lorenzo Andrea
The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title_full The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title_fullStr The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title_full_unstemmed The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title_short The l[subscript 1]-l[subscript 2] regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
title_sort l subscript 1 l subscript 2 regularization framework unmasks the hypoxia signature hidden in the transcriptome of a set of heterogeneous neuroblastoma cell lines
url http://hdl.handle.net/1721.1/52504
https://orcid.org/0000-0001-6376-4786
work_keys_str_mv AT varesioluigi thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT verrialessandro thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT moscisofia thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT barlaannalisa thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT fardinpaolo thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT rosascolorenzoandrea thelsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT varesioluigi thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT verrialessandro thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT moscisofia thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT barlaannalisa thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT fardinpaolo thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT rosascolorenzoandrea thel1l2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT varesioluigi lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT verrialessandro lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT moscisofia lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT barlaannalisa lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT fardinpaolo lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines
AT rosascolorenzoandrea lsubscript1lsubscript2regularizationframeworkunmasksthehypoxiasignaturehiddeninthetranscriptomeofasetofheterogeneousneuroblastomacelllines