Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo

Summary: A major challenge in single-cell gene expression analysis is to discern meaningful cellular heterogeneity from technical or biological noise. To address this challenge, we present entropy sorting (ES), a mathematical framework that distinguishes genes indicative of cell identity. ES achieve...

Full description

Bibliographic Details
Main Authors: Arthur Radley, Elena Corujo-Simon, Jennifer Nichols, Austin Smith, Sara-Jane Dunn
Format: Article
Language:English
Published: Elsevier 2023-01-01
Series:Stem Cell Reports
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2213671122004568
_version_ 1797955436272418816
author Arthur Radley
Elena Corujo-Simon
Jennifer Nichols
Austin Smith
Sara-Jane Dunn
author_facet Arthur Radley
Elena Corujo-Simon
Jennifer Nichols
Austin Smith
Sara-Jane Dunn
author_sort Arthur Radley
collection DOAJ
description Summary: A major challenge in single-cell gene expression analysis is to discern meaningful cellular heterogeneity from technical or biological noise. To address this challenge, we present entropy sorting (ES), a mathematical framework that distinguishes genes indicative of cell identity. ES achieves this in an unsupervised manner by quantifying if observed correlations between features are more likely to have occurred due to random chance versus a dependent relationship, without the need for any user-defined significance threshold. On synthetic data, we demonstrate the removal of noisy signals to reveal a higher resolution of gene expression patterns than commonly used feature selection methods. We then apply ES to human pre-implantation embryo single-cell RNA sequencing (scRNA-seq) data. Previous studies failed to unambiguously identify early inner cell mass (ICM), suggesting that the human embryo may diverge from the mouse paradigm. In contrast, ES resolves the ICM and reveals sequential lineage bifurcations as in the classical model. ES thus provides a powerful approach for maximizing information extraction from high-dimensional datasets such as scRNA-seq data.
first_indexed 2024-04-10T23:33:10Z
format Article
id doaj.art-a91d520927f84cbb9228913be102c1a9
institution Directory Open Access Journal
issn 2213-6711
language English
last_indexed 2024-04-10T23:33:10Z
publishDate 2023-01-01
publisher Elsevier
record_format Article
series Stem Cell Reports
spelling doaj.art-a91d520927f84cbb9228913be102c1a92023-01-12T04:19:00ZengElsevierStem Cell Reports2213-67112023-01-011814763Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryoArthur Radley0Elena Corujo-Simon1Jennifer Nichols2Austin Smith3Sara-Jane Dunn4Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Jeffrey Cheah Biomedical Centre, Puddicombe Way, Cambridge Biomedical Campus, Cambridge CB2 0AW, UKMRC Human Genetics Unit, MRC Institute of Genetics and Cancer, The University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UKMRC Human Genetics Unit, MRC Institute of Genetics and Cancer, The University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UKLiving Systems Institute, University of Exeter, Stocker Road, Exeter EX4 4QD, UK; Corresponding authorMicrosoft Research, 21 Station Road, Cambridge CB1 2FB, UK; Corresponding authorSummary: A major challenge in single-cell gene expression analysis is to discern meaningful cellular heterogeneity from technical or biological noise. To address this challenge, we present entropy sorting (ES), a mathematical framework that distinguishes genes indicative of cell identity. ES achieves this in an unsupervised manner by quantifying if observed correlations between features are more likely to have occurred due to random chance versus a dependent relationship, without the need for any user-defined significance threshold. On synthetic data, we demonstrate the removal of noisy signals to reveal a higher resolution of gene expression patterns than commonly used feature selection methods. We then apply ES to human pre-implantation embryo single-cell RNA sequencing (scRNA-seq) data. Previous studies failed to unambiguously identify early inner cell mass (ICM), suggesting that the human embryo may diverge from the mouse paradigm. In contrast, ES resolves the ICM and reveals sequential lineage bifurcations as in the classical model. ES thus provides a powerful approach for maximizing information extraction from high-dimensional datasets such as scRNA-seq data.http://www.sciencedirect.com/science/article/pii/S2213671122004568single-cell RNA sequencingfeature selectionhuman embryo inner cell mass
spellingShingle Arthur Radley
Elena Corujo-Simon
Jennifer Nichols
Austin Smith
Sara-Jane Dunn
Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
Stem Cell Reports
single-cell RNA sequencing
feature selection
human embryo inner cell mass
title Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
title_full Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
title_fullStr Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
title_full_unstemmed Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
title_short Entropy sorting of single-cell RNA sequencing data reveals the inner cell mass in the human pre-implantation embryo
title_sort entropy sorting of single cell rna sequencing data reveals the inner cell mass in the human pre implantation embryo
topic single-cell RNA sequencing
feature selection
human embryo inner cell mass
url http://www.sciencedirect.com/science/article/pii/S2213671122004568
work_keys_str_mv AT arthurradley entropysortingofsinglecellrnasequencingdatarevealstheinnercellmassinthehumanpreimplantationembryo
AT elenacorujosimon entropysortingofsinglecellrnasequencingdatarevealstheinnercellmassinthehumanpreimplantationembryo
AT jennifernichols entropysortingofsinglecellrnasequencingdatarevealstheinnercellmassinthehumanpreimplantationembryo
AT austinsmith entropysortingofsinglecellrnasequencingdatarevealstheinnercellmassinthehumanpreimplantationembryo
AT sarajanedunn entropysortingofsinglecellrnasequencingdatarevealstheinnercellmassinthehumanpreimplantationembryo