SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data

Summary: The analysis and interpretation of single-cell RNA sequencing (scRNA-seq) experiments are compromised by the presence of poor-quality cells. For meaningful analyses, such poor-quality cells should be excluded as they introduce noise in the data. We introduce SkewC, a quality-assessment tool...

Full description

Bibliographic Details
Main Authors: Imad Abugessaisa, Akira Hasegawa, Shuhei Noguchi, Melissa Cardon, Kazuhide Watanabe, Masataka Takahashi, Harukazu Suzuki, Shintaro Katayama, Juha Kere, Takeya Kasukawa
Format: Article
Language:English
Published: Elsevier 2022-02-01
Series:iScience
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2589004222000475
_version_ 1819282878305927168
author Imad Abugessaisa
Akira Hasegawa
Shuhei Noguchi
Melissa Cardon
Kazuhide Watanabe
Masataka Takahashi
Harukazu Suzuki
Shintaro Katayama
Juha Kere
Takeya Kasukawa
author_facet Imad Abugessaisa
Akira Hasegawa
Shuhei Noguchi
Melissa Cardon
Kazuhide Watanabe
Masataka Takahashi
Harukazu Suzuki
Shintaro Katayama
Juha Kere
Takeya Kasukawa
author_sort Imad Abugessaisa
collection DOAJ
description Summary: The analysis and interpretation of single-cell RNA sequencing (scRNA-seq) experiments are compromised by the presence of poor-quality cells. For meaningful analyses, such poor-quality cells should be excluded as they introduce noise in the data. We introduce SkewC, a quality-assessment tool, to identify skewed cells in scRNA-seq experiments. The tool’s methodology is based on the assessment of gene coverage for each cell, and its skewness as a quality measure; the gene body coverage is a unique characteristic for each protocol, and different protocols yield highly different coverage profiles. This tool is designed to avoid misclustering or false clusters by identifying, isolating, and removing cells with skewed gene body coverage profiles. SkewC is capable of processing any type of scRNA-seq dataset, regardless of the protocol. We envision SkewC as a distinctive QC method to be incorporated into scRNA-seq QC processing to preclude the possibility of scRNA-seq data misinterpretation.
first_indexed 2024-12-24T01:22:35Z
format Article
id doaj.art-46664ab7721b452a85c7254b5d303786
institution Directory Open Access Journal
issn 2589-0042
language English
last_indexed 2024-12-24T01:22:35Z
publishDate 2022-02-01
publisher Elsevier
record_format Article
series iScience
spelling doaj.art-46664ab7721b452a85c7254b5d3037862022-12-21T17:22:36ZengElsevieriScience2589-00422022-02-01252103777SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing dataImad Abugessaisa0Akira Hasegawa1Shuhei Noguchi2Melissa Cardon3Kazuhide Watanabe4Masataka Takahashi5Harukazu Suzuki6Shintaro Katayama7Juha Kere8Takeya Kasukawa9Laboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanLaboratory for Cellular Function Conversion Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, JapanFolkhälsan Research Center, Topeliuksenkatu 20, 00250 Helsinki, Finland; Department of Biosciences and Nutrition, Karolinska Institutet, 141 83 Huddinge, Sweden; Stem Cells and Metabolism Research Program, University of Helsinki, P.O. Box 4 (Yliopistonkatu 3), Helsinki, FinlandFolkhälsan Research Center, Topeliuksenkatu 20, 00250 Helsinki, Finland; Department of Biosciences and Nutrition, Karolinska Institutet, 141 83 Huddinge, Sweden; Stem Cells and Metabolism Research Program, University of Helsinki, P.O. Box 4 (Yliopistonkatu 3), Helsinki, Finland; Corresponding authorLaboratory for Large-Scale Biomedical Data Technology, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, 230-0045, Japan; Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan; Corresponding authorSummary: The analysis and interpretation of single-cell RNA sequencing (scRNA-seq) experiments are compromised by the presence of poor-quality cells. For meaningful analyses, such poor-quality cells should be excluded as they introduce noise in the data. We introduce SkewC, a quality-assessment tool, to identify skewed cells in scRNA-seq experiments. The tool’s methodology is based on the assessment of gene coverage for each cell, and its skewness as a quality measure; the gene body coverage is a unique characteristic for each protocol, and different protocols yield highly different coverage profiles. This tool is designed to avoid misclustering or false clusters by identifying, isolating, and removing cells with skewed gene body coverage profiles. SkewC is capable of processing any type of scRNA-seq dataset, regardless of the protocol. We envision SkewC as a distinctive QC method to be incorporated into scRNA-seq QC processing to preclude the possibility of scRNA-seq data misinterpretation.http://www.sciencedirect.com/science/article/pii/S2589004222000475Biological sciencesCell biologyBiocomputational methodBiological sciences research methodologies
spellingShingle Imad Abugessaisa
Akira Hasegawa
Shuhei Noguchi
Melissa Cardon
Kazuhide Watanabe
Masataka Takahashi
Harukazu Suzuki
Shintaro Katayama
Juha Kere
Takeya Kasukawa
SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
iScience
Biological sciences
Cell biology
Biocomputational method
Biological sciences research methodologies
title SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
title_full SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
title_fullStr SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
title_full_unstemmed SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
title_short SkewC: Identifying cells with skewed gene body coverage in single-cell RNA sequencing data
title_sort skewc identifying cells with skewed gene body coverage in single cell rna sequencing data
topic Biological sciences
Cell biology
Biocomputational method
Biological sciences research methodologies
url http://www.sciencedirect.com/science/article/pii/S2589004222000475
work_keys_str_mv AT imadabugessaisa skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT akirahasegawa skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT shuheinoguchi skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT melissacardon skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT kazuhidewatanabe skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT masatakatakahashi skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT harukazusuzuki skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT shintarokatayama skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT juhakere skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata
AT takeyakasukawa skewcidentifyingcellswithskewedgenebodycoverageinsinglecellrnasequencingdata