EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.

More than a decade of genome-wide association studies (GWASs) have identified genetic risk variants that are significantly associated with complex traits. Emerging evidence suggests that the function of trait-associated variants likely acts in a tissue- or cell-type-specific fashion. Yet, it remains...

Full description

Bibliographic Details
Main Authors: Rujin Wang, Dan-Yu Lin, Yuchao Jiang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2022-06-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1010251
_version_ 1797904257421148160
author Rujin Wang
Dan-Yu Lin
Yuchao Jiang
author_facet Rujin Wang
Dan-Yu Lin
Yuchao Jiang
author_sort Rujin Wang
collection DOAJ
description More than a decade of genome-wide association studies (GWASs) have identified genetic risk variants that are significantly associated with complex traits. Emerging evidence suggests that the function of trait-associated variants likely acts in a tissue- or cell-type-specific fashion. Yet, it remains challenging to prioritize trait-relevant tissues or cell types to elucidate disease etiology. Here, we present EPIC (cEll tyPe enrIChment), a statistical framework that relates large-scale GWAS summary statistics to cell-type-specific gene expression measurements from single-cell RNA sequencing (scRNA-seq). We derive powerful gene-level test statistics for common and rare variants, separately and jointly, and adopt generalized least squares to prioritize trait-relevant cell types while accounting for the correlation structures both within and between genes. Using enrichment of loci associated with four lipid traits in the liver and enrichment of loci associated with three neurological disorders in the brain as ground truths, we show that EPIC outperforms existing methods. We apply our framework to multiple scRNA-seq datasets from different platforms and identify cell types underlying type 2 diabetes and schizophrenia. The enrichment is replicated using independent GWAS and scRNA-seq datasets and further validated using PubMed search and existing bulk case-control testing results.
first_indexed 2024-04-10T09:46:01Z
format Article
id doaj.art-fb88893e30904d43972e4acf7f9b0e88
institution Directory Open Access Journal
issn 1553-7390
1553-7404
language English
last_indexed 2024-04-10T09:46:01Z
publishDate 2022-06-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj.art-fb88893e30904d43972e4acf7f9b0e882023-02-17T05:32:10ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042022-06-01186e101025110.1371/journal.pgen.1010251EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.Rujin WangDan-Yu LinYuchao JiangMore than a decade of genome-wide association studies (GWASs) have identified genetic risk variants that are significantly associated with complex traits. Emerging evidence suggests that the function of trait-associated variants likely acts in a tissue- or cell-type-specific fashion. Yet, it remains challenging to prioritize trait-relevant tissues or cell types to elucidate disease etiology. Here, we present EPIC (cEll tyPe enrIChment), a statistical framework that relates large-scale GWAS summary statistics to cell-type-specific gene expression measurements from single-cell RNA sequencing (scRNA-seq). We derive powerful gene-level test statistics for common and rare variants, separately and jointly, and adopt generalized least squares to prioritize trait-relevant cell types while accounting for the correlation structures both within and between genes. Using enrichment of loci associated with four lipid traits in the liver and enrichment of loci associated with three neurological disorders in the brain as ground truths, we show that EPIC outperforms existing methods. We apply our framework to multiple scRNA-seq datasets from different platforms and identify cell types underlying type 2 diabetes and schizophrenia. The enrichment is replicated using independent GWAS and scRNA-seq datasets and further validated using PubMed search and existing bulk case-control testing results.https://doi.org/10.1371/journal.pgen.1010251
spellingShingle Rujin Wang
Dan-Yu Lin
Yuchao Jiang
EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
PLoS Genetics
title EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
title_full EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
title_fullStr EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
title_full_unstemmed EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
title_short EPIC: Inferring relevant cell types for complex traits by integrating genome-wide association studies and single-cell RNA sequencing.
title_sort epic inferring relevant cell types for complex traits by integrating genome wide association studies and single cell rna sequencing
url https://doi.org/10.1371/journal.pgen.1010251
work_keys_str_mv AT rujinwang epicinferringrelevantcelltypesforcomplextraitsbyintegratinggenomewideassociationstudiesandsinglecellrnasequencing
AT danyulin epicinferringrelevantcelltypesforcomplextraitsbyintegratinggenomewideassociationstudiesandsinglecellrnasequencing
AT yuchaojiang epicinferringrelevantcelltypesforcomplextraitsbyintegratinggenomewideassociationstudiesandsinglecellrnasequencing