sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.

Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as...

Full description

Bibliographic Details
Main Authors: Nadezhda M Belonogova, Gulnara R Svishcheva, Anatoly V Kirichenko, Irina V Zorkoltseva, Yakov A Tsepilov, Tatiana I Axenovich
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2022-06-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1010172
_version_ 1828787959744167936
author Nadezhda M Belonogova
Gulnara R Svishcheva
Anatoly V Kirichenko
Irina V Zorkoltseva
Yakov A Tsepilov
Tatiana I Axenovich
author_facet Nadezhda M Belonogova
Gulnara R Svishcheva
Anatoly V Kirichenko
Irina V Zorkoltseva
Yakov A Tsepilov
Tatiana I Axenovich
author_sort Nadezhda M Belonogova
collection DOAJ
description Gene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.
first_indexed 2024-12-12T00:42:42Z
format Article
id doaj.art-ab3673d81b7646c2a5f6ffc8ded982bd
institution Directory Open Access Journal
issn 1553-734X
1553-7358
language English
last_indexed 2024-12-12T00:42:42Z
publishDate 2022-06-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj.art-ab3673d81b7646c2a5f6ffc8ded982bd2022-12-22T00:44:13ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582022-06-01186e101017210.1371/journal.pcbi.1010172sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.Nadezhda M BelonogovaGulnara R SvishchevaAnatoly V KirichenkoIrina V ZorkoltsevaYakov A TsepilovTatiana I AxenovichGene-based association analysis is an effective gene-mapping tool. Many gene-based methods have been proposed recently. However, their power depends on the underlying genetic architecture, which is rarely known in complex traits, and so it is likely that a combination of such methods could serve as a universal approach. Several frameworks combining different gene-based methods have been developed. However, they all imply a fixed set of methods, weights and functional annotations. Moreover, most of them use individual phenotypes and genotypes as input data. Here, we introduce sumSTAAR, a framework for gene-based association analysis using summary statistics obtained from genome-wide association studies (GWAS). It is an extended and modified version of STAAR framework proposed by Li and colleagues in 2020. The sumSTAAR framework offers a wider range of gene-based methods to combine. It allows the user to arbitrarily define a set of these methods, weighting functions and probabilities of genetic variants being causal. The methods used in the framework were adapted to analyse genes with large number of SNPs to decrease the running time. The framework includes the polygene pruning procedure to guard against the influence of the strong GWAS signals outside the gene. We also present new improved matrices of correlations between the genotypes of variants within genes. These matrices estimated on a sample of 265,000 individuals are a state-of-the-art replacement of widely used matrices based on the 1000 Genomes Project data.https://doi.org/10.1371/journal.pcbi.1010172
spellingShingle Nadezhda M Belonogova
Gulnara R Svishcheva
Anatoly V Kirichenko
Irina V Zorkoltseva
Yakov A Tsepilov
Tatiana I Axenovich
sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
PLoS Computational Biology
title sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
title_full sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
title_fullStr sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
title_full_unstemmed sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
title_short sumSTAAR: A flexible framework for gene-based association studies using GWAS summary statistics.
title_sort sumstaar a flexible framework for gene based association studies using gwas summary statistics
url https://doi.org/10.1371/journal.pcbi.1010172
work_keys_str_mv AT nadezhdambelonogova sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics
AT gulnararsvishcheva sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics
AT anatolyvkirichenko sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics
AT irinavzorkoltseva sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics
AT yakovatsepilov sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics
AT tatianaiaxenovich sumstaaraflexibleframeworkforgenebasedassociationstudiesusinggwassummarystatistics