Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements

Abstract Large-scale longitudinal biobank data can be leveraged to identify genetic variation contributing to human diseases progression and traits trajectories. While methods for genome-wide association studies (GWAS) of multiple correlated traits have been proposed, an efficient multiple-trait app...

Full description

Bibliographic Details
Main Authors: Gamal Abdel-Azim, Parth Patel, Shuwei Li, Shicheng Guo, Mary Helen Black
Format: Article
Language:English
Published: Nature Portfolio 2023-11-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-47555-1
_version_ 1827634700971147264
author Gamal Abdel-Azim
Parth Patel
Shuwei Li
Shicheng Guo
Mary Helen Black
author_facet Gamal Abdel-Azim
Parth Patel
Shuwei Li
Shicheng Guo
Mary Helen Black
author_sort Gamal Abdel-Azim
collection DOAJ
description Abstract Large-scale longitudinal biobank data can be leveraged to identify genetic variation contributing to human diseases progression and traits trajectories. While methods for genome-wide association studies (GWAS) of multiple correlated traits have been proposed, an efficient multiple-trait approach to model longitudinal phenotypes is not currently available. We developed GAMUT, a genome-wide association approach for multiple longitudinal traits. GAMUT employs a mixed-effects model to fit longitudinal outcomes where a fast algorithm for inversion by recursive partitioning of the random effects submatrix is introduced. To evaluate performance of the algorithms introduced and assess their statistical power and type I error, stochastic simulation was conducted. Consistent with our expectation, power was greater for cross-sectional (CS) than longitudinal (LT) effects, particularly with a diminishing LT/CS ratio. With a minimum minor allele count of 3 within genotype by time categories, observed type I error was roughly equal to theoretical genome-wide significance. Additionally, 28 blood-based biomarkers measured at 2 time points on participants of the UK Biobank were used to compare GAMUT against single-trait standard and longitudinal GWAS (including rate of change). Across all biomarkers, we observed 539 (CS) and 248 (LT) significant independent variants for the GAMUT method, and 513 (CS) and 30 (LT) for single-trait longitudinal GWAS, respectively. Only 37 variants were identified by modeling rates of change using standard GWAS.
first_indexed 2024-03-09T15:17:00Z
format Article
id doaj.art-cef29a4dee4d45d38089744c812a1443
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-09T15:17:00Z
publishDate 2023-11-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-cef29a4dee4d45d38089744c812a14432023-11-26T13:03:02ZengNature PortfolioScientific Reports2045-23222023-11-0113111510.1038/s41598-023-47555-1Fast multiple-trait genome-wide association analysis for correlated longitudinal measurementsGamal Abdel-Azim0Parth Patel1Shuwei Li2Shicheng Guo3Mary Helen Black4Janssen Res. & Dev. (Johnson & Johnson)Janssen Res. & Dev. (Johnson & Johnson)Janssen Res. & Dev. (Johnson & Johnson)Janssen Res. & Dev. (Johnson & Johnson)Janssen Res. & Dev. (Johnson & Johnson)Abstract Large-scale longitudinal biobank data can be leveraged to identify genetic variation contributing to human diseases progression and traits trajectories. While methods for genome-wide association studies (GWAS) of multiple correlated traits have been proposed, an efficient multiple-trait approach to model longitudinal phenotypes is not currently available. We developed GAMUT, a genome-wide association approach for multiple longitudinal traits. GAMUT employs a mixed-effects model to fit longitudinal outcomes where a fast algorithm for inversion by recursive partitioning of the random effects submatrix is introduced. To evaluate performance of the algorithms introduced and assess their statistical power and type I error, stochastic simulation was conducted. Consistent with our expectation, power was greater for cross-sectional (CS) than longitudinal (LT) effects, particularly with a diminishing LT/CS ratio. With a minimum minor allele count of 3 within genotype by time categories, observed type I error was roughly equal to theoretical genome-wide significance. Additionally, 28 blood-based biomarkers measured at 2 time points on participants of the UK Biobank were used to compare GAMUT against single-trait standard and longitudinal GWAS (including rate of change). Across all biomarkers, we observed 539 (CS) and 248 (LT) significant independent variants for the GAMUT method, and 513 (CS) and 30 (LT) for single-trait longitudinal GWAS, respectively. Only 37 variants were identified by modeling rates of change using standard GWAS.https://doi.org/10.1038/s41598-023-47555-1
spellingShingle Gamal Abdel-Azim
Parth Patel
Shuwei Li
Shicheng Guo
Mary Helen Black
Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
Scientific Reports
title Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
title_full Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
title_fullStr Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
title_full_unstemmed Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
title_short Fast multiple-trait genome-wide association analysis for correlated longitudinal measurements
title_sort fast multiple trait genome wide association analysis for correlated longitudinal measurements
url https://doi.org/10.1038/s41598-023-47555-1
work_keys_str_mv AT gamalabdelazim fastmultipletraitgenomewideassociationanalysisforcorrelatedlongitudinalmeasurements
AT parthpatel fastmultipletraitgenomewideassociationanalysisforcorrelatedlongitudinalmeasurements
AT shuweili fastmultipletraitgenomewideassociationanalysisforcorrelatedlongitudinalmeasurements
AT shichengguo fastmultipletraitgenomewideassociationanalysisforcorrelatedlongitudinalmeasurements
AT maryhelenblack fastmultipletraitgenomewideassociationanalysisforcorrelatedlongitudinalmeasurements