Multi-PGS enhances polygenic prediction by combining 937 polygenic scores

Abstract The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated p...

Full description

Bibliographic Details
Main Authors: Clara Albiñana, Zhihong Zhu, Andrew J. Schork, Andrés Ingason, Hugues Aschard, Isabell Brikell, Cynthia M. Bulik, Liselotte V. Petersen, Esben Agerbo, Jakob Grove, Merete Nordentoft, David M. Hougaard, Thomas Werge, Anders D. Børglum, Preben Bo Mortensen, John J. McGrath, Benjamin M. Neale, Florian Privé, Bjarni J. Vilhjálmsson
Format: Article
Language:English
Published: Nature Portfolio 2023-08-01
Series:Nature Communications
Online Access:https://doi.org/10.1038/s41467-023-40330-w
_version_ 1827710255976415232
author Clara Albiñana
Zhihong Zhu
Andrew J. Schork
Andrés Ingason
Hugues Aschard
Isabell Brikell
Cynthia M. Bulik
Liselotte V. Petersen
Esben Agerbo
Jakob Grove
Merete Nordentoft
David M. Hougaard
Thomas Werge
Anders D. Børglum
Preben Bo Mortensen
John J. McGrath
Benjamin M. Neale
Florian Privé
Bjarni J. Vilhjálmsson
author_facet Clara Albiñana
Zhihong Zhu
Andrew J. Schork
Andrés Ingason
Hugues Aschard
Isabell Brikell
Cynthia M. Bulik
Liselotte V. Petersen
Esben Agerbo
Jakob Grove
Merete Nordentoft
David M. Hougaard
Thomas Werge
Anders D. Børglum
Preben Bo Mortensen
John J. McGrath
Benjamin M. Neale
Florian Privé
Bjarni J. Vilhjálmsson
author_sort Clara Albiñana
collection DOAJ
description Abstract The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes. We propose a framework to generate multi-PGS from thousands of publicly available genome-wide association studies (GWAS) with no need to individually select the most relevant ones. In this study, the multi-PGS framework increases prediction accuracy over single PGS for all included psychiatric disorders and other available outcomes, with prediction R 2 increases of up to 9-fold for attention-deficit/hyperactivity disorder compared to a single PGS. We also generate multi-PGS for phenotypes without an existing GWAS and for case-case predictions. We benchmark the multi-PGS framework against other methods and highlight its potential application to new emerging biobanks.
first_indexed 2024-03-10T17:35:29Z
format Article
id doaj.art-65c6fd7da745423fbb7f56199da61c83
institution Directory Open Access Journal
issn 2041-1723
language English
last_indexed 2024-03-10T17:35:29Z
publishDate 2023-08-01
publisher Nature Portfolio
record_format Article
series Nature Communications
spelling doaj.art-65c6fd7da745423fbb7f56199da61c832023-11-20T09:52:23ZengNature PortfolioNature Communications2041-17232023-08-0114111110.1038/s41467-023-40330-wMulti-PGS enhances polygenic prediction by combining 937 polygenic scoresClara Albiñana0Zhihong Zhu1Andrew J. Schork2Andrés Ingason3Hugues Aschard4Isabell Brikell5Cynthia M. Bulik6Liselotte V. Petersen7Esben Agerbo8Jakob Grove9Merete Nordentoft10David M. Hougaard11Thomas Werge12Anders D. Børglum13Preben Bo Mortensen14John J. McGrath15Benjamin M. Neale16Florian Privé17Bjarni J. Vilhjálmsson18The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHNational Centre for Register-Based Research, Aarhus UniversityThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHDepartment of Computational Biology, Institut Pasteur, Université de ParisThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHDepartment of Medical Epidemiology and Biostatistics, Karolinska InstituteThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHNational Centre for Register-Based Research, Aarhus UniversityAnalytic and Translational Genetics Unit, Massachusetts General HospitalThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHThe Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCHAbstract The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes. We propose a framework to generate multi-PGS from thousands of publicly available genome-wide association studies (GWAS) with no need to individually select the most relevant ones. In this study, the multi-PGS framework increases prediction accuracy over single PGS for all included psychiatric disorders and other available outcomes, with prediction R 2 increases of up to 9-fold for attention-deficit/hyperactivity disorder compared to a single PGS. We also generate multi-PGS for phenotypes without an existing GWAS and for case-case predictions. We benchmark the multi-PGS framework against other methods and highlight its potential application to new emerging biobanks.https://doi.org/10.1038/s41467-023-40330-w
spellingShingle Clara Albiñana
Zhihong Zhu
Andrew J. Schork
Andrés Ingason
Hugues Aschard
Isabell Brikell
Cynthia M. Bulik
Liselotte V. Petersen
Esben Agerbo
Jakob Grove
Merete Nordentoft
David M. Hougaard
Thomas Werge
Anders D. Børglum
Preben Bo Mortensen
John J. McGrath
Benjamin M. Neale
Florian Privé
Bjarni J. Vilhjálmsson
Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
Nature Communications
title Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
title_full Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
title_fullStr Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
title_full_unstemmed Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
title_short Multi-PGS enhances polygenic prediction by combining 937 polygenic scores
title_sort multi pgs enhances polygenic prediction by combining 937 polygenic scores
url https://doi.org/10.1038/s41467-023-40330-w
work_keys_str_mv AT claraalbinana multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT zhihongzhu multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT andrewjschork multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT andresingason multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT huguesaschard multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT isabellbrikell multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT cynthiambulik multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT liselottevpetersen multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT esbenagerbo multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT jakobgrove multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT meretenordentoft multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT davidmhougaard multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT thomaswerge multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT andersdbørglum multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT prebenbomortensen multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT johnjmcgrath multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT benjaminmneale multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT florianprive multipgsenhancespolygenicpredictionbycombining937polygenicscores
AT bjarnijvilhjalmsson multipgsenhancespolygenicpredictionbycombining937polygenicscores