Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study

A popular approach to reduce the high dimensionality resulting from genome-wide association studies is to analyze a whole pathway in a single test for association with a phenotype. Kernel machine regression (KMR) is a highly flexible pathway analysis approach. Initially, KMR was developed to analyze...

Full description

Bibliographic Details
Main Authors: Bernadette Wendel, Markus Heidenreich, Monika Budde, Maria Heilbronner, Mojtaba Oraki Kohshour, Sergi Papiol, Peter Falkai, Thomas G. Schulze, Urs Heilbronner, Heike Bickeböller
Format: Article
Language:English
Published: Frontiers Media S.A. 2022-12-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fgene.2022.1015885/full
_version_ 1811178070997991424
author Bernadette Wendel
Markus Heidenreich
Monika Budde
Maria Heilbronner
Mojtaba Oraki Kohshour
Sergi Papiol
Sergi Papiol
Peter Falkai
Thomas G. Schulze
Thomas G. Schulze
Thomas G. Schulze
Urs Heilbronner
Heike Bickeböller
author_facet Bernadette Wendel
Markus Heidenreich
Monika Budde
Maria Heilbronner
Mojtaba Oraki Kohshour
Sergi Papiol
Sergi Papiol
Peter Falkai
Thomas G. Schulze
Thomas G. Schulze
Thomas G. Schulze
Urs Heilbronner
Heike Bickeböller
author_sort Bernadette Wendel
collection DOAJ
description A popular approach to reduce the high dimensionality resulting from genome-wide association studies is to analyze a whole pathway in a single test for association with a phenotype. Kernel machine regression (KMR) is a highly flexible pathway analysis approach. Initially, KMR was developed to analyze a simple phenotype with just one measurement per individual. Recently, however, the investigation into the influence of genomic factors in the development of disease-related phenotypes across time (trajectories) has gained in importance. Thus, novel statistical approaches for KMR analyzing longitudinal data, i.e. several measurements at specific time points per individual are required. For longitudinal pathway analysis, we extend KMR to long-KMR using the estimation equivalence of KMR and linear mixed models. We include additional random effects to correct for the dependence structure. Moreover, within long-KMR we created a topology-based pathway analysis by combining this approach with a kernel including network information of the pathway. Most importantly, long-KMR not only allows for the investigation of the main genetic effect adjusting for time dependencies within an individual, but it also allows to test for the association of the pathway with the longitudinal course of the phenotype in the form of testing the genetic time-interaction effect. The approach is implemented as an R package, kalpra. Our simulation study demonstrates that the power of long-KMR exceeded that of another KMR method previously developed to analyze longitudinal data, while maintaining (slightly conservatively) the type I error. The network kernel improved the performance of long-KMR compared to the linear kernel. Considering different pathway densities, the power of the network kernel decreased with increasing pathway density. We applied long-KMR to cognitive data on executive function (Trail Making Test, part B) from the PsyCourse Study and 17 candidate pathways selected from Reactome. We identified seven nominally significant pathways.
first_indexed 2024-04-11T06:12:15Z
format Article
id doaj.art-ab83f50b8d5948b1a5e2600a3b285a2f
institution Directory Open Access Journal
issn 1664-8021
language English
last_indexed 2024-04-11T06:12:15Z
publishDate 2022-12-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Genetics
spelling doaj.art-ab83f50b8d5948b1a5e2600a3b285a2f2022-12-22T04:41:11ZengFrontiers Media S.A.Frontiers in Genetics1664-80212022-12-011310.3389/fgene.2022.10158851015885Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse StudyBernadette Wendel0Markus Heidenreich1Monika Budde2Maria Heilbronner3Mojtaba Oraki Kohshour4Sergi Papiol5Sergi Papiol6Peter Falkai7Thomas G. Schulze8Thomas G. Schulze9Thomas G. Schulze10Urs Heilbronner11Heike Bickeböller12Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, GermanyDepartment of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, GermanyInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyDepartment of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, GermanyDepartment of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, GermanyInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyDepartment of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, United StatesDepartment of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, United StatesInstitute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, GermanyDepartment of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, GermanyA popular approach to reduce the high dimensionality resulting from genome-wide association studies is to analyze a whole pathway in a single test for association with a phenotype. Kernel machine regression (KMR) is a highly flexible pathway analysis approach. Initially, KMR was developed to analyze a simple phenotype with just one measurement per individual. Recently, however, the investigation into the influence of genomic factors in the development of disease-related phenotypes across time (trajectories) has gained in importance. Thus, novel statistical approaches for KMR analyzing longitudinal data, i.e. several measurements at specific time points per individual are required. For longitudinal pathway analysis, we extend KMR to long-KMR using the estimation equivalence of KMR and linear mixed models. We include additional random effects to correct for the dependence structure. Moreover, within long-KMR we created a topology-based pathway analysis by combining this approach with a kernel including network information of the pathway. Most importantly, long-KMR not only allows for the investigation of the main genetic effect adjusting for time dependencies within an individual, but it also allows to test for the association of the pathway with the longitudinal course of the phenotype in the form of testing the genetic time-interaction effect. The approach is implemented as an R package, kalpra. Our simulation study demonstrates that the power of long-KMR exceeded that of another KMR method previously developed to analyze longitudinal data, while maintaining (slightly conservatively) the type I error. The network kernel improved the performance of long-KMR compared to the linear kernel. Considering different pathway densities, the power of the network kernel decreased with increasing pathway density. We applied long-KMR to cognitive data on executive function (Trail Making Test, part B) from the PsyCourse Study and 17 candidate pathways selected from Reactome. We identified seven nominally significant pathways.https://www.frontiersin.org/articles/10.3389/fgene.2022.1015885/fullpathway analysiskernel machine regressionlongitudinal datanetworkPsyCourse Study
spellingShingle Bernadette Wendel
Markus Heidenreich
Monika Budde
Maria Heilbronner
Mojtaba Oraki Kohshour
Sergi Papiol
Sergi Papiol
Peter Falkai
Thomas G. Schulze
Thomas G. Schulze
Thomas G. Schulze
Urs Heilbronner
Heike Bickeböller
Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
Frontiers in Genetics
pathway analysis
kernel machine regression
longitudinal data
network
PsyCourse Study
title Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
title_full Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
title_fullStr Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
title_full_unstemmed Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
title_short Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study
title_sort kalpra a kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal psycourse study
topic pathway analysis
kernel machine regression
longitudinal data
network
PsyCourse Study
url https://www.frontiersin.org/articles/10.3389/fgene.2022.1015885/full
work_keys_str_mv AT bernadettewendel kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT markusheidenreich kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT monikabudde kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT mariaheilbronner kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT mojtabaorakikohshour kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT sergipapiol kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT sergipapiol kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT peterfalkai kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT thomasgschulze kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT thomasgschulze kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT thomasgschulze kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT ursheilbronner kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy
AT heikebickeboller kalpraakernelapproachforlongitudinalpathwayregressionanalysisintegratingnetworkinformationwithanapplicationtothelongitudinalpsycoursestudy