Incorporating methylation genome information improves prediction accuracy for drug treatment responses

Abstract Background An accumulation of evidence has revealed the important role of epigenetic factors in explaining the etiopathogenesis of human diseases. Several empirical studies have successfully incorporated methylation data into models for disease prediction. However, it is still a challenge t...

Full description

Bibliographic Details
Main Authors: Xiaoxuan Xia, Haoyi Weng, Ruoting Men, Rui Sun, Benny Chung Ying Zee, Ka Chun Chong, Maggie Haitian Wang
Format: Article
Language:English
Published: BMC 2018-09-01
Series:BMC Genetics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12863-018-0644-5
_version_ 1818538975846989824
author Xiaoxuan Xia
Haoyi Weng
Ruoting Men
Rui Sun
Benny Chung Ying Zee
Ka Chun Chong
Maggie Haitian Wang
author_facet Xiaoxuan Xia
Haoyi Weng
Ruoting Men
Rui Sun
Benny Chung Ying Zee
Ka Chun Chong
Maggie Haitian Wang
author_sort Xiaoxuan Xia
collection DOAJ
description Abstract Background An accumulation of evidence has revealed the important role of epigenetic factors in explaining the etiopathogenesis of human diseases. Several empirical studies have successfully incorporated methylation data into models for disease prediction. However, it is still a challenge to integrate different types of omics data into prediction models, and the contribution of methylation information to prediction remains to be fully clarified. Results A stratified drug-response prediction model was built based on an artificial neural network to predict the change in the circulating triglyceride level after fenofibrate intervention. Associated single-nucleotide polymorphisms (SNPs), methylation of selected cytosine-phosphate-guanine (CpG) sites, age, sex, and smoking status, were included as predictors. The model with selected SNPs achieved a mean 5-fold cross-validation prediction error rate of 43.65%. After adding methylation information into the model, the error rate dropped to 41.92%. The combination of significant SNPs, CpG sites, age, sex, and smoking status, achieved the lowest prediction error rate of 41.54%. Conclusions Compared to using SNP data only, adding methylation data in prediction models slightly improved the error rate; further prediction error reduction is achieved by a combination of genome, methylation genome, and environmental factors.
first_indexed 2024-12-11T21:35:57Z
format Article
id doaj.art-884aa8ac31be49c9bfd5065ae2ecdc70
institution Directory Open Access Journal
issn 1471-2156
language English
last_indexed 2024-12-11T21:35:57Z
publishDate 2018-09-01
publisher BMC
record_format Article
series BMC Genetics
spelling doaj.art-884aa8ac31be49c9bfd5065ae2ecdc702022-12-22T00:50:00ZengBMCBMC Genetics1471-21562018-09-0119S1677110.1186/s12863-018-0644-5Incorporating methylation genome information improves prediction accuracy for drug treatment responsesXiaoxuan Xia0Haoyi Weng1Ruoting Men2Rui Sun3Benny Chung Ying Zee4Ka Chun Chong5Maggie Haitian Wang6Division of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongDivision of Biostatistics, Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, The Chinese University of Hong KongAbstract Background An accumulation of evidence has revealed the important role of epigenetic factors in explaining the etiopathogenesis of human diseases. Several empirical studies have successfully incorporated methylation data into models for disease prediction. However, it is still a challenge to integrate different types of omics data into prediction models, and the contribution of methylation information to prediction remains to be fully clarified. Results A stratified drug-response prediction model was built based on an artificial neural network to predict the change in the circulating triglyceride level after fenofibrate intervention. Associated single-nucleotide polymorphisms (SNPs), methylation of selected cytosine-phosphate-guanine (CpG) sites, age, sex, and smoking status, were included as predictors. The model with selected SNPs achieved a mean 5-fold cross-validation prediction error rate of 43.65%. After adding methylation information into the model, the error rate dropped to 41.92%. The combination of significant SNPs, CpG sites, age, sex, and smoking status, achieved the lowest prediction error rate of 41.54%. Conclusions Compared to using SNP data only, adding methylation data in prediction models slightly improved the error rate; further prediction error reduction is achieved by a combination of genome, methylation genome, and environmental factors.http://link.springer.com/article/10.1186/s12863-018-0644-5MethylationPredictionSNPsNeural networkTreatment responses
spellingShingle Xiaoxuan Xia
Haoyi Weng
Ruoting Men
Rui Sun
Benny Chung Ying Zee
Ka Chun Chong
Maggie Haitian Wang
Incorporating methylation genome information improves prediction accuracy for drug treatment responses
BMC Genetics
Methylation
Prediction
SNPs
Neural network
Treatment responses
title Incorporating methylation genome information improves prediction accuracy for drug treatment responses
title_full Incorporating methylation genome information improves prediction accuracy for drug treatment responses
title_fullStr Incorporating methylation genome information improves prediction accuracy for drug treatment responses
title_full_unstemmed Incorporating methylation genome information improves prediction accuracy for drug treatment responses
title_short Incorporating methylation genome information improves prediction accuracy for drug treatment responses
title_sort incorporating methylation genome information improves prediction accuracy for drug treatment responses
topic Methylation
Prediction
SNPs
Neural network
Treatment responses
url http://link.springer.com/article/10.1186/s12863-018-0644-5
work_keys_str_mv AT xiaoxuanxia incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT haoyiweng incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT ruotingmen incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT ruisun incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT bennychungyingzee incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT kachunchong incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses
AT maggiehaitianwang incorporatingmethylationgenomeinformationimprovespredictionaccuracyfordrugtreatmentresponses