A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.

The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) t...

Full description

Bibliographic Details
Main Authors: Ming Fang, Weixuan Fu, Dan Jiang, Qin Zhang, Dongxiao Sun, Xiangdong Ding, Jianfeng Liu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4141689?pdf=render
_version_ 1819057147929952256
author Ming Fang
Weixuan Fu
Dan Jiang
Qin Zhang
Dongxiao Sun
Xiangdong Ding
Jianfeng Liu
author_facet Ming Fang
Weixuan Fu
Dan Jiang
Qin Zhang
Dongxiao Sun
Xiangdong Ding
Jianfeng Liu
author_sort Ming Fang
collection DOAJ
description The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.
first_indexed 2024-12-21T13:34:42Z
format Article
id doaj.art-e749340f591d4afe976fabcebd0cdaca
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-12-21T13:34:42Z
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-e749340f591d4afe976fabcebd0cdaca2022-12-21T19:02:12ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0198e9954410.1371/journal.pone.0099544A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.Ming FangWeixuan FuDan JiangQin ZhangDongxiao SunXiangdong DingJianfeng LiuThe multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.http://europepmc.org/articles/PMC4141689?pdf=render
spellingShingle Ming Fang
Weixuan Fu
Dan Jiang
Qin Zhang
Dongxiao Sun
Xiangdong Ding
Jianfeng Liu
A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
PLoS ONE
title A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
title_full A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
title_fullStr A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
title_full_unstemmed A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
title_short A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
title_sort multiple snp approach for genome wide association study of milk production traits in chinese holstein cattle
url http://europepmc.org/articles/PMC4141689?pdf=render
work_keys_str_mv AT mingfang amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT weixuanfu amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT danjiang amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT qinzhang amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT dongxiaosun amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT xiangdongding amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT jianfengliu amultiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT mingfang multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT weixuanfu multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT danjiang multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT qinzhang multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT dongxiaosun multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT xiangdongding multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle
AT jianfengliu multiplesnpapproachforgenomewideassociationstudyofmilkproductiontraitsinchineseholsteincattle