A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing
Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide ass...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2015-04-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | http://journal.frontiersin.org/Journal/10.3389/fgene.2015.00149/full |
_version_ | 1818541169387241472 |
---|---|
author | Qian eWang Qiongshi eLu Hongyu eZhao Hongyu eZhao Hongyu eZhao |
author_facet | Qian eWang Qiongshi eLu Hongyu eZhao Hongyu eZhao Hongyu eZhao |
author_sort | Qian eWang |
collection | DOAJ |
description | Results from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide association studies (GWAS) in the past 10 years, it is challenging to interpret these results as most disease-associated markers have no clear functional roles in disease etiology, and all the identified genomic factors only explain a small portion of disease heritability. With the help of next-generation sequencing (NGS), diverse types of genomic and epigenetic variations can be detected with high accuracy. More importantly, instead of using linkage disequilibrium to detect association signals based on a set of pre-set probes, NGS allows researchers to directly study all the variants in each individual, therefore promises opportunities for identifying functional variants and a more comprehensive dissection of disease heritability. Although the current scale of NGS studies is still limited due to the high cost, the success of several recent studies suggests the great potential for applying NGS in genomic epidemiology, especially as the cost of sequencing continues to drop. In this review, we discuss several pioneer applications of NGS, summarize scientific discoveries for rare and complex diseases, and compare various study designs including targeted sequencing and whole-genome sequencing using population-based and family-based cohorts. Finally, we highlight recent advancements in statistical methods proposed for sequencing analysis, including group-based association tests, meta-analysis techniques, and annotation tools for variant prioritization. |
first_indexed | 2024-12-11T22:05:03Z |
format | Article |
id | doaj.art-376caad912cf4cafa4adeeee09d6a259 |
institution | Directory Open Access Journal |
issn | 1664-8021 |
language | English |
last_indexed | 2024-12-11T22:05:03Z |
publishDate | 2015-04-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Genetics |
spelling | doaj.art-376caad912cf4cafa4adeeee09d6a2592022-12-22T00:48:58ZengFrontiers Media S.A.Frontiers in Genetics1664-80212015-04-01610.3389/fgene.2015.00149135502A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation SequencingQian eWang0Qiongshi eLu1Hongyu eZhao2Hongyu eZhao3Hongyu eZhao4Yale UniversityYale UniversityYale UniversityYale UniversityVA Cooperative Studies Program Coordinating CenterResults from numerous linkage and association studies have greatly deepened scientists’ understanding of the genetic basis of many human diseases, yet some important questions remain unanswered. For example, although a large number of disease-associated loci have been identified from genome-wide association studies (GWAS) in the past 10 years, it is challenging to interpret these results as most disease-associated markers have no clear functional roles in disease etiology, and all the identified genomic factors only explain a small portion of disease heritability. With the help of next-generation sequencing (NGS), diverse types of genomic and epigenetic variations can be detected with high accuracy. More importantly, instead of using linkage disequilibrium to detect association signals based on a set of pre-set probes, NGS allows researchers to directly study all the variants in each individual, therefore promises opportunities for identifying functional variants and a more comprehensive dissection of disease heritability. Although the current scale of NGS studies is still limited due to the high cost, the success of several recent studies suggests the great potential for applying NGS in genomic epidemiology, especially as the cost of sequencing continues to drop. In this review, we discuss several pioneer applications of NGS, summarize scientific discoveries for rare and complex diseases, and compare various study designs including targeted sequencing and whole-genome sequencing using population-based and family-based cohorts. Finally, we highlight recent advancements in statistical methods proposed for sequencing analysis, including group-based association tests, meta-analysis techniques, and annotation tools for variant prioritization.http://journal.frontiersin.org/Journal/10.3389/fgene.2015.00149/fullNext-generation sequencingstatistical methodsStudy DesignGenomic epidemiologyGenetic etiology |
spellingShingle | Qian eWang Qiongshi eLu Hongyu eZhao Hongyu eZhao Hongyu eZhao A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing Frontiers in Genetics Next-generation sequencing statistical methods Study Design Genomic epidemiology Genetic etiology |
title | A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing |
title_full | A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing |
title_fullStr | A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing |
title_full_unstemmed | A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing |
title_short | A Review of Study Designs and Statistical Methods for Genomic Epidemiology Studies using Next Generation Sequencing |
title_sort | review of study designs and statistical methods for genomic epidemiology studies using next generation sequencing |
topic | Next-generation sequencing statistical methods Study Design Genomic epidemiology Genetic etiology |
url | http://journal.frontiersin.org/Journal/10.3389/fgene.2015.00149/full |
work_keys_str_mv | AT qianewang areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT qiongshielu areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao areviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT qianewang reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT qiongshielu reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing AT hongyuezhao reviewofstudydesignsandstatisticalmethodsforgenomicepidemiologystudiesusingnextgenerationsequencing |