A Genome-Wide Association Study of Dementia Using the Electronic Medical Record
Dementia is characterized as a decline in cognitive function, including memory, language and problem-solving abilities. In this paper, we conducted a Genome-Wide Association Study (GWAS) using data from the electronic Medical Records and Genomics (eMERGE) network. This study has two aims, (1) to inv...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-02-01
|
Series: | BioMedInformatics |
Subjects: | |
Online Access: | https://www.mdpi.com/2673-7426/3/1/10 |
_version_ | 1797613261951074304 |
---|---|
author | Xiaowen Cao Yao Dong Li Xing Xuekui Zhang |
author_facet | Xiaowen Cao Yao Dong Li Xing Xuekui Zhang |
author_sort | Xiaowen Cao |
collection | DOAJ |
description | Dementia is characterized as a decline in cognitive function, including memory, language and problem-solving abilities. In this paper, we conducted a Genome-Wide Association Study (GWAS) using data from the electronic Medical Records and Genomics (eMERGE) network. This study has two aims, (1) to investigate the genetic mechanism of dementia and (2) to discuss multiple <i>p</i>-value thresholds used to address multiple testing issues. Using the genome-wide significant threshold (<inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>≤</mo><mn>5</mn><mo>×</mo><msup><mn>10</mn><mrow><mo>−</mo><mn>8</mn></mrow></msup></mrow></semantics></math></inline-formula>), we identified four SNPs. Controlling the False Positive Rate (FDR) level below <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0.05</mn></mrow></semantics></math></inline-formula> leads to one extra SNP. Five SNPs that we found are also supported by QQ-plot comparing observed <i>p</i>-values with expected <i>p</i>-values. All these five SNPs belong to the TOMM40 gene on chromosome 19. Other published studies independently validate the relationship between TOMM40 and dementia. Some published studies use a relaxed threshold (<inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>≤</mo><mn>1</mn><mo>×</mo><msup><mn>10</mn><mrow><mo>−</mo><mn>5</mn></mrow></msup></mrow></semantics></math></inline-formula>) to discover SNPs when the statistical power is insufficient. This relaxed threshold is more powerful but cannot properly control false positives in multiple testing. We identified 13 SNPs using this threshold, which led to the discovery of extra genes (such as ATP10A-DT and PTPRM). Other published studies reported these genes as related to brain development or neuro-development, indicating these genes are potential novel genes for dementia. Those novel potential loci and genes may help identify targets for developing new therapies. However, we suggest using them with caution since they are discovered without proper false positive control. |
first_indexed | 2024-03-11T06:53:23Z |
format | Article |
id | doaj.art-419c667db188440582ba16f6446ac75c |
institution | Directory Open Access Journal |
issn | 2673-7426 |
language | English |
last_indexed | 2024-03-11T06:53:23Z |
publishDate | 2023-02-01 |
publisher | MDPI AG |
record_format | Article |
series | BioMedInformatics |
spelling | doaj.art-419c667db188440582ba16f6446ac75c2023-11-17T09:48:44ZengMDPI AGBioMedInformatics2673-74262023-02-013114114910.3390/biomedinformatics3010010A Genome-Wide Association Study of Dementia Using the Electronic Medical RecordXiaowen Cao0Yao Dong1Li Xing2Xuekui Zhang3Department of Mathematics and Statistics, University of Victoria, Victoria, BC V8P 5C2, CanadaDepartment of Mathematics and Statistics, University of Victoria, Victoria, BC V8P 5C2, CanadaDepartment of Mathematics and Statistics, University of Saskatchewan, Saskatoon, SK S7N 5A2, CanadaDepartment of Mathematics and Statistics, University of Victoria, Victoria, BC V8P 5C2, CanadaDementia is characterized as a decline in cognitive function, including memory, language and problem-solving abilities. In this paper, we conducted a Genome-Wide Association Study (GWAS) using data from the electronic Medical Records and Genomics (eMERGE) network. This study has two aims, (1) to investigate the genetic mechanism of dementia and (2) to discuss multiple <i>p</i>-value thresholds used to address multiple testing issues. Using the genome-wide significant threshold (<inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>≤</mo><mn>5</mn><mo>×</mo><msup><mn>10</mn><mrow><mo>−</mo><mn>8</mn></mrow></msup></mrow></semantics></math></inline-formula>), we identified four SNPs. Controlling the False Positive Rate (FDR) level below <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0.05</mn></mrow></semantics></math></inline-formula> leads to one extra SNP. Five SNPs that we found are also supported by QQ-plot comparing observed <i>p</i>-values with expected <i>p</i>-values. All these five SNPs belong to the TOMM40 gene on chromosome 19. Other published studies independently validate the relationship between TOMM40 and dementia. Some published studies use a relaxed threshold (<inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>≤</mo><mn>1</mn><mo>×</mo><msup><mn>10</mn><mrow><mo>−</mo><mn>5</mn></mrow></msup></mrow></semantics></math></inline-formula>) to discover SNPs when the statistical power is insufficient. This relaxed threshold is more powerful but cannot properly control false positives in multiple testing. We identified 13 SNPs using this threshold, which led to the discovery of extra genes (such as ATP10A-DT and PTPRM). Other published studies reported these genes as related to brain development or neuro-development, indicating these genes are potential novel genes for dementia. Those novel potential loci and genes may help identify targets for developing new therapies. However, we suggest using them with caution since they are discovered without proper false positive control.https://www.mdpi.com/2673-7426/3/1/10dementiaGWASTOMM40electronic medical records |
spellingShingle | Xiaowen Cao Yao Dong Li Xing Xuekui Zhang A Genome-Wide Association Study of Dementia Using the Electronic Medical Record BioMedInformatics dementia GWAS TOMM40 electronic medical records |
title | A Genome-Wide Association Study of Dementia Using the Electronic Medical Record |
title_full | A Genome-Wide Association Study of Dementia Using the Electronic Medical Record |
title_fullStr | A Genome-Wide Association Study of Dementia Using the Electronic Medical Record |
title_full_unstemmed | A Genome-Wide Association Study of Dementia Using the Electronic Medical Record |
title_short | A Genome-Wide Association Study of Dementia Using the Electronic Medical Record |
title_sort | genome wide association study of dementia using the electronic medical record |
topic | dementia GWAS TOMM40 electronic medical records |
url | https://www.mdpi.com/2673-7426/3/1/10 |
work_keys_str_mv | AT xiaowencao agenomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT yaodong agenomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT lixing agenomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT xuekuizhang agenomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT xiaowencao genomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT yaodong genomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT lixing genomewideassociationstudyofdementiausingtheelectronicmedicalrecord AT xuekuizhang genomewideassociationstudyofdementiausingtheelectronicmedicalrecord |