Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network

Objectives Genome-wide association studies (GWAS) are performed to study the associations between genetic variants with respect to certain phenotypic traits such as cancer. However, the method that is commonly used in GWAS assumes that certain traits are solely affected by a single mutation. We prop...

Full description

Bibliographic Details
Main Authors: Bharuno Mahesworo, Arif Budiarto, Alam Ahmad Hidayat, Bens Pardamean
Format: Article
Language:English
Published: The Korean Society of Medical Informatics 2022-07-01
Series:Healthcare Informatics Research
Subjects:
Online Access:http://www.e-hir.org/upload/pdf/hir-2022-28-3-247.pdf
_version_ 1828295914109796352
author Bharuno Mahesworo
Arif Budiarto
Alam Ahmad Hidayat
Bens Pardamean
author_facet Bharuno Mahesworo
Arif Budiarto
Alam Ahmad Hidayat
Bens Pardamean
author_sort Bharuno Mahesworo
collection DOAJ
description Objectives Genome-wide association studies (GWAS) are performed to study the associations between genetic variants with respect to certain phenotypic traits such as cancer. However, the method that is commonly used in GWAS assumes that certain traits are solely affected by a single mutation. We propose a network analysis method, in which we generate association networks of single-nucleotide polymorphisms (SNPs) that can differentiate case and control groups. We hypothesize that certain phenotypic traits are attributable to mutations in groups of associated SNPs. Methods We propose a method based on a network analysis framework to study SNP-SNP interactions related to cancer incidence. We employed logistic regression to measure the significance of all SNP pairs from GWAS for the incidence of colorectal cancer and computed a cancer risk score based on the generated SNP networks. Results We demonstrated our method in a dataset from a case-control study of colorectal cancer in the South Sulawesi population. From the GWAS results, 20,094 pairs of 200 SNPs were created. We obtained one cluster containing four pairs of five SNPs that passed the filtering threshold based on their p-values. A locus on chromosome 12 (12:54410007) was found to be strongly connected to the four variants on chromosome 1. A polygenic risk score was computed from the five SNPs, and a significant difference in colorectal cancer risk was obtained between the case and control groups. Conclusions Our results demonstrate the applicability of our method to understand SNP-SNP interactions and compute risk scores for various types of cancer.
first_indexed 2024-04-13T12:02:54Z
format Article
id doaj.art-aa31eef4b9f34c6da4cfe3db8f2f0ef7
institution Directory Open Access Journal
issn 2093-3681
2093-369X
language English
last_indexed 2024-04-13T12:02:54Z
publishDate 2022-07-01
publisher The Korean Society of Medical Informatics
record_format Article
series Healthcare Informatics Research
spelling doaj.art-aa31eef4b9f34c6da4cfe3db8f2f0ef72022-12-22T02:47:44ZengThe Korean Society of Medical InformaticsHealthcare Informatics Research2093-36812093-369X2022-07-0128324725510.4258/hir.2022.28.3.2471127Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism NetworkBharuno Mahesworo0Arif Budiarto1Alam Ahmad Hidayat2Bens Pardamean3 Department of Statistics, School of Computer Science, Bina Nusantara University, Jakarta, Indonesia Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia Bioinformatics and Data Science Research Center, Bina Nusantara University, Jakarta, IndonesiaObjectives Genome-wide association studies (GWAS) are performed to study the associations between genetic variants with respect to certain phenotypic traits such as cancer. However, the method that is commonly used in GWAS assumes that certain traits are solely affected by a single mutation. We propose a network analysis method, in which we generate association networks of single-nucleotide polymorphisms (SNPs) that can differentiate case and control groups. We hypothesize that certain phenotypic traits are attributable to mutations in groups of associated SNPs. Methods We propose a method based on a network analysis framework to study SNP-SNP interactions related to cancer incidence. We employed logistic regression to measure the significance of all SNP pairs from GWAS for the incidence of colorectal cancer and computed a cancer risk score based on the generated SNP networks. Results We demonstrated our method in a dataset from a case-control study of colorectal cancer in the South Sulawesi population. From the GWAS results, 20,094 pairs of 200 SNPs were created. We obtained one cluster containing four pairs of five SNPs that passed the filtering threshold based on their p-values. A locus on chromosome 12 (12:54410007) was found to be strongly connected to the four variants on chromosome 1. A polygenic risk score was computed from the five SNPs, and a significant difference in colorectal cancer risk was obtained between the case and control groups. Conclusions Our results demonstrate the applicability of our method to understand SNP-SNP interactions and compute risk scores for various types of cancer.http://www.e-hir.org/upload/pdf/hir-2022-28-3-247.pdfdata analysisgeneticsrisk factorscolorectal neoplasmsmultifactorial inheritance
spellingShingle Bharuno Mahesworo
Arif Budiarto
Alam Ahmad Hidayat
Bens Pardamean
Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
Healthcare Informatics Research
data analysis
genetics
risk factors
colorectal neoplasms
multifactorial inheritance
title Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
title_full Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
title_fullStr Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
title_full_unstemmed Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
title_short Cancer Risk Score Prediction Based on a Single-Nucleotide Polymorphism Network
title_sort cancer risk score prediction based on a single nucleotide polymorphism network
topic data analysis
genetics
risk factors
colorectal neoplasms
multifactorial inheritance
url http://www.e-hir.org/upload/pdf/hir-2022-28-3-247.pdf
work_keys_str_mv AT bharunomahesworo cancerriskscorepredictionbasedonasinglenucleotidepolymorphismnetwork
AT arifbudiarto cancerriskscorepredictionbasedonasinglenucleotidepolymorphismnetwork
AT alamahmadhidayat cancerriskscorepredictionbasedonasinglenucleotidepolymorphismnetwork
AT benspardamean cancerriskscorepredictionbasedonasinglenucleotidepolymorphismnetwork