Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography

Abstract The most common BRAF mutation is thymine (T) to adenine (A) missense mutation in nucleotide 1796 (T1796A, V600E). The BRAFV600E gene encodes a protein-dependent kinase (PDK), which is a key component of the mitogen-activated protein kinase pathway and essential for controlling cell prolifer...

Full description

Bibliographic Details
Main Authors: Enock Adjei Agyekum, Yu-guo Wang, Fei-Ju Xu, Debora Akortia, Yong-zhen Ren, Kevoyne Hakeem Chambers, Xian Wang, Jenny Olalia Taupa, Xiao-qin Qian
Format: Article
Language:English
Published: Nature Portfolio 2023-08-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-023-39747-6
_version_ 1797453045259304960
author Enock Adjei Agyekum
Yu-guo Wang
Fei-Ju Xu
Debora Akortia
Yong-zhen Ren
Kevoyne Hakeem Chambers
Xian Wang
Jenny Olalia Taupa
Xiao-qin Qian
author_facet Enock Adjei Agyekum
Yu-guo Wang
Fei-Ju Xu
Debora Akortia
Yong-zhen Ren
Kevoyne Hakeem Chambers
Xian Wang
Jenny Olalia Taupa
Xiao-qin Qian
author_sort Enock Adjei Agyekum
collection DOAJ
description Abstract The most common BRAF mutation is thymine (T) to adenine (A) missense mutation in nucleotide 1796 (T1796A, V600E). The BRAFV600E gene encodes a protein-dependent kinase (PDK), which is a key component of the mitogen-activated protein kinase pathway and essential for controlling cell proliferation, differentiation, and death. The BRAFV600E mutation causes PDK to be activated improperly and continuously, resulting in abnormal proliferation and differentiation in PTC. Based on elastography ultrasound (US) radiomic features, this study seeks to create and validate six distinct machine learning algorithms to predict BRAFV6OOE mutation in PTC patients prior to surgery. This study employed routine US strain elastography image data from 138 PTC patients. The patients were separated into two groups: those who did not have the BRAFV600E mutation (n = 75) and those who did have the mutation (n = 63). The patients were randomly assigned to one of two data sets: training (70%), or validation (30%). From strain elastography US images, a total of 479 radiomic features were retrieved. Pearson's Correlation Coefficient (PCC) and Recursive Feature Elimination (RFE) with stratified tenfold cross-validation were used to decrease the features. Based on selected radiomic features, six machine learning algorithms including support vector machine with the linear kernel (SVM_L), support vector machine with radial basis function kernel (SVM_RBF), logistic regression (LR), Naïve Bayes (NB), K-nearest neighbors (KNN), and linear discriminant analysis (LDA) were compared to predict the possibility of BRAFV600E. The accuracy (ACC), the area under the curve (AUC), sensitivity (SEN), specificity (SPEC), positive predictive value (PPV), negative predictive value (NPV), decision curve analysis (DCA), and calibration curves of the machine learning algorithms were used to evaluate their performance. ① The machine learning algorithms' diagnostic performance depended on 27 radiomic features. ② AUCs for NB, KNN, LDA, LR, SVM_L, and SVM_RBF were 0.80 (95% confidence interval [CI]: 0.65–0.91), 0.87 (95% CI 0.73–0.95), 0.91(95% CI 0.79–0.98), 0.92 (95% CI 0.80–0.98), 0.93 (95% CI 0.80–0.98), and 0.98 (95% CI 0.88–1.00), respectively. ③ There was a significant difference in echogenicity,vertical and horizontal diameter ratios, and elasticity between PTC patients with BRAFV600E and PTC patients without BRAFV600E. Machine learning algorithms based on US elastography radiomic features are capable of predicting the likelihood of BRAFV600E in PTC patients, which can assist physicians in identifying the risk of BRAFV600E in PTC patients. Among the six machine learning algorithms, the support vector machine with radial basis function (SVM_RBF) achieved the best ACC (0.93), AUC (0.98), SEN (0.95), SPEC (0.90), PPV (0.91), and NPV (0.95).
first_indexed 2024-03-09T15:17:19Z
format Article
id doaj.art-23a638fc340c4e6e85e1aaa660801a95
institution Directory Open Access Journal
issn 2045-2322
language English
last_indexed 2024-03-09T15:17:19Z
publishDate 2023-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj.art-23a638fc340c4e6e85e1aaa660801a952023-11-26T13:01:14ZengNature PortfolioScientific Reports2045-23222023-08-0113111410.1038/s41598-023-39747-6Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastographyEnock Adjei Agyekum0Yu-guo Wang1Fei-Ju Xu2Debora Akortia3Yong-zhen Ren4Kevoyne Hakeem Chambers5Xian Wang6Jenny Olalia Taupa7Xiao-qin Qian8Ultrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversityDepartment of Ultrasound, Traditional Chinese Medicine Hospital of Nanjing Lishui DistrictUltrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversitySchool of Public Health, Kwame Nkrumah University of Science and TechnologyUltrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversitySchool of Medicine, Jiangsu UniversityUltrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversityUltrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversityUltrasound Medical Laboratory, Department of Ultrasound, Affiliated People’s Hospital of Jiangsu UniversityAbstract The most common BRAF mutation is thymine (T) to adenine (A) missense mutation in nucleotide 1796 (T1796A, V600E). The BRAFV600E gene encodes a protein-dependent kinase (PDK), which is a key component of the mitogen-activated protein kinase pathway and essential for controlling cell proliferation, differentiation, and death. The BRAFV600E mutation causes PDK to be activated improperly and continuously, resulting in abnormal proliferation and differentiation in PTC. Based on elastography ultrasound (US) radiomic features, this study seeks to create and validate six distinct machine learning algorithms to predict BRAFV6OOE mutation in PTC patients prior to surgery. This study employed routine US strain elastography image data from 138 PTC patients. The patients were separated into two groups: those who did not have the BRAFV600E mutation (n = 75) and those who did have the mutation (n = 63). The patients were randomly assigned to one of two data sets: training (70%), or validation (30%). From strain elastography US images, a total of 479 radiomic features were retrieved. Pearson's Correlation Coefficient (PCC) and Recursive Feature Elimination (RFE) with stratified tenfold cross-validation were used to decrease the features. Based on selected radiomic features, six machine learning algorithms including support vector machine with the linear kernel (SVM_L), support vector machine with radial basis function kernel (SVM_RBF), logistic regression (LR), Naïve Bayes (NB), K-nearest neighbors (KNN), and linear discriminant analysis (LDA) were compared to predict the possibility of BRAFV600E. The accuracy (ACC), the area under the curve (AUC), sensitivity (SEN), specificity (SPEC), positive predictive value (PPV), negative predictive value (NPV), decision curve analysis (DCA), and calibration curves of the machine learning algorithms were used to evaluate their performance. ① The machine learning algorithms' diagnostic performance depended on 27 radiomic features. ② AUCs for NB, KNN, LDA, LR, SVM_L, and SVM_RBF were 0.80 (95% confidence interval [CI]: 0.65–0.91), 0.87 (95% CI 0.73–0.95), 0.91(95% CI 0.79–0.98), 0.92 (95% CI 0.80–0.98), 0.93 (95% CI 0.80–0.98), and 0.98 (95% CI 0.88–1.00), respectively. ③ There was a significant difference in echogenicity,vertical and horizontal diameter ratios, and elasticity between PTC patients with BRAFV600E and PTC patients without BRAFV600E. Machine learning algorithms based on US elastography radiomic features are capable of predicting the likelihood of BRAFV600E in PTC patients, which can assist physicians in identifying the risk of BRAFV600E in PTC patients. Among the six machine learning algorithms, the support vector machine with radial basis function (SVM_RBF) achieved the best ACC (0.93), AUC (0.98), SEN (0.95), SPEC (0.90), PPV (0.91), and NPV (0.95).https://doi.org/10.1038/s41598-023-39747-6
spellingShingle Enock Adjei Agyekum
Yu-guo Wang
Fei-Ju Xu
Debora Akortia
Yong-zhen Ren
Kevoyne Hakeem Chambers
Xian Wang
Jenny Olalia Taupa
Xiao-qin Qian
Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
Scientific Reports
title Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
title_full Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
title_fullStr Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
title_full_unstemmed Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
title_short Predicting BRAFV600E mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
title_sort predicting brafv600e mutations in papillary thyroid carcinoma using six machine learning algorithms based on ultrasound elastography
url https://doi.org/10.1038/s41598-023-39747-6
work_keys_str_mv AT enockadjeiagyekum predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT yuguowang predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT feijuxu predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT deboraakortia predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT yongzhenren predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT kevoynehakeemchambers predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT xianwang predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT jennyolaliataupa predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography
AT xiaoqinqian predictingbrafv600emutationsinpapillarythyroidcarcinomausingsixmachinelearningalgorithmsbasedonultrasoundelastography