Summary: | <b>Objectives:</b> To establish and verify radiomics models based on multiparametric MRI for preoperatively identifying the microsatellite instability (MSI) status of rectal cancer (RC) by comparing different machine learning algorithms. <b>Methods:</b> This retrospective study enrolled 383 (training set, 268; test set, 115) RC patients between January 2017 and June 2022. A total of 4148 radiomics features were extracted from multiparametric MRI, including T<sub>2</sub>-weighted imaging, T<sub>1</sub>-weighted imaging, apparent diffusion coefficient, and contrast-enhanced T<sub>1</sub>-weighted imaging. The analysis of variance, correlation test, univariate logistic analysis, and a gradient-boosting decision tree were used for the dimension reduction. Logistic regression, Bayes, support vector machine (SVM), K-nearest neighbor (KNN), and tree machine learning algorithms were used to build different radiomics models. The relative standard deviation (RSD) and bootstrap method were used to quantify the stability of these five algorithms. Then, predictive performances of different models were assessed using area under curves (AUCs). The performance of the best radiomics model was evaluated using calibration and discrimination. <b>Results:</b> Among these 383 patients, the prevalence of MSI was 14.62% (56/383). The RSD value of logistic regression algorithm was the lowest (4.64%), followed by Bayes (5.44%) and KNN (5.45%), which was significantly better than that of SVM (19.11%) and tree (11.94%) algorithms. The radiomics model based on logistic regression algorithm performed best, with AUCs of 0.827 and 0.739 in the training and test sets, respectively. <b>Conclusions:</b> We developed a radiomics model based on the logistic regression algorithm, which could potentially be used to facilitate the individualized prediction of MSI status in RC patients.
|