PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT

Introduction: Intracerebral hemorrhage (ICH), where a blood vessel ruptures into areas of the brain, accounts for approximately 10–15% of all strokes. X-ray computed tomography (CT) scanning is largely used to assess the location and volume of these hemorrhages. Manual segmentation of the CT scan us...

Full description

Bibliographic Details
Main Authors: John Muschelli, Elizabeth M. Sweeney, Natalie L. Ullman, Paul Vespa, Daniel F. Hanley, Ciprian M. Crainiceanu
Format: Article
Language:English
Published: Elsevier 2017-01-01
Series:NeuroImage: Clinical
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2213158217300414
_version_ 1819153405089677312
author John Muschelli
Elizabeth M. Sweeney
Natalie L. Ullman
Paul Vespa
Daniel F. Hanley
Ciprian M. Crainiceanu
author_facet John Muschelli
Elizabeth M. Sweeney
Natalie L. Ullman
Paul Vespa
Daniel F. Hanley
Ciprian M. Crainiceanu
author_sort John Muschelli
collection DOAJ
description Introduction: Intracerebral hemorrhage (ICH), where a blood vessel ruptures into areas of the brain, accounts for approximately 10–15% of all strokes. X-ray computed tomography (CT) scanning is largely used to assess the location and volume of these hemorrhages. Manual segmentation of the CT scan using planimetry by an expert reader is the gold standard for volume estimation, but is time-consuming and has within- and across-reader variability. We propose a fully automated segmentation approach using a random forest algorithm with features extracted from X-ray computed tomography (CT) scans. Methods: The Minimally Invasive Surgery plus rt-PA in ICH Evacuation (MISTIE) trial was a multi-site Phase II clinical trial that tested the safety of hemorrhage removal using recombinant-tissue plasminogen activator (rt-PA). For this analysis, we use 112 baseline CT scans from patients enrolled in the MISTE trial, one CT scan per patient. ICH was manually segmented on these CT scans by expert readers. We derived a set of imaging predictors from each scan. Using 10 randomly-selected scans, we used a first-pass voxel selection procedure based on quantiles of a set of predictors and then built 4 models estimating the voxel-level probability of ICH. The models used were: 1) logistic regression, 2) logistic regression with a penalty on the model parameters using LASSO, 3) a generalized additive model (GAM) and 4) a random forest classifier. The remaining 102 scans were used for model validation.For each validation scan, the model predicted the probability of ICH at each voxel. These voxel-level probabilities were then thresholded to produce binary segmentations of the hemorrhage. These masks were compared to the manual segmentations using the Dice Similarity Index (DSI) and the correlation of hemorrhage volume of between the two segmentations. We tested equality of median DSI using the Kruskal-Wallis test across the 4 models. We tested equality of the median DSI from sets of 2 models using a Wilcoxon signed-rank test. Results: All results presented are for the 102 scans in the validation set. The median DSI for each model was: 0.89 (logistic), 0.885 (LASSO), 0.88 (GAM), and 0.899 (random forest). Using the random forest results in a slightly higher median DSI compared to the other models. After Bonferroni correction, the hypothesis of equality of median DSI was rejected only when comparing the random forest DSI to the DSI from the logistic (p < 0.001), LASSO (p < 0.001), or GAM (p < 0.001) models. In practical terms the difference between the random forest and the logistic regression is quite small. The correlation (95% CI) between the volume from manual segmentation and the predicted volume was 0.93 (0.9,0.95) for the random forest model. These results indicate that random forest approach can achieve accurate segmentation of ICH in a population of patients from a variety of imaging centers. We provide an R package (https://github.com/muschellij2/ichseg) and a Shiny R application online (http://johnmuschelli.com/ich_segment_all.html) for implementing and testing the proposed approach.
first_indexed 2024-12-22T15:04:40Z
format Article
id doaj.art-4658cef809d54ddb84795c35f6e34fed
institution Directory Open Access Journal
issn 2213-1582
language English
last_indexed 2024-12-22T15:04:40Z
publishDate 2017-01-01
publisher Elsevier
record_format Article
series NeuroImage: Clinical
spelling doaj.art-4658cef809d54ddb84795c35f6e34fed2022-12-21T18:22:01ZengElsevierNeuroImage: Clinical2213-15822017-01-0114C37939010.1016/j.nicl.2017.02.007PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CTJohn Muschelli0Elizabeth M. Sweeney1Natalie L. Ullman2Paul Vespa3Daniel F. Hanley4Ciprian M. Crainiceanu5Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USADepartment of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USADepartment of Neurology, Division of Brain Injury Outcomes, Johns Hopkins Medical Institutions, Baltimore, MD, USADepartment of Neurosurgery, David Geffen School of Medicine at UCLA, Los Angeles, CA, USADepartment of Neurology, Division of Brain Injury Outcomes, Johns Hopkins Medical Institutions, Baltimore, MD, USADepartment of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USAIntroduction: Intracerebral hemorrhage (ICH), where a blood vessel ruptures into areas of the brain, accounts for approximately 10–15% of all strokes. X-ray computed tomography (CT) scanning is largely used to assess the location and volume of these hemorrhages. Manual segmentation of the CT scan using planimetry by an expert reader is the gold standard for volume estimation, but is time-consuming and has within- and across-reader variability. We propose a fully automated segmentation approach using a random forest algorithm with features extracted from X-ray computed tomography (CT) scans. Methods: The Minimally Invasive Surgery plus rt-PA in ICH Evacuation (MISTIE) trial was a multi-site Phase II clinical trial that tested the safety of hemorrhage removal using recombinant-tissue plasminogen activator (rt-PA). For this analysis, we use 112 baseline CT scans from patients enrolled in the MISTE trial, one CT scan per patient. ICH was manually segmented on these CT scans by expert readers. We derived a set of imaging predictors from each scan. Using 10 randomly-selected scans, we used a first-pass voxel selection procedure based on quantiles of a set of predictors and then built 4 models estimating the voxel-level probability of ICH. The models used were: 1) logistic regression, 2) logistic regression with a penalty on the model parameters using LASSO, 3) a generalized additive model (GAM) and 4) a random forest classifier. The remaining 102 scans were used for model validation.For each validation scan, the model predicted the probability of ICH at each voxel. These voxel-level probabilities were then thresholded to produce binary segmentations of the hemorrhage. These masks were compared to the manual segmentations using the Dice Similarity Index (DSI) and the correlation of hemorrhage volume of between the two segmentations. We tested equality of median DSI using the Kruskal-Wallis test across the 4 models. We tested equality of the median DSI from sets of 2 models using a Wilcoxon signed-rank test. Results: All results presented are for the 102 scans in the validation set. The median DSI for each model was: 0.89 (logistic), 0.885 (LASSO), 0.88 (GAM), and 0.899 (random forest). Using the random forest results in a slightly higher median DSI compared to the other models. After Bonferroni correction, the hypothesis of equality of median DSI was rejected only when comparing the random forest DSI to the DSI from the logistic (p < 0.001), LASSO (p < 0.001), or GAM (p < 0.001) models. In practical terms the difference between the random forest and the logistic regression is quite small. The correlation (95% CI) between the volume from manual segmentation and the predicted volume was 0.93 (0.9,0.95) for the random forest model. These results indicate that random forest approach can achieve accurate segmentation of ICH in a population of patients from a variety of imaging centers. We provide an R package (https://github.com/muschellij2/ichseg) and a Shiny R application online (http://johnmuschelli.com/ich_segment_all.html) for implementing and testing the proposed approach.http://www.sciencedirect.com/science/article/pii/S2213158217300414CTICH segmentationIntracerebral hemorrhageStroke
spellingShingle John Muschelli
Elizabeth M. Sweeney
Natalie L. Ullman
Paul Vespa
Daniel F. Hanley
Ciprian M. Crainiceanu
PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
NeuroImage: Clinical
CT
ICH segmentation
Intracerebral hemorrhage
Stroke
title PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
title_full PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
title_fullStr PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
title_full_unstemmed PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
title_short PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
title_sort pitchperfect primary intracranial hemorrhage probability estimation using random forests on ct
topic CT
ICH segmentation
Intracerebral hemorrhage
Stroke
url http://www.sciencedirect.com/science/article/pii/S2213158217300414
work_keys_str_mv AT johnmuschelli pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct
AT elizabethmsweeney pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct
AT natalielullman pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct
AT paulvespa pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct
AT danielfhanley pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct
AT ciprianmcrainiceanu pitchperfectprimaryintracranialhemorrhageprobabilityestimationusingrandomforestsonct