Real time spatial cluster detection using interpoint distances among precise patient locations

<p>Abstract</p> <p>Background</p> <p>Public health departments in the United States are beginning to gain timely access to health data, often as soon as one day after a visit to a health care facility. Consequently, new approaches to outbreak surveillance are being deve...

Full description

Bibliographic Details
Main Authors: Bonetti Marco, Olson Karen L, Pagano Marcello, Mandl Kenneth D
Format: Article
Language:English
Published: BMC 2005-06-01
Series:BMC Medical Informatics and Decision Making
Online Access:http://www.biomedcentral.com/1472-6947/5/19
_version_ 1811309918290968576
author Bonetti Marco
Olson Karen L
Pagano Marcello
Mandl Kenneth D
author_facet Bonetti Marco
Olson Karen L
Pagano Marcello
Mandl Kenneth D
author_sort Bonetti Marco
collection DOAJ
description <p>Abstract</p> <p>Background</p> <p>Public health departments in the United States are beginning to gain timely access to health data, often as soon as one day after a visit to a health care facility. Consequently, new approaches to outbreak surveillance are being developed. When cases cluster geographically, an analysis of their spatial distribution can facilitate outbreak detection. Our method focuses on detecting perturbations in the distribution of pair-wise distances among all patients in a geographical region. Barring outbreaks, this distribution can be quite stable over time. We sought to exemplify the method by measuring its cluster detection performance, and to determine factors affecting sensitivity to spatial clustering among patients presenting to hospital emergency departments with respiratory syndromes.</p> <p>Methods</p> <p>The approach was to (1) define a baseline spatial distribution of home addresses for a population of patients visiting an emergency department with respiratory syndromes using historical data; (2) develop a controlled feature set simulation by inserting simulated outbreak data with varied parameters into authentic background noise, thereby creating semisynthetic data; (3) compare the observed with the expected spatial distribution; (4) establish the relative value of different alarm strategies so as to maximize sensitivity for the detection of clustering; and (5) measure factors which have an impact on sensitivity.</p> <p>Results</p> <p>Overall sensitivity to detect spatial clustering was 62%. This contrasts with an overall alarm rate of less than 5% for the same number of extra visits when the extra visits were not characterized by geographic clustering. Clusters that produced the least number of alarms were those that were small in size (10 extra visits in a week, where visits per week ranged from 120 to 472), diffusely distributed over an area with a 3 km radius, and located close to the hospital (5 km) in a region most densely populated with patients to this hospital. Near perfect alarm rates were found for clusters that varied on the opposite extremes of these parameters (40 extra visits, within a 250 meter radius, 50 km from the hospital).</p> <p>Conclusion</p> <p>Measuring perturbations in the interpoint distance distribution is a sensitive method for detecting spatial clustering. When cases are clustered geographically, there is clearly power to detect clustering when the spatial distribution is represented by the M statistic, even when clusters are small in size. By varying independent parameters of simulated outbreaks, we have demonstrated empirically the limits of detection of different types of outbreaks.</p>
first_indexed 2024-04-13T09:50:49Z
format Article
id doaj.art-30eb5c3d24de4514b21e31e1b2da20bb
institution Directory Open Access Journal
issn 1472-6947
language English
last_indexed 2024-04-13T09:50:49Z
publishDate 2005-06-01
publisher BMC
record_format Article
series BMC Medical Informatics and Decision Making
spelling doaj.art-30eb5c3d24de4514b21e31e1b2da20bb2022-12-22T02:51:37ZengBMCBMC Medical Informatics and Decision Making1472-69472005-06-01511910.1186/1472-6947-5-19Real time spatial cluster detection using interpoint distances among precise patient locationsBonetti MarcoOlson Karen LPagano MarcelloMandl Kenneth D<p>Abstract</p> <p>Background</p> <p>Public health departments in the United States are beginning to gain timely access to health data, often as soon as one day after a visit to a health care facility. Consequently, new approaches to outbreak surveillance are being developed. When cases cluster geographically, an analysis of their spatial distribution can facilitate outbreak detection. Our method focuses on detecting perturbations in the distribution of pair-wise distances among all patients in a geographical region. Barring outbreaks, this distribution can be quite stable over time. We sought to exemplify the method by measuring its cluster detection performance, and to determine factors affecting sensitivity to spatial clustering among patients presenting to hospital emergency departments with respiratory syndromes.</p> <p>Methods</p> <p>The approach was to (1) define a baseline spatial distribution of home addresses for a population of patients visiting an emergency department with respiratory syndromes using historical data; (2) develop a controlled feature set simulation by inserting simulated outbreak data with varied parameters into authentic background noise, thereby creating semisynthetic data; (3) compare the observed with the expected spatial distribution; (4) establish the relative value of different alarm strategies so as to maximize sensitivity for the detection of clustering; and (5) measure factors which have an impact on sensitivity.</p> <p>Results</p> <p>Overall sensitivity to detect spatial clustering was 62%. This contrasts with an overall alarm rate of less than 5% for the same number of extra visits when the extra visits were not characterized by geographic clustering. Clusters that produced the least number of alarms were those that were small in size (10 extra visits in a week, where visits per week ranged from 120 to 472), diffusely distributed over an area with a 3 km radius, and located close to the hospital (5 km) in a region most densely populated with patients to this hospital. Near perfect alarm rates were found for clusters that varied on the opposite extremes of these parameters (40 extra visits, within a 250 meter radius, 50 km from the hospital).</p> <p>Conclusion</p> <p>Measuring perturbations in the interpoint distance distribution is a sensitive method for detecting spatial clustering. When cases are clustered geographically, there is clearly power to detect clustering when the spatial distribution is represented by the M statistic, even when clusters are small in size. By varying independent parameters of simulated outbreaks, we have demonstrated empirically the limits of detection of different types of outbreaks.</p>http://www.biomedcentral.com/1472-6947/5/19
spellingShingle Bonetti Marco
Olson Karen L
Pagano Marcello
Mandl Kenneth D
Real time spatial cluster detection using interpoint distances among precise patient locations
BMC Medical Informatics and Decision Making
title Real time spatial cluster detection using interpoint distances among precise patient locations
title_full Real time spatial cluster detection using interpoint distances among precise patient locations
title_fullStr Real time spatial cluster detection using interpoint distances among precise patient locations
title_full_unstemmed Real time spatial cluster detection using interpoint distances among precise patient locations
title_short Real time spatial cluster detection using interpoint distances among precise patient locations
title_sort real time spatial cluster detection using interpoint distances among precise patient locations
url http://www.biomedcentral.com/1472-6947/5/19
work_keys_str_mv AT bonettimarco realtimespatialclusterdetectionusinginterpointdistancesamongprecisepatientlocations
AT olsonkarenl realtimespatialclusterdetectionusinginterpointdistancesamongprecisepatientlocations
AT paganomarcello realtimespatialclusterdetectionusinginterpointdistancesamongprecisepatientlocations
AT mandlkennethd realtimespatialclusterdetectionusinginterpointdistancesamongprecisepatientlocations