Evaluation of stability of directly standardized rates for sparse data using simulation methods

Abstract Background Directly standardized rates (DSRs) adjust for different age distributions in different populations and enable, say, the rates of disease between the populations to be directly compared. They are routinely published but there is concern that a DSR is not valid when it is based on...

Full description

Bibliographic Details
Main Authors: Joan K. Morris, Joachim Tan, Paul Fryers, Jonathan Bestwick
Format: Article
Language:English
Published: BMC 2018-12-01
Series:Population Health Metrics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12963-018-0177-1
_version_ 1831635443148390400
author Joan K. Morris
Joachim Tan
Paul Fryers
Jonathan Bestwick
author_facet Joan K. Morris
Joachim Tan
Paul Fryers
Jonathan Bestwick
author_sort Joan K. Morris
collection DOAJ
description Abstract Background Directly standardized rates (DSRs) adjust for different age distributions in different populations and enable, say, the rates of disease between the populations to be directly compared. They are routinely published but there is concern that a DSR is not valid when it is based on a “small” number of events. The aim of this study was to determine the value at which a DSR should not be published when analyzing real data in England. Methods Standard Monte Carlo simulation techniques were used assuming the number of events in 19 age groups (i.e., 0–4, 5–9, ... 90+ years) follow independent Poisson distributions. The total number of events, age specific risks, and the population sizes in each age group were varied. For each of 10,000 simulations the DSR (using the 2013 European Standard Population weights), together with the coverage of three different methods (normal approximation, Dobson, and Tiwari modified gamma) of estimating the 95% confidence intervals (CIs), were calculated. Results The normal approximation was, as expected, not suitable for use when fewer than 100 events occurred. The Tiwari method and the Dobson method of calculating confidence intervals produced similar estimates and either was suitable when the expected or observed numbers of events were 10 or greater. The accuracy of the CIs was not influenced by the distribution of the events across categories (i.e., the degree of clustering, the age distributions of the sampling populations, and the number of categories with no events occurring in them). Conclusions DSRs should not be given when the total observed number of events is less than 10. The Dobson method might be considered the preferred method due to the formulae being simpler than that of the Tiwari method and the coverage being slightly more accurate.
first_indexed 2024-12-19T06:14:29Z
format Article
id doaj.art-cb40e4b2f0dc41cf84fb02472976b94b
institution Directory Open Access Journal
issn 1478-7954
language English
last_indexed 2024-12-19T06:14:29Z
publishDate 2018-12-01
publisher BMC
record_format Article
series Population Health Metrics
spelling doaj.art-cb40e4b2f0dc41cf84fb02472976b94b2022-12-21T20:32:54ZengBMCPopulation Health Metrics1478-79542018-12-011611910.1186/s12963-018-0177-1Evaluation of stability of directly standardized rates for sparse data using simulation methodsJoan K. Morris0Joachim Tan1Paul Fryers2Jonathan Bestwick3Centre for Environmental and Preventive Medicine, Wolfson Institute of Preventive Medicine, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonCentre for Environmental and Preventive Medicine, Wolfson Institute of Preventive Medicine, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonHealth Intelligence Division, Public Health EnglandCentre for Environmental and Preventive Medicine, Wolfson Institute of Preventive Medicine, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonAbstract Background Directly standardized rates (DSRs) adjust for different age distributions in different populations and enable, say, the rates of disease between the populations to be directly compared. They are routinely published but there is concern that a DSR is not valid when it is based on a “small” number of events. The aim of this study was to determine the value at which a DSR should not be published when analyzing real data in England. Methods Standard Monte Carlo simulation techniques were used assuming the number of events in 19 age groups (i.e., 0–4, 5–9, ... 90+ years) follow independent Poisson distributions. The total number of events, age specific risks, and the population sizes in each age group were varied. For each of 10,000 simulations the DSR (using the 2013 European Standard Population weights), together with the coverage of three different methods (normal approximation, Dobson, and Tiwari modified gamma) of estimating the 95% confidence intervals (CIs), were calculated. Results The normal approximation was, as expected, not suitable for use when fewer than 100 events occurred. The Tiwari method and the Dobson method of calculating confidence intervals produced similar estimates and either was suitable when the expected or observed numbers of events were 10 or greater. The accuracy of the CIs was not influenced by the distribution of the events across categories (i.e., the degree of clustering, the age distributions of the sampling populations, and the number of categories with no events occurring in them). Conclusions DSRs should not be given when the total observed number of events is less than 10. The Dobson method might be considered the preferred method due to the formulae being simpler than that of the Tiwari method and the coverage being slightly more accurate.http://link.springer.com/article/10.1186/s12963-018-0177-1Direct standardizationMonte Carlo simulationConfidence interval coverageTiwariDobson
spellingShingle Joan K. Morris
Joachim Tan
Paul Fryers
Jonathan Bestwick
Evaluation of stability of directly standardized rates for sparse data using simulation methods
Population Health Metrics
Direct standardization
Monte Carlo simulation
Confidence interval coverage
Tiwari
Dobson
title Evaluation of stability of directly standardized rates for sparse data using simulation methods
title_full Evaluation of stability of directly standardized rates for sparse data using simulation methods
title_fullStr Evaluation of stability of directly standardized rates for sparse data using simulation methods
title_full_unstemmed Evaluation of stability of directly standardized rates for sparse data using simulation methods
title_short Evaluation of stability of directly standardized rates for sparse data using simulation methods
title_sort evaluation of stability of directly standardized rates for sparse data using simulation methods
topic Direct standardization
Monte Carlo simulation
Confidence interval coverage
Tiwari
Dobson
url http://link.springer.com/article/10.1186/s12963-018-0177-1
work_keys_str_mv AT joankmorris evaluationofstabilityofdirectlystandardizedratesforsparsedatausingsimulationmethods
AT joachimtan evaluationofstabilityofdirectlystandardizedratesforsparsedatausingsimulationmethods
AT paulfryers evaluationofstabilityofdirectlystandardizedratesforsparsedatausingsimulationmethods
AT jonathanbestwick evaluationofstabilityofdirectlystandardizedratesforsparsedatausingsimulationmethods