Reporting and interpretation of subgroup analyses in heart failure randomized controlled trials

Abstract Aims This study aimed to investigate the reporting of subgroup analyses in heart failure (HF) randomized controlled trials (RCTs) and to determine the strength and credibility of subgroup claims. Methods and results All primary HF RCTs published in nine high‐impact journals from 1 January 2...

Full description

Bibliographic Details
Main Authors: Muhammad Shahzeb Khan, Muhammad Arbaz Arshad Khan, Simra Irfan, Tariq Jamal Siddiqi, Stephen J. Greene, Stefan D. Anker, Jayakumar Sreenivasan, Tim Friede, Ayman Samman Tahhan, Muthiah Vaduganathan, Gregg C. Fonarow, Javed Butler
Format: Article
Language:English
Published: Wiley 2021-02-01
Series:ESC Heart Failure
Subjects:
Online Access:https://doi.org/10.1002/ehf2.13122
Description
Summary:Abstract Aims This study aimed to investigate the reporting of subgroup analyses in heart failure (HF) randomized controlled trials (RCTs) and to determine the strength and credibility of subgroup claims. Methods and results All primary HF RCTs published in nine high‐impact journals from 1 January 2008 to 31 December 2017 were included. Multivariable regression analysis was used to identify factors that may favour the reporting of results in specific subgroups. Strength of the subgroup effect claimed was classified into (i) strong, (ii) likely, or (iii) suggestive. Credibility of subgroup claim was scored using a pre‐specified 10 pointer criteria. Of the 261 HF RCTs studied, 107 (41%) reported subgroup analyses. Twenty‐five (23%) RCTs claimed a subgroup effect for the primary outcome of which six (24%) made a strong claim, eight (32%) claimed a likely effect, and 11 (44%) suggested a possible subgroup effect. Seven of the 25 RCTs did not employ interaction testing for subgroup claims of the primary outcome. Three out of 10 pre‐specified credibility criteria were satisfied by half of the trials. Fourteen trials justified the choice of subgroups, and 10 explicitly stated they were underpowered to detect differences within subgroups. Source of funding did not influence the frequency of reporting subgroup analyses (OR 0.53, 95% CI 0.78–3.62, P = 0.52). Conclusions Appropriate credibility criteria were rarely met even by HF RCTs that held strong subgroup claims. Subgroup analyses should be pre‐specified, be adequately powered, present interaction terms, and be replicated in independent data before being integrated into clinical decision making.
ISSN:2055-5822