AHRQ Series Part II: Methods Guide for Comparative Effectiveness (Guest Editor: Mark Helfand)
Conducting quantitative synthesis when comparing medical interventions: AHRQ and the Effective Health Care Program
Introduction
Comparative effectiveness reviews (CERs) are systematic reviews that summarize the comparative effectiveness and harms of alternative clinical options, and aim to help clinicians, policy makers, and patients make informed treatment choices. Quantitative synthesis, or meta-analysis, is often essential for CERs to provide scientifically rigorous summary information. Quantitative synthesis should be conducted in a transparent and consistent way, with methods reported explicitly. The reasons for this were made clear during the controversy over the safety of rosiglitazone, in which a systematic review that found an increased risk for myocardial infarction [1] spurred heated debate on how to choose appropriate methods for quantitative synthesis [2], [3], [4]; the subsequent Congressional hearing [5] brought these issues further into the spotlight. This story highlighted the fact that basic issues in quantitative synthesis, such as the choice of an effect measure or a model, or how to handle heterogeneity, remain crucial considerations and are often the subject of controversy and debate.
A CER typically evaluates the evidence on multiple alternative interventions, whereas most published meta-analyses compare one intervention with a placebo. Including multiple interventions increases the complexity of quantitative synthesis and requires methods for comparing multiple interventions simultaneously. Evaluating multiple interventions also makes the assessment of similarity among studies, and the decision to combine them, even more challenging. Presenting the results of a meta-analysis from a CER in a way that is useful to decision makers is a further challenge.
The Evidence-based Practice Center (EPC) program of the Agency for Healthcare Research and Quality (AHRQ) [6] is the leading U.S. program providing unbiased and independent CERs. The goal of this article is to summarize our recommendations for conducting quantitative synthesis of therapeutic benefits and harms in CERs for the EPC program, with the aim of improving consistency and transparency. The recommendations cover recurrent issues in the EPC program, and we focus on methods for combining study-level effect measures. First, we discuss considerations for deciding whether to combine studies, followed by discussion of indirect comparisons and the incorporation of indirect evidence. We then describe our recommendations for choosing effect measures and statistical models, giving special attention to combining studies with rare events, and for testing and exploring heterogeneity. Finally, we briefly present recommendations on combining studies of mixed design and on sensitivity analysis. This article is not a comprehensive review of methods.
The recommendations were developed using group discussion and consensus based on current knowledge in the literature [7]. EPC investigators are encouraged to follow these recommendations but may choose to use alternative methods if deemed appropriate. If alternative methods are used, the investigators are required to provide a rationale for their choice and, if appropriate, to state the strengths and limitations of the chosen method, to promote consistency and transparency. In addition, several steps in conducting a meta-analysis require subjective decisions, for example, the decision to combine studies or to incorporate indirect evidence. For each subjective decision, investigators should fully explain how the decision was reached.
Section snippets
Decision to combine studies
The decision to combine studies to produce an overall estimate should depend on whether a meaningful answer to a well-formulated research question can be obtained. The purpose of a meta-analysis should be explicitly stated in the methods section of the CER. The overall purpose of the review is not in itself a justification for conducting a meta-analysis, nor is the existence of a group of studies that address the same treatments. Investigators should avoid statements such as “We conducted a …
Indirect comparisons and consideration of indirect evidence
Multiple alternative interventions for a given condition usually constitute a network of treatments. In its simplest form, a network consists of three interventions, for example, interventions A, B, and C. Randomized controlled trials (RCTs) of A vs. B provide direct evidence on the comparative effectiveness of A vs. B; trials of A vs. C and B vs. C would provide indirect estimates of A vs. B through the “common reference,” C. The inclusion of more interventions would form more complex networks …
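The common-reference calculation described above is often done with the adjusted indirect comparison (Bucher) method: subtract the two direct estimates on the log scale and add their variances. The sketch below illustrates this under hypothetical inputs (the log odds ratios and standard errors are invented for illustration, not taken from any trial).

```python
import math

def bucher_indirect(log_or_ac, se_ac, log_or_bc, se_bc):
    """Adjusted indirect comparison of A vs. B through common reference C.
    Works on the log odds ratio scale; variances add because the two
    direct estimates come from independent sets of trials."""
    log_or_ab = log_or_ac - log_or_bc
    se_ab = math.sqrt(se_ac ** 2 + se_bc ** 2)
    ci = (log_or_ab - 1.96 * se_ab, log_or_ab + 1.96 * se_ab)
    return log_or_ab, se_ab, ci

# Hypothetical direct evidence:
# A vs. C: log OR = -0.5 (SE 0.20); B vs. C: log OR = -0.2 (SE 0.25)
est, se, (lo, hi) = bucher_indirect(-0.5, 0.20, -0.2, 0.25)
```

Note that the indirect estimate's standard error is necessarily larger than either direct one, which is why direct evidence, when available and consistent, is usually preferred.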
Choice of effect measures
Effect measures quantify differences in outcomes, either effectiveness or harms, between treatments in trials (or exposure groups in observational studies). The choice of effect measure is first determined by the type of outcome. For example, relative risk (RR) and odds ratio (OR) are used for a binary outcome, and mean difference is used for a continuous outcome. Effect measures can also be broadly classified into absolute measures, such as risk differences (RDs) or mean differences, and relative …
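For a binary outcome, the three measures named above come directly from the 2x2 table of each study. A minimal sketch, using a hypothetical trial with invented counts:

```python
def effect_measures(events_t, n_t, events_c, n_c):
    """Risk difference, relative risk, and odds ratio from a 2x2 table.
    events_t/n_t: events and sample size in the treatment arm;
    events_c/n_c: the same for the control arm."""
    risk_t, risk_c = events_t / n_t, events_c / n_c
    rd = risk_t - risk_c                    # absolute measure
    rr = risk_t / risk_c                    # relative measure
    odds_t = events_t / (n_t - events_t)
    odds_c = events_c / (n_c - events_c)
    or_ = odds_t / odds_c                   # relative measure on the odds scale
    return rd, rr, or_

# Hypothetical trial: 15/100 events on treatment vs. 25/100 on control
rd, rr, or_ = effect_measures(15, 100, 25, 100)
```

Note how the OR (about 0.53) is further from 1 than the RR (0.60) here; the two diverge as the outcome becomes more common, which is one reason the choice between them matters.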
Choice of statistical model for combining studies
Meta-analysis can be performed using either a fixed or a random effects model. A fixed effects model assumes that there is one single treatment effect across studies. Generally, a fixed effects model is not advised in the presence of significant heterogeneity. In practice, clinical and methodological diversity are always present across a set of included studies. Variation among studies is inevitable whether or not the test of heterogeneity detects it. Therefore, we recommend random effects …
Test and explore statistical heterogeneity
Investigators should assess heterogeneity for each meta-analysis. Visual inspection of forest plots and cumulative meta-analysis plots [46] is useful in the initial assessment of statistical heterogeneity. A test for the presence of statistical heterogeneity, for example, Cochran’s Q test, and a measure of the magnitude of heterogeneity, for example, the I2 statistic [11], [47], are useful and should be reported. Further, interpretation of the Q statistic should consider the limitations of the …
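Both quantities mentioned above follow from the fixed effects weights: Q is the weighted sum of squared deviations from the pooled estimate, and I2 expresses the excess of Q over its degrees of freedom as a percentage of Q. A sketch on invented study estimates:

```python
def q_and_i2(effects, variances):
    """Cochran's Q statistic and the I^2 measure: the proportion of total
    variation across studies attributable to heterogeneity rather than chance."""
    w = [1 / v for v in variances]
    pooled = sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)
    q = sum(wi * (yi - pooled) ** 2 for wi, yi in zip(w, effects))
    df = len(effects) - 1
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return q, i2

# Hypothetical effect estimates (log scale) and variances from four trials
q, i2 = q_and_i2([0.1, 0.3, -0.2, 0.5], [0.02, 0.03, 0.02, 0.04])
```

With few studies the Q test has low power, so a nonsignificant Q does not rule out heterogeneity; reporting I2 alongside it, as recommended above, helps convey magnitude rather than just a yes/no verdict.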
Combining studies of mixed designs
In principle, studies from different randomized trial designs, for example, parallel, cross-over, factorial, or cluster-randomized designs, may be combined in a single meta-analysis. Investigators should perform a comprehensive evaluation of clinical and methodological diversity and statistical heterogeneity to determine whether the trials should actually be combined, and consider any important differences between the different types of trials. For cross-over trials, investigators should first …
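One concrete adjustment when including cluster-randomized trials is to correct for clustering before pooling: an analysis that ignores clustering overstates precision, and a standard approximate fix (described in the Cochrane Handbook) is to divide the sample size by the design effect 1 + (m - 1) x ICC. The cluster size and ICC below are hypothetical:

```python
def effective_sample_size(n, mean_cluster_size, icc):
    """Approximate effective sample size of a cluster-randomized arm,
    deflating n by the design effect 1 + (m - 1) * ICC, where m is the
    average cluster size and ICC the intraclass correlation coefficient."""
    design_effect = 1 + (mean_cluster_size - 1) * icc
    return n / design_effect

# Hypothetical arm: 400 patients in clusters averaging 20 patients, ICC = 0.05
n_eff = effective_sample_size(400, 20, 0.05)
```

Here the design effect is 1.95, so the 400 patients carry roughly the information of 205 independently randomized patients; event counts can be deflated by the same factor before computing effect measures.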
Sensitivity analyses
Completing a CER is a structured process. Investigators make decisions and assumptions in the process of conducting the review and meta-analysis; each of these decisions and assumptions may affect the main findings. Sensitivity analysis should always be conducted in a meta-analysis to investigate the robustness of the results in relation to these decisions and assumptions [60]. Results are robust if decisions and assumptions only lead to small changes in the estimates and do not affect the …
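One common sensitivity analysis of the kind described above is leave-one-out re-pooling: repeat the meta-analysis with each study removed in turn and check whether any single study drives the result. A minimal sketch, using an inverse-variance fixed-effect pool and invented study estimates:

```python
def fixed_effect(effects, variances):
    """Inverse-variance fixed-effect pooled point estimate."""
    w = [1 / v for v in variances]
    return sum(wi * yi for wi, yi in zip(w, effects)) / sum(w)

def leave_one_out(effects, variances, pool=fixed_effect):
    """Re-pool the meta-analysis with each study removed in turn;
    `pool` is any function taking (effects, variances)."""
    results = []
    for i in range(len(effects)):
        sub_e = effects[:i] + effects[i + 1:]
        sub_v = variances[:i] + variances[i + 1:]
        results.append(pool(sub_e, sub_v))
    return results

# Hypothetical log risk ratios; the third trial points the opposite way
estimates = leave_one_out([-0.3, -0.25, 0.4], [0.02, 0.03, 0.02])
```

If the leave-one-out estimates all tell the same story, the pooled result is robust to any single study; a large swing when one study is dropped flags that study for closer scrutiny rather than automatic exclusion.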
Concluding remarks
In this article, we provided our recommendations on important issues in meta-analysis to improve transparency and consistency in conducting CERs. The key points and recommendations for each covered issue are summarized in Table 2. Compared with the Cochrane Handbook, which explains meta-analysis methods in more detail, we focused on selected issues that present particular challenges in CERs. Overall, there is no fundamental inconsistency between our recommendations and the Cochrane Handbook on …
Acknowledgments
The authors would like to acknowledge Susan Norris for participating in the workgroup calls and commenting on an earlier version of this article, Ben Vandermeer for participating in workgroup calls, Christopher Schmid for reviewing and commenting on the article, Mark Helfand and Edwin Reid for editing the article, and Brian Garvey for working on references and formatting the article.
Disclaimer: The views expressed in this article are those of the authors and do not represent the official policies …
References (60)
- et al. Principles for developing guidance: AHRQ and the effective health care program. J Clin Epidemiol (2010).
- et al. The results of direct and indirect treatment comparisons in meta-analysis of randomized controlled trials. J Clin Epidemiol (1997).
- et al. Indirect comparison in evaluating relative efficacy illustrated by antimicrobial prophylaxis in colorectal surgery. Control Clin Trials (2000).
- et al. Initial highly-active antiretroviral therapy with a protease inhibitor versus a non-nucleoside reverse transcriptase inhibitor: discrepancies between direct and indirect meta-analyses. Lancet (2006).
- Indirect comparisons: the mesh and mess of clinical trials. Lancet (2006).
- et al. Meta-analysis in clinical trials. Control Clin Trials (1986).
- et al. Random-effects model for meta-analysis of clinical trials: an update. Contemp Clin Trials (2007).
- et al. The binomial distribution of meta-analysis was preferred to model within-study variability. J Clin Epidemiol (2008).
- et al. Cumulative meta-analysis of clinical trials builds evidence for exemplary medical care. J Clin Epidemiol (1995).
- et al. Summing up evidence: one answer is not always enough. Lancet (1998).
- Meta-regression detected associations between heterogeneous treatment effects and study-level, but not patient-level, factors. J Clin Epidemiol.
- Comparing medical interventions: AHRQ and the effective health-care program. J Clin Epidemiol.
- Effect of rosiglitazone on the risk of myocardial infarction and death from cardiovascular causes. New Engl J Med.
- Meta-analysis of rare events: an update and sensitivity analysis of cardiovascular events in randomized trials of rosiglitazone. Clin Trials.
- Uncertain effects of rosiglitazone on the risk for myocardial infarction and cardiovascular death. Ann Intern Med.
- Fixed vs random effects meta-analysis in rare event studies: the rosiglitazone link with myocardial infarction and cardiac death. Stat Med.
- Hearing on FDA’s role in evaluating safety of avandia.
- Evidence-based Practice Centers.
- Cochrane handbook for systematic reviews of interventions.
- Quantifying heterogeneity in a meta-analysis. Stat Med.
- Detecting and describing heterogeneity in meta-analysis. Stat Med.
- Heterogeneity and statistical significance in meta-analysis: an empirical study of 125 meta-analyses. Stat Med.
- Combination of direct and indirect evidence in mixed treatment comparisons. Stat Med.
- Network meta-analysis for indirect treatment comparisons. Stat Med.
- The transitive fallacy for randomized trials: if A bests B and B bests C in separate trials, is A better than C? BMC Med Res Methodol.
- Simultaneous comparison of multiple treatments: combining direct and indirect evidence. BMJ.
- Indirect comparisons of competing interventions. Health Technol Assess.
- Validity of indirect comparison for estimating efficacy of competing interventions: empirical evidence from published meta-analyses. BMJ.
- Meta-analysis of migraine headache treatments: combining information from heterogeneous designs. J Am Stat Assoc.
- Assessing evidence inconsistency in mixed treatment comparisons. J Am Stat Assoc.
This article was written with support from the Effective Health Care Program at the U.S. Agency for Healthcare Research and Quality.