Meta-Analysis: A Critical Appraisal of The Methodology, Benefits and Drawbacks
Meta-Analysis: A Critical Appraisal of The Methodology, Benefits and Drawbacks
I
and reproducible. Khan et al (2003) describe five steps in
n the 21st century, scientists and policy makers rely on an performing a systematic review to ensure its objectivity:
evidence-based decision-making process to guide them 1. Framing questions for a review
towards early interventions, better treatment methods 2. Identifying relevant work
and structured guidelines which work efficiently and 3. Assessing the quality of studies
reliably to produce the best possible outcomes. The 4. Summarizing the evidence
validity of evidence, and hence the decisions based on this, 5. Interpreting the findings.
relies on the quality of the data, the methodology used to To ensure that the evidence is of highest quality, various
extract them, the robustness and totality of the evidence researchers tried to devise processes, criteria and protocols
and use of relevant methods to analyse them. Hence, to prevent biases and design flaws to improve the quality of
decision makers need to be aware of factors that impact reviews and evidence. The Quality of Reporting of Meta-
the quality of evidence along with any shortcomings in analyses (QUOROM) addresses standards for improving
the process of gathering and processing such evidence. The the quality of meta-analysis of clinical randomized
use of meta-analysis as part of systematic review enables controlled trials (Moher et al, 1999). The Consolidated
inclusion of analysis of quantitative data from independent Standards of Reporting Trials (CONSORT) (Moher et
trials in decision-making processes (Khan et al, 2016). al, 2010) encompass various initiatives to deal with the
However, problems arise in meta-analysis. These include: problems arising from inadequate reporting of randomized
■■ Regressions are often non-linear controlled trials. The Preferred Reporting Items for
■■ Effects are often multivariate rather than univariate Systematic Reviews and Meta-Analyses (PRISMA) (Moher
■■ Coverage can be restricted et al, 2009) is an evidence-based minimum set of items
for reporting in systematic review and meta-analysis.
The authors of PRISMA later introduced the PRISMA
Professor Shahjahan Khan, Professor of Statistics, School of
Agricultural, Computing and Environmental Sciences, protocols (Moher et al, 2015; Shamseer et al, 2015). The
International Centre for Applied Climate Sciences and Meta-analysis Of Observational Studies in Epidemiology
Centre for Health Sciences Research, University of (MOOSE) was proposed by Stroup et al (2000) which
Southern Queensland, Toowoomba, Queensland, Australia contains a checklist of specifications for reporting meta-
Ms Breda Memon, Research and Clinical Nurse, South East analysis of observational studies.
Queensland Surgery and Sunnybank Obesity Centre,
McCullough Centre, Sunnybank, Queensland, Australia
Meta-analysis
© 2019 MA Healthcare Ltd
Professor Muhammed A Memon, Consultant Surgeon, Glass (1976) introduce meta-analysis as a statistical
Mayne Medical School, School of Medicine, University of
Queensland, Brisbane, Queensland, Australia procedure to re-analyse the published statistical results from
Correspondence to: Professor MA Memon
a large number of independent studies on a specific topic
(mmemon@yahoo.com) for the purpose of integrating the findings in the context
of educational research. He felt that when faced with an
mean and standard deviation, or correlation coefficient Outcome assessment: 6) Clear assessment of outcomes via record linkage. 7) Adequacy of the
cohort follow ups – whether the follow ups were completed or less than 20% of patients were
(measuring the relationship of two quantitative outcome lost to follow up.
variables) for continuous outcome variables, and odds
ratio, risk ratio and risk difference for categorical outcome Figure 2. Newcastle–Ottawa scale for assessing the quality of non-randomized
variables, along with the sample size. studies. A1–F6 represent authors of these studies.
British Journal of Hospital Medicine, November 2019, Vol 80, No 11 637
Downloaded from magonlinelibrary.com by 131.172.036.029 on November 16, 2019.
Focus on Statistics
Meta-regression is an extension to subgroup plays a significant role in determining the estimate of the
effect size. It is therefore essential to check if heterogeneity
analyses that allows the effect of continuous is present in the data (Sauerland and Seiler, 2005; Ng et al,
as well as categorical characteristics 2006). If the between-studies variation is not significant,
to be investigated. the meta-analysis becomes simple and straightforward.
However, in many cases the studies are heterogeneous and
predetermined confidence level. Such a critical value and therefore meta-analysis must address this fact in computing
the standard error of the estimator are used to calculate the the pooled effect size and the confidence interval. Some of
margin of error for the confidence interval. the commonly used methods to overcome the heterogeneity
If the raw (individual patient) data from all selected are discussed below. Unfortunately, not all of them are
studies are available, then one could analyse the data by equally effective or provide a real remedy for the problem.
using mega-analysis methods. Unfortunately, the sample
effect size (in individual studies) is a random variable as it Assessing heterogeneity
differs from study to study. The study-specific values of the The presence of heterogeneity among the effect size measure
estimated effect size are not only different but may have is assessed by performing the Cochrane’s Q-statistic or I2
opposing results, producing contradictory evidence. In the statistic (Huedo-Medina et al, 2006). The main problem
face of conflicting evidence from different primary studies, with the Q-statistic is that its value increases as the number
the challenge is to reconcile the results to come up with a of studies in a meta-analysis grows larger. While it is useful
valid estimated common effect size. to detect heterogeneity and inform on the degree of its
In many cases, different studies use different statistical significance, it is unable to describe the extent
measurements such as the median, the minimum and of the presence of true heterogeneity (Huedo-Medina et
maximum values and/or the interquartile range instead of al, 2006).
reporting mean and standard deviation of the quantitative An alternative method for assessing heterogeneity is
outcome variables. Therefore, the effect size of all selected the I2 statistic used in conjunction with its confidence
studies must be converted to the same unit of measurement intervals (Higgins and Thompson, 2002). The I2 statistic
before pooling them. Wan et al (2014), using Hozo’s presents as a percentage of the total variability that can
methodology and improving upon it, provided details on be attributed to true heterogeneity within a set of effect
estimating the sample mean and standard deviation from sizes. Therefore, an I2 statistic equalling 0% suggests that
the sample size, median, range and/or interquartile range. there is no between-study variability and that all variation
observed is a result of sampling error. Conversely, when I2
Inverse variance method and redistribution of weights approaches 100%, it suggests that the observed variation is
The main objective of any meta-analysis is to pool statistics the result of between-study variability rather than sampling
from independent studies with a view to synthesizing them error. A rough guide for the interpretation of I2 statistic is:
to calculate an estimate of the common effect size. There 0–40% might not be important; 30–60% may represent
are different ways of pooling statistics, depending on the moderate heterogeneity; 50–90% may represent substantial
type of weight used for individual studies in computing heterogeneity and 75–100% is considerable heterogeneity
the pooled estimate. (Huedo-Medina et al, 2006; Higgins and Green, 2011).
Conventionally, the inverse variance weights are
commonly used in meta-analysis (Borenstein et al, 2009). Subgroup analysis
This way the weights are redistributed among the studies One way to minimize the impact of heterogeneity in a
depending on the extent of spread of the individual studies. meta-analysis is to group the studies based on the value of
It is well known that the sample variance (a measure of the the individual study variance (Figures 3a and b) (Higgins
spread or the differences between the observed value of and Green, 2011). Studies with similar values of the
the relevant outcome variable and their arithmetic mean) variance are sub-grouped and separate meta-analyses are
is inversely related to the sample size (n), and hence larger conducted on each of the subgroups along with the overall
studies (with higher sample size) should receive higher meta-analysis. However, the problem remains with the
weights under the principle of inverse variance weight estimation of the common effect size of all studies based
than the smaller studies. But in many cases this principle on combining the results from all the subgroups because
may lead to allocating higher than appropriate weight of significant heterogeneity among the subgroups.
to the lower quality smaller studies and vice versa. The
redistribution of weights varies significantly under various Meta-regression
commonly used statistical models especially if there is Meta-regression is a technique for performing a regression
© 2019 MA Healthcare Ltd
significant heterogeneity among the studies. analysis to assess the relationship between the treatment
effects and the study characteristics of interest (e.g. suture vs
Heterogeneity prosthesis) or factors concerning the execution of the study
Since meta-analyses are based on the summary statistics (e.g. allocation sequence concealment) (Figures 3a and b)
of individual studies, the between-studies variation often (Thompson and Higgins, 2002; van Houwelingen et al,
Standard error
Granderath et al (2005) 0 50
Oelschlanger et al (2011) 10 47 12 39 0.69 0.27 1.77 0.38
Watson et al (2014) 8 35 8 75 2.14 0.74 6.18 0.3
Oor et al (2018) 6 30 7 29 0.83 0.25 2.76 0.23
Pooled OR 25 197 29 227 1 0.49 2.05 0.1 2.0 4.0 6.0 10.0 -6 -4 -2 0 2 4 6
Test for heterogeneity Q = 3.01 P = 0.56 I2 % = 0 Favours suture Favours mesh Log odds ratio
Test for overall effect Z = 0.028 P = 0.99
Standard error
Granderath et al (2005) 0 50 0 50 1 0.02 51.39 0.04
Watson et al (2014) 8 35 5 37 1.69 0.5 5.67 0.43
Oor et al (2018) 6 30 7 29 0.83 0.25 2.76 0.43
Figure 3. Forest and funnel plots. In these graphs which were created using random effects model, squares indicate point estimates of treatment
effect with the size of the squares representing the weight attribute to each study. The horizontal lines represent 95% confidence interval for odds
ration. The pooled estimate for complication rate (is the pooled odds ratio obtained by combining all odds ratios of the five studies using the inverse
variance weighted method) and is represented by the diamond and the size of the diamond depicts the 95% confidence interval. Funnel plot showing
no outliers (dots within the triangle) and therefore, no publication bias. a. Analysis based on a total sample of meshes (i.e. both absorbable and non-
absorbable meshes). b. Subgroup analysis based only on non-absorbable meshes. No. = total number of cases; Case = number of patients with
complication; OR = odd ratio; L = lower limit of confidence interval; U = upper limit of confidence interval.
models is the way they allocate the inverse variance weights and Knapp and by Sidik and Jonkman (IntHout et al,
to the individual studies. The objective of redistribution 2014) is claimed to be simple and robust especially when
of weights among the studies, under different statistical there is heterogeneity and the number of studies in the
models, is to find a more precise estimator of the common meta-analysis is small. It is beyond the scope of this article
effect size to achieve a shorter confidence interval. to provide a detailed account of these methods.
British Journal of Hospital Medicine, November 2019, Vol 80, No 11 639
Downloaded from magonlinelibrary.com by 131.172.036.029 on November 16, 2019.
Focus on Statistics
The relative location of the diamond with respect to the research question to the interpretation and discussion
the no-effect vertical line indicates which intervention is of the results, to ensure the quality and value of the final
favoured by the data. If appropriate, subgroup analyses product (Bernard, 2014). BJHM
are also included in the forest plot along with the
combined meta-analysis. Conflict of interest: none.
British Journal of Hospital Medicine, November 2019, Vol 80, No 11 641
Downloaded from magonlinelibrary.com by 131.172.036.029 on November 16, 2019.