Systematic reviews to evaluate causation: an overview of methods and application

Khalid S Khan; Elizabeth Ball; Caroline E Fox; Catherine Meads

doi:10.1136/ebmed-2011-100287

Methods

Khalid S Khan¹,², Elizabeth Ball¹,², Caroline E Fox³,⁴, Catherine Meads²

¹Department of Obstetrics and Gynaecology, Royal London Hospital, London, UK
²Centre for Primary Care and Public Health, Queen Mary University of London, London, UK
³Birmingham Women's Hospital NHS Foundation Trust, Birmingham, UK
⁴Reproduction, Genes and Development, School of Clinical and Experimental Medicine, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK

Correspondence to Khalid S Khan, Centre for Primary Care and Public Health, Queen Mary University of London, Yvonne Carter Building, 58 Turner St, Whitechapel, London, E1 2AB, UK; k.s.khan{at}qmul.ac.uk

Introduction

Good-quality systematic reviews inform evidence-based decision making, but they usually focus on diagnosis or effectiveness of treatment rather than the aetiology of a condition. The cause of many medical conditions is multifactorial and can be hard to establish,1,2 yet an understanding of disease causation underpins medical education, practice and research. Systematic reviews that evaluate aetiological evidence are therefore much needed. This article outlines methods for reviewing causation by collating evidence on the relationship between exposures and putative clinical outcomes.

Assessing causation using systematic reviews

Systematic reviews are robust pieces of research based upon a clearly formulated question, from which it is possible to identify relevant studies, appraise their quality and summarise the evidence using scientific and replicable methods. It is the use of an explicit and systematic approach that differentiates such reviews from anecdotal evidence, expert commentaries and narrative reviews. A causation review requires specific steps to transparently investigate causal criteria (table 1).

Table 1

Common criteria for causation and their assessment through systematic reviews

Causal criteria such as strength, consistency and temporality of association (this is not a comprehensive list) were originally derived from studies in microbiology3 and epidemiology2 but have now gained wider acceptance within clinical medical research.4–6 Causation reviews are difficult to identify because no specific medical subject headings (MeSH terms) are used to index them in searchable databases. However, there are MeSH terms for aetiology (and causality) that can be combined with search filters for systematic reviews.7
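As an illustration of combining aetiology terms with a systematic review filter, the sketch below assembles a boolean search string. The PubMed-style field tags, the `systematic[sb]` subset filter and the helper name `build_causation_query` are assumptions chosen for the example; they are not the filter described in reference 7, and the syntax would need adapting to the database being searched.

```python
def build_causation_query(condition: str) -> str:
    """Combine a condition with aetiology/causality terms and a review filter.

    All field tags below are assumed PubMed-style syntax, used for
    illustration only.
    """
    aetiology_terms = [
        '"Causality"[MeSH Terms]',
        '"etiology"[Subheading]',
    ]
    review_filter = "systematic[sb]"  # assumed systematic review subset filter

    # Aetiology terms are alternatives (OR); the three concepts must all
    # be present (AND).
    aetiology_block = "(" + " OR ".join(aetiology_terms) + ")"
    return f"({condition}) AND {aetiology_block} AND {review_filter}"


query = build_causation_query('"Pre-Eclampsia"[MeSH Terms]')
print(query)
```

Keeping the condition, the aetiology terms and the review filter as separate blocks makes it easy to document each component of the search in the review protocol.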

In addition, evidence for causation of health disorders may be concealed within the results sections or tables of a published review and not necessarily labelled as such. For example, when applying the experimental evidence criterion (see table 1), the presumed causal agent may be removed within a randomised controlled trial. If the condition under investigation is absent after treatment but present in the control group, a causal relationship may be inferred. One such intervention is the removal of visible areas of endometriosis and restoration of anatomy by division of adhesions to treat pelvic pain associated with endometriosis, which has been investigated in a recent Cochrane review.8

Setting up a causation systematic review

At the outset, any hypothesis concerning strength, consistency and temporality should be specified. The mechanisms behind the development of the condition under investigation should be explored and the disease process outlined, including aetiological factors, pathophysiology and clinical manifestations.9 Studying aetio-pathogenesis in this way involves evaluating the proposed biological pathway in relevant animal, laboratory or human studies.10 Multiple sources should be searched, and the search terms should identify literature on the events in the biological pathway, including unpublished studies, to avoid the considerable risk of publication bias in the basic science literature.9–11 Standard systematic review techniques need to be used to ensure rigour in the final conclusions regarding causality.12,13

Assessing study design and quality

Epidemiological principles relevant to studying causation without bias include prospective design, measurement of the causal agent, correct temporality of the association and control for confounding, although this list is not exhaustive. In observational studies, adjustment using multivariable modelling should be considered as a way of controlling for known confounding variables. There are as yet no validated quality assessment checklists or scores for causality systematic reviews, so any quality assessment will need to acknowledge the debate on what constitutes good quality in this area. The situation is similar in other new areas for systematic reviews, such as diagnostic yield, where quality assessment checklists are evolving.14
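To illustrate why control for confounding matters, the sketch below compares a crude OR with a Mantel-Haenszel OR stratified by a single confounder. Stratification is used here as a simpler stand-in for the multivariable modelling mentioned above, and the counts are hypothetical, chosen so that the confounder alone produces the crude association.

```python
from typing import List, Tuple

# One 2x2 table per stratum of the confounder:
# (exposed cases, exposed non-cases, unexposed cases, unexposed non-cases)
Table = Tuple[int, int, int, int]

# Hypothetical counts for illustration only.
STRATA: List[Table] = [
    (10, 90, 20, 180),  # confounder absent
    (80, 20, 40, 10),   # confounder present
]


def odds_ratio(table: Table) -> float:
    """Odds ratio of a single 2x2 table."""
    a, b, c, d = table
    return (a * d) / (b * c)


def collapse(strata: List[Table]) -> Table:
    """Sum the strata cell by cell, ignoring the confounder."""
    return tuple(sum(cells) for cells in zip(*strata))


def mantel_haenszel_or(strata: List[Table]) -> float:
    """Mantel-Haenszel summary odds ratio across strata."""
    numerator = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    denominator = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    return numerator / denominator


crude_or = odds_ratio(collapse(STRATA))
adjusted_or = mantel_haenszel_or(STRATA)
print(f"crude OR = {crude_or:.2f}, adjusted OR = {adjusted_or:.2f}")
```

In these made-up data the OR within each stratum is 1.0, so the apparent association in the collapsed table is produced entirely by the confounder. A review would record whether each primary study reported adjusted or only crude estimates.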

Detailed quality assessment is helpful in exploring heterogeneity and in generating inferences.10 In this regard, study design can be very informative. For example, when randomisation is not feasible, research may use a cohort or case-control design. These designs were used to evaluate causation in studies on fetal exposure to potentially harmful maternal drugs.15 Dolovich et al15 reported an increase in major fetal malformations after benzodiazepine exposure in utero in a meta-analysis of case-control studies, but not of cohort studies (figure 1). The difference in results between cohort and case-control studies is likely to be explained by recall bias or poor selection of the control group, and it weakens our inference about causation.13,16

Figure 1

Quality assessment of studies: association of major malformations with prenatal exposure to benzodiazepines (adapted from refs 13 and 15; point estimates of OR >1.0 indicate an association of malformations with exposure to benzodiazepines compared to no exposure).

Data synthesis

Formal examination of hypotheses concerning consistency, strength, temporality, dose-response relationship and the biological plausibility of suggested mechanisms facilitates the assessment of causality. Strength of association can be subdivided into magnitude (ie, how large the effect was) and precision (eg, p values and CIs). For example, a meta-analysis of observational studies showed that bicycle helmets reduce the risk of head injuries in cyclists involved in a crash (OR 0.31, 95% CI 0.26 to 0.37).17 The point estimate of the OR and the upper limit of the CI suggest a large, precise effect and, thus, a high degree of confidence in a causal association. In another example, the OR for exposure to benzodiazepines in pregnancy and association with major malformations in the newborn baby was 1.43 (0.89 to 2.31).15 This result suggests a great deal of uncertainty about a causal association, as the CI includes the possibility of no association at all. With regard to magnitude, ORs above 2 or below 0.5 are often considered worthy of further causal exploration if derived from good-quality studies. In a randomised controlled trial (RCT) of folic acid supplementation starting before pregnancy for the prevention of neural tube defects, the association was large (RR 0.28, 95% CI 0.12 to 0.71).18 When generated through randomised trials, such large effects are sufficiently compelling to lead to public policy changes. The precision achieved through meta-analysis does not by itself prove causation. One has to beware of the effects of bias and confounding. Even large effects from observational studies need careful consideration when inferring causation.19
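The distinction between magnitude and precision can be sketched in a few lines. The example below applies the rules of thumb from the paragraph above (ORs above 2 or below 0.5; a CI that excludes 1) to the three estimates quoted in the text, and back-calculates the standard error of the log ratio from each CI. The class and function names are illustrative, and the intervals are assumed to be 95% CIs that are symmetric on the log scale.

```python
import math
from dataclasses import dataclass


@dataclass
class Estimate:
    label: str
    point: float  # OR or RR
    lower: float  # lower confidence limit
    upper: float  # upper confidence limit


def log_standard_error(est: Estimate, z: float = 1.96) -> float:
    """Back-calculate the SE of the log ratio, assuming a 95% CI."""
    return (math.log(est.upper) - math.log(est.lower)) / (2 * z)


def excludes_null(est: Estimate) -> bool:
    """True when the CI does not contain 1 (no association)."""
    return est.lower > 1.0 or est.upper < 1.0


def large_magnitude(est: Estimate) -> bool:
    """Rule of thumb from the text: ratio above 2 or below 0.5."""
    return est.point > 2.0 or est.point < 0.5


# Estimates as quoted in the text (refs 17, 15 and 18).
helmets = Estimate("helmets and head injury (OR)", 0.31, 0.26, 0.37)
benzodiazepines = Estimate("benzodiazepines and malformations (OR)", 1.43, 0.89, 2.31)
folic_acid = Estimate("folic acid RCT (RR)", 0.28, 0.12, 0.71)

for est in (helmets, benzodiazepines, folic_acid):
    print(
        f"{est.label}: large={large_magnitude(est)}, "
        f"CI excludes 1={excludes_null(est)}, "
        f"SE(log)={log_standard_error(est):.3f}"
    )
```

These checks only describe the estimate. They say nothing about bias or confounding, which still have to be judged from the design and quality of the underlying studies.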

Statistical analysis for heterogeneity and graphical representations may be used to explore consistency (figures 1 and 2). Statistical heterogeneity in the associations observed may arise when the selected studies show qualitatively opposite results (eg, both positive and negative associations seen in the point estimates of ORs among cohort studies in figure 1). In this case the likelihood of there being a causal association will be low. Statistical heterogeneity in observed effects may also arise when the selected studies show qualitatively similar results (eg, positive associations of different effect sizes). In this case, establishing a causal association may benefit from examination of a ‘dose-response’ relationship.
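One common way to quantify statistical heterogeneity is Cochran's Q with the derived I² statistic. The sketch below computes both under a fixed-effect, inverse-variance model. The log ORs and standard errors are hypothetical: one set of studies agrees in direction and size, the other includes qualitatively opposite results.

```python
def heterogeneity(log_effects, std_errors):
    """Return (Cochran's Q, degrees of freedom, I-squared in percent)."""
    weights = [1.0 / se ** 2 for se in std_errors]
    pooled = sum(w * y for w, y in zip(weights, log_effects)) / sum(weights)

    # Q is the weighted sum of squared deviations from the pooled estimate.
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, log_effects))
    df = len(log_effects) - 1

    # I-squared: share of total variation beyond chance, floored at zero.
    i_squared = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return q, df, i_squared


# Hypothetical log odds ratios with equal standard errors.
consistent = heterogeneity([0.40, 0.45, 0.50], [0.2, 0.2, 0.2])
conflicting = heterogeneity([-0.5, 0.6, 0.1, 0.9], [0.2, 0.2, 0.2, 0.2])

print("consistent studies:  Q=%.2f, df=%d, I2=%.0f%%" % consistent)
print("conflicting studies: Q=%.2f, df=%d, I2=%.0f%%" % conflicting)
```

A high I² does not say which kind of heterogeneity is present. Whether the studies disagree in direction or only in size still has to be read from the forest plot.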

When feasible, combining study results (meta-analysis) may be used to evaluate strength of association. Subgroup meta-analyses may be useful to explore the effect of study quality, temporality and dose-response relationships. In one example regarding the role of homocysteine in pre-eclampsia20 (figure 2), subgroup meta-analysis considering temporality of association reassured us that the association met this causal criterion, even though the strength of association among temporal studies was less than that overall.

Figure 2

Temporality of association: subgroup meta-analysis on the role of homocysteine in pre-eclampsia (adapted from ref 20; point estimates of weighted mean difference >0 µmol/l indicate an association of hyperhomocysteinaemia with pre-eclampsia).
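A subgroup meta-analysis by temporality can be sketched as an inverse-variance pooled estimate computed once for all studies and once for the studies in which exposure was measured before the outcome developed. The values below are hypothetical and are not the data behind figure 2. A fixed-effect model is assumed for simplicity.

```python
import math

# Hypothetical studies: (weighted mean difference in µmol/l, standard error,
# True if exposure was measured before the outcome developed).
STUDIES = [
    (2.0, 0.5, False),
    (3.0, 0.5, False),
    (1.0, 0.5, True),
    (1.4, 0.5, True),
]


def pool(studies):
    """Fixed-effect pooled estimate with a 95% CI: (estimate, lower, upper)."""
    weights = [1.0 / se ** 2 for _, se, _ in studies]
    total = sum(weights)
    estimate = sum(w * md for w, (md, _, _) in zip(weights, studies)) / total
    se = math.sqrt(1.0 / total)
    return estimate, estimate - 1.96 * se, estimate + 1.96 * se


overall = pool(STUDIES)
temporal = pool([study for study in STUDIES if study[2]])

print("all studies:      %.2f (%.2f to %.2f)" % overall)
print("temporal studies: %.2f (%.2f to %.2f)" % temporal)
```

In this made-up example the pooled difference among temporal studies is smaller than the overall estimate, but its CI still excludes zero. That is the pattern the text describes as meeting the temporality criterion.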

A biological gradient can be explored through meta-regression, which investigates the effect of one or more study characteristics on the size of the treatment effect, taking precision into account. Hooper et al4 used meta-regression to show the association between modified dietary fat intake and cardiovascular disease. A genuine relation may be inferred when the slope is significantly different from zero. In another example of dose-response (table 2), Ronksley et al6 observed greater cardiac protection with increasing alcohol intake. In addition, when biological plausibility is demonstrated, our confidence in a causal association is increased through the existence of a disease mechanism; one example is a favourable change in biomarkers of coronary heart disease (higher levels of high-density lipoprotein cholesterol and adiponectin and lower levels of fibrinogen) in those who drank moderate amounts of alcohol.21

Table 2

Dose-response relationship: pooled RRs (95% CI) for cardiovascular outcomes (number of pooled studies in parentheses after each effect estimate; heterogeneity statistic unavailable)
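A dose-response slope of the kind estimated by meta-regression can be sketched as a weighted least-squares line through study-level log RRs, with inverse-variance weights. The doses, log RRs and standard errors below are hypothetical and do not reproduce table 2. The model is a fixed-effect meta-regression with one covariate and assumes no residual heterogeneity.

```python
import math


def meta_regression(doses, log_rrs, std_errors):
    """Inverse-variance weighted regression of log RR on dose.

    Returns (slope, intercept, standard error of slope, z statistic).
    """
    weights = [1.0 / se ** 2 for se in std_errors]
    total = sum(weights)
    x_bar = sum(w * x for w, x in zip(weights, doses)) / total
    y_bar = sum(w * y for w, y in zip(weights, log_rrs)) / total

    s_xx = sum(w * (x - x_bar) ** 2 for w, x in zip(weights, doses))
    s_xy = sum(
        w * (x - x_bar) * (y - y_bar)
        for w, x, y in zip(weights, doses, log_rrs)
    )

    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    se_slope = math.sqrt(1.0 / s_xx)  # valid when study variances are known
    return slope, intercept, se_slope, slope / se_slope


# Hypothetical exposure categories (arbitrary dose units) and log RRs.
slope, intercept, se_slope, z = meta_regression(
    doses=[0, 1, 2, 3],
    log_rrs=[0.0, -0.1, -0.2, -0.3],
    std_errors=[0.05, 0.05, 0.05, 0.05],
)
print(f"slope per dose unit = {slope:.3f} (SE {se_slope:.3f}), z = {z:.2f}")
```

A slope whose z statistic lies beyond ±1.96 differs from zero at the conventional 5% level. In practice a random-effects meta-regression is often preferred, because it allows for heterogeneity not explained by the covariate.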

Conclusion

The strength of any causal inference that can be drawn from a review depends upon the rigour of the review methods employed, that is, the validity of the review. This will be based on responses to several questions, such as: was the initial search adequate; was the quality of the included studies adequate; and were the findings both substantive and statistically significant? Through systematic review, both meta-analytic (quantitative) and criteria-based (qualitative) methods can be used together in making causal inferences.

When evaluating the effect of bias and confounding on causal criteria such as strength, consistency and dose-response of association, there is debate about how often results from RCTs and observational studies correlate.22,23 Conclusions drawn from the similarity of effect estimates generated by studies of different design are valid only if the two groups of studies are similar in all respects other than the design itself. This is not often the case, and medical advice based on observational studies has frequently been overturned when RCT evidence has emerged, so that we now know we should rely on RCTs wherever feasible. For example, our belief about the protective effect of hormone replacement therapy on cardiovascular disease, based on observational studies,24,25 has been overturned by RCT evidence.26,27

Traditionally, qualitative narrative review techniques have been used to assess causal criteria. A systematic review, with judicious use of meta-analysis, provides an elegant solution to this difficult interpretative problem. When synthesising data to test hypotheses concerning causal criteria, there are no accepted scoring systems for deciding whether there is sufficient evidence for causality. Ultimately, these assessments are judgments. However, a systematic approach to evidence synthesis and interpretation is likely to generate more transparent, robust inferences than ad hoc considerations. When evidence is found to be lacking, reviews examining causation may be useful in identifying gaps in research.

References

1. Weed DL. On the use of causal criteria. Int J Epidemiol 1997;26:1137–41.
2. Hill AB. The environment and disease: association or causation? Proc R Soc Med 1965;58:295–300.
3. Evans AS. Causation and disease: the Henle-Koch postulates revisited. Yale J Biol Med 1976;49:175–95.
4. Hooper L, Summerbell CD, Higgins JP, et al. Dietary fat intake and prevention of cardiovascular disease: systematic review. BMJ 2001;322:757–63.
5. Mignini LE, Villar J, Khan KS. Mapping the theories of preeclampsia: the need for systematic reviews of mechanisms of the disease. Am J Obstet Gynecol 2006;194:317–21.
6. Ronksley PE, Brien SE, Turner BJ, et al. Association of alcohol consumption with selected cardiovascular disease outcomes: a systematic review and meta-analysis. BMJ 2011;342:d671.
7. Wong SS, Wilczynski NL, Haynes RB, et al. Developing optimal search strategies for detecting sound clinical prediction studies in MEDLINE. AMIA Annu Symp Proc 2003:728–32.
8. Jacobson TZ, Duffy JM, Barlow D, et al. Laparoscopic surgery for pelvic pain associated with endometriosis. Cochrane Database Syst Rev 2009;4:CD001300.
9. Parascandola M, Weed DL. Causation in epidemiology. J Epidemiol Community Health 2001;55:905–12.
10. Weed DL. Meta-analysis and causal inference: a case study of benzene and non-Hodgkin lymphoma. Ann Epidemiol 2010;20:347–55.
11. Weed DL. Environmental epidemiology: basics and proof of cause-effect. Toxicology 2002;181–182:399–403.
12. Fox C, Mignini L, Khan K. Systematic reviews of research to assess causation: a guide to methods and application. Eur Clinics Obstet Gynecol 2006;1:251–6.
13. Khan K, Kunz R, Kleijnen J, et al. Systematic reviews to support evidence based medicine. Second edition. London: Hodder Arnold 2011.
14. Meads CA, Davenport CF. Quality assessment of diagnostic before-after studies: development of methodology in the context of a systematic review. BMC Med Res Methodol 2009;9:3.
15. Dolovich LR, Addis A, Vaillancourt JM, et al. Benzodiazepine use in pregnancy and major malformations or oral cleft: meta-analysis of cohort and case-control studies. BMJ 1998;317:839–43.
16. Khan KS, Wykes C, Gee H. Benzodiazepine use in pregnancy and major malformations or oral clefts. Quality of primary studies must influence inferences made from meta-analyses. BMJ 1999;319:919.
17. Thompson DC, Rivara FP, Thompson R. Helmets for preventing head and facial injuries in bicyclists. Cochrane Database Syst Rev 2000;2:CD001855.
18. Anonymous. Prevention of neural tube defects: results of the Medical Research Council Vitamin Study. MRC Vitamin Study Research Group. Lancet 1991;338(8760):131–7.
19. Cornfield J, Haenszel W, Hammond EC, et al. Smoking and lung cancer: recent evidence and a discussion of some questions. 1959. Int J Epidemiol 2009;38:1175–91.
20. Mignini LE, Latthe PM, Villar J, et al. Mapping the theories of preeclampsia: the role of homocysteine. Obstet Gynecol 2005;105:411–25.
21. Brien SE, Ronksley PE, Turner BJ, et al. Effect of alcohol consumption on biological markers associated with risk of coronary heart disease: systematic review and meta-analysis of interventional studies. BMJ 2011;342:d636.
22. Concato J, Shah N, Horwitz RI. Randomized, controlled trials, observational studies, and the hierarchy of research designs. N Engl J Med 2000;342:1887–92.
23. Kunz R, Oxman AD. The unpredictability paradox: review of empirical comparisons of randomised and non-randomised clinical trials. BMJ 1998;317:1185–90.
24. Stampfer MJ, Colditz GA. Estrogen replacement therapy and coronary heart disease: a quantitative assessment of the epidemiologic evidence. Prev Med 1991;20:47–63.
25. American College of Physicians. Guidelines for counseling postmenopausal women about preventive hormone therapy. Ann Intern Med 1992;117:1038–41.
26. Humphrey LL, Chan BK, Sox HC. Postmenopausal hormone replacement therapy and the primary prevention of cardiovascular disease. Ann Intern Med 2002;137:273–84.
27. Sare GM, Gray LJ, Bath PM. Association between hormone replacement therapy and subsequent arterial and venous vascular events: a meta-analysis. Eur Heart J 2008;29:2031–41.

Footnotes

Competing interests None.
