Statistics from Altmetric.com
What are the benefits of random allocation in clinical studies? John Worrall, a philosopher of science, recently questioned whether evidence-based medicine's advice to base therapeutic decisions on the results of randomised controlled trials (RCTs) could be justified.1 2 Here we provide a response to Worrall and others who challenge the epistemological value of RCTs. Worrall's primary target is the view that RCTs are the only reliable source of evidence for medicine. But in arguing against this strong view, he posits a similarly strong counterposition. Worrall argues that randomisation offers no advantage over balanced systematic designs in which experimental and control groups are carefully matched according to known confounders.
The best we can do (as ever) is test our theories against rivals that seem plausible in the light of background knowledge. Once we have eliminated other explanations that we know are possible (by suitable deliberate, or post hoc, control) we have done as much as we can epistemologically.2
Focusing on this claim, we first discuss Worrall's arguments and then provide reasons to reject this view. There are good reasons to randomise studies of therapeutic interventions; principally, RCTs have the capacity to avoid a form of selection bias that cannot be avoided in observational studies.
Worrall draws the conclusion that randomised trials provide no epistemological benefit from two key arguments:
He criticises the notion promoted in the literature that random allocation controls for known and unknown confounders at baseline. Since the number of unknown (possible) causes are ‘innumerable’, “it would clearly be a miracle if all of those factors just happened to be balanced in the two groups. . . on a single random division.”2 Indeed, the probability that any single confounder is unevenly distributed in a given RCT ranges from zero to one.3 Hence, it is possible that some unknown confounder is distributed unevenly between treatment and control and correlated to the effect of intervention that was tested in the RCT.
Randomisation is not essential for controlling selection bias or concealing allocation. Worrall and others4 maintain that these sources of bias can be equally well eliminated in studies that are not randomised.
Neither ‘1’ nor ‘2’, however, undermines the epistemological benefits of RCTs over observational studies in testing the efficacy of therapeutic interventions. Worrall's first argument is only relevant if the point he makes is not understood (however, ‘1’ is well understood by trial analysts) and ‘2’ suffers from serious problems.
Worrall's ‘1’, while not always explicitly discussed, is nonetheless well recognised in the statistical and epidemiological literature. The crucial point is that it is not necessary that randomisation control for all known and unknown confounders at baseline to make valid statistical inferences. It is not necessary for the purpose of issuing a probability statement to know the value of unmeasured covariates; it is sufficient to know their distribution in probability, which randomisation is designed to provide.5 Similar knowledge regarding unmeasured covariates is not possible in observational designs. Analysis of observational studies rely on additional assumptions which cannot be easily verified. In short, Worrall's ‘1’ does not undermine standard frequentist statistical tests.
Another epistemological benefit of RCTs over observational studies is the capacity of RCTs to avoid confounding by indication (choice of treatment bias). Confounding by indication, in which treatment assignment is a function of the risk of future health outcomes (prognosis), is a particular problem for observational studies. Even if we match for a number of known factors, it is difficult in observational designs to rule out or account for all the factors which may influence the physicians' treatment assignment or the patients' reasons to accept or decline the intervention (see, for instance, Collins and MacMahon6). RCTs are prone to other biases but not this one.
Eliminating confounding by indication is a strong argument for interventional studies over observational studies. Worrall's ‘2’ claims that it is possible to use alternatives to randomisation to ensure experimental groups are equally balanced for known confounders and conceal allocation. Worrall does not provide details on these alternatives. Presumably, he is focusing on interventional studies and expressing the idea that it is conceptually possible to devise a method (other than randomisation) that stratifies patients according to relevant prognostic information and allocates them to treatment or control that is independent of the investigator. A number of problems can be raised against this suggestion.
First, alternative methods of allocation are not possible for case-control or cohort studies – in these study designs, the patient decides to (or the patient's circumstances lead them to) take or not take the intervention under investigation. Worrall and Borgerson fail to give sufficient weight to the importance of confounding by indication as a justification for interventional studies (and RCTs in particular) over observational studies.
Second, where randomisation is possible and easily carried out, methods that claim to do better than randomisation should be critically examined. Randomising provides a distinct (and uncontroversial) experimental distribution on which to base statistical inferences; this is the case for frequentist7 8 and Bayesian9 10 approaches to statistical inference. Alternative methods will require a more complicated statistical model and additional assumptions of the data. This makes the appropriate choice of statistical model and analysis more cumbersome and typically more difficult to defend. These problems vitiate the proposed advantages of a non-random methods of allocation in interventional studies.
It is important to note that none of the previous discussion invalidates the importance of observational studies. RCTs are neither necessary nor sufficient to conclusively establish therapeutic claims: not necessary because alternative methods may be appropriate when the effect size is large relative to bias or random error that may possibly obscure it,11 and not sufficient because there is more to assessing a therapeutic claim than success in a randomised trial (eg, assessing if the results from a RCT are generalisable). Furthermore, observational studies may be the best available method for particular questions (eg, assessing rare adverse effects of a medication). The question we are addressing here is: do RCTs offer unique epistemological value above and beyond observational studies?
It should be no surprise that a positive result from a well-conducted RCT falls short of providing conclusive proof of the efficacy of an intervention. A number of assumptions need to hold for a causal conclusion to be valid. Different accounts of causation and different approaches to statistical inference spell out the necessary assumptions in different ways. Nancy Cartwright12 and Dan Steel13 discuss the assumptions that need to hold on two probabilistic accounts of causation. Senn,8 with a focus on clinical trials, discusses assumptions within the frequentist approach. The need for these assumptions does not undermine the importance of RCTs for assessing the efficacy of medical interventions.
RCTs offer unique epistemological advantages that cannot be realised via observational studies. Neither does this mean that the observational studies are unimportant nor does this mean that RCTs are the best method for all questions and in all circumstances (as is reflected in the Oxford Centre for Evidence-based Medicine's Levels of Evidence).14 Rather, (therapeutic) clinical research should be understood as the means to respond to uncertainties about treatment effects of competing interventions, and the design of clinical study (observational vs RCTs) should be matched to the question/uncertainties at hand.15
This article began as a discussion on the Evidence-Based Health email discussion list initiated by BD. The authors would like to acknowledge the members of the EBM discussion group, whose opinion and free sharing of ideas has informed this work.
Competing interests ALC and BD declare no competing interests. SS notes that he is a consultant to the pharmaceutical industry and an academic whose career is furthered by publishing.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.