Meta-analysis using individual participant data from randomised trials: opportunities and limitations created by access to raw data

Ewelina Rogozińska; Nadine Marlin; Shakila Thangaratinam; Khalid S Khan; Javier Zamora

doi:10.1136/ebmed-2017-110775

Article Text

EBM opinion and debate

Meta-analysis using individual participant data from randomised trials: opportunities and limitations created by access to raw data

Free

Ewelina Rogozińska1,2,
Nadine Marlin3,
Shakila Thangaratinam1,2,
Khalid S Khan1,2,
Javier Zamora1,4

¹ Women’s Health Research Unit, Blizard Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
² Department of Multidisciplinary Evidence Synthesis Hub (mEsh), Barts and the London School of Medicine, Queen Mary University of London, London, UK
³ Pragmatic Clinical Trials Unit, Barts and the London School of Medicine, Queen Mary University of London, London, UK
⁴ Clinical Biostatistics Unit, Hospital Ramon y Cajal (IRYCIS) and CIBER Epidemiology and Public Health, Madrid, Spain

Correspondence to Ewelina Rogozińska, Women’s Health Research Unit, Barts and the London School of Medicine and Dentistry, Queen Mary University London, London E1 2AB, UK; e.a.rogozinska{at}qmul.ac.uk

https://doi.org/10.1136/ebmed-2017-110775

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Meta-analysis using individual participant data (IPD) is becoming increasingly popular, despite being a laborious and resource-intensive method of evidence synthesis compared with a standard review using study-level data.1 It has the potential to overcome the limitations of meta-analyses based on published data through access to raw trial data,1–4 such as standardisation of analysis methods and data across trials1 5 (table 1). Access to IPD can facilitate integrity checks and intention to treat analysis by imputing for missing data. Collation of rarely reported variables for the key outcomes can result in greater precision of the intervention effect and address the problem of selective reporting.1

View this table:

Table 1

Purported theoretical advantages of IPD meta-analysis of clinical trials and practical limitations in achieving them

Existing methodological literature focuses mainly on cost, team’s expertise and management of the collaboration.3 Yet, not much is available on practical challenges associated with data harmonisation and their consequences for IPD meta-analyses. The aim of this article is to discuss some of the opportunities created by access to IPD and the limitations of meta-analysis using IPD as indicated in table 1. We use the i-WIP IPD meta-analysis of 36 trials (12 526 participants from 16 countries; 50 investigators) on the effect of diet and physical activity-based interventions in pregnancy6 as an example (online supplementary appendix 1).

Supplementary Material

Supplementary Appendix 1

[SP1.pdf]

Standardisation of data across trials

Access to IPD should create a unique opportunity to unify all essential data. This is true assuming that collected data can be brought to the same format without losing their value. Routinely collected data such as age, weight or height tend to be captured as real values making them relatively easy to harmonise. Participant characteristics recorded in other formats or those less routinely collected can be much more challenging to standardise. One of the subgroups of interest in the project was maternal ethnic origin.7 The characteristic was available for 47% (17/36) trials of which one differentiated only between indigenous and non-indigenous women, four classified women only as Caucasian or non-Caucasian and eight declared to include only Caucasians or not recognise ‘ethnicity’ in their country. The characteristic was grouped into six categories (Caucasian, Asian, Afro-Caribbean, Central and South American, Middle Eastern and other and unknown) but due to a low proportion of women from groups other than Caucasian (>80% of included women) in the analysis of differential effects of intervention by ethnic origin, the characteristic was used in the binary format (Caucasian/non-Caucasian).6

Harmonisation of outcome definitions faced similar challenges. While some definitions are relatively easy to bring to a common format across the trials, for example, preterm birth, standardisation of others was simply not feasible. The task can be even more daunting when there is no consensus on classification methods, or the definitions changed over the years. Despite access to IPD, direct communication with the research teams and the idea endorsement by the members of the i-WIP collaborative group, standardisation of outcomes such as gestational diabetes (GDM) or caesarean section turned out to be unachievable within the study funding time. Diagnosis of GDM was based on a broad range of guidelines that followed algorithms that did not always overlap with each other. We have made an attempt to standardise the definitions of GDM and collected the blood test measurements used to diagnose the condition. However, the variability in glucose loads (50, 75 or 100 g) and tests’ timing (fasting, 1 hour or 2 hours) leads us to abandon this task and acknowledge the variability in the outcome definition as a limitation. The variety of GDM definitions and the blood test measures, as well as the coding of participants’ ethnic origin in the trials with diet and/or physical activity in pregnancy, is presented in online supplementary appendix 2.

Supplementary Material

Supplementary Appendix 2

[SP2.pdf]

Unreported outcomes

Selective reporting of intervention effects depending on statistical significance is one of the most important sources of bias affecting clinical trials.8–11 Despite clear guidance on reporting of outcomes in the trial reports,12 the problem persists, having a serious impact on the meta-analysis. In combination with variation in choice of trial outcomes,13 they are contributing to the serious waste of research efforts. More frequent reporting of statistically significant results can lead to a potential overestimation of underlying treatment effects in a meta-analysis when using data extracted from trial publications. IPD meta-analysis has the potential to address this problem through facilitating analysis of core outcome sets,14 if available in trial datasets but not reported in publications.

Access to individual records should increase the number of trials included in the analysis and enhance the quality of outcome data. However, the benefits may not always be substantial. In the i-WIP project, the number of trials with the outcomes of interest was higher through access to IPD in comparison with data extracted from publications (online supplementary appendix 3). In addition, use of the raw data to generate outcomes not considered in original trials (eg, use of gestational age at delivery to define the occurrence of prematurity) may lead to a substantial increase in the number of the trial that can be incorporated into the meta-analysis (table 2). Nevertheless, the presence of data in the dataset did not always allow to incorporate a given dataset in the statistical analysis. Too few events (eg, stillbirths) and lack of all measures (baseline and final for weight gain) prevented trial inclusion. Still, in the example, incorporation of trials with previously unavailable outcome data changed the value of the effect estimate by more than 10% in three outcomes and its statistical significance in one (table 2).

Supplementary Material

Supplementary Appendix 3

[SP3.pdf]

The addition of unreported data may or may not lead to a change in funnel plot asymmetry. In the example, incorporation of unpublished outcomes in the meta-analysis for admission to neonatal intensive care unit has not revealed any potential bias. Similarly, for small for gestational age infant where outcome data were generated using raw data, if the outcome was not considered in original trials (table 2). For continuous outcomes (gestational weight gain in the example), the change in the plot asymmetry might also occur due to the standardisation of the analysis methods rather than incorporation of unreported data (figure 1).

View this table:

Table 2

Meta-analysis of trials with diet and/or physical activity-based interventions in pregnancy with available IPD

Figure 1

Comparison of funnel plots between meta-analyses using published and Individual Participant Data (IPD).

Role of IPD meta-analysis in dynamic research areas

The authors of guidance on the appraisal of IPD meta-analyses of randomised trials advocate checking for the proportion of trials from which IPD was obtained.5 A recent study showed that only 25% of evaluated IPD meta-analyses obtained 100% of identified trial data.15 Acquisition of all eligible trials can be challenging for numerous reasons, with uncooperative trial investigators mentioned most commonly.5 IPD meta-analysis is a lengthy and resource-intensive process which can also decrease the chance of complying with the above-mentioned recommendation.

Since the publication of the systematic review that laid the grounds for the IPD meta-analysis we used as an example,16 there has been a significant increase in the number of trials evaluating the effects of diet and/or physical activity-based interventions in pregnancy. Between the end of data acquisition in June 2015 to February 2017, findings from additional 45 trials have been published (figure 2) making achieving the goal of being up-to-date and obtaining the majority of IPD virtually impossible.17 In combination with the trials for which IPD was sought but not obtained, the number of trials outside the IPD meta-analysis (non-IPD studies) constituted 65% of trials (67/103 trials) and 51% of women randomised to all eligible trials (12 960/25 486). The meta-analysis combining IPD with non-IPD studies showed a stronger overall effect of interventions in reduction of gestational weight gain and a significant reduction of odds for GDM than one using only IPD.17

Figure 2

Number of randomised controlled trials with diet and/or physical activity-based interventions provided antenatally.

Summary

Despite the advantages of meta-analysis using IPD, the method encounters problems faced by other research methods such as uncooperative investigators or incompleteness of records. The IPD meta-analysis is a resource-demanding approach to evidence synthesis and requires a thorough evaluation of what is achievable. It might be that we will need to accept that some primary research is not usable for evidence synthesis. Mapping of definitions and additional data that could help to standardise the outcome across the trials may not tackle all the issues but will facilitate the smoother conduct of IPD meta-analyses. The efforts associated with obtaining IPD and its harmonisation need to be balanced by the potential gains achievable through a complex and profound statistical analysis. Prospectively designed IPD meta-analyses have the potential to overcome some of the challenges described in this article as they tend to collect data in a preagreed format.18 Promotion of consensus on the research standards with regard to outcome definitions, capturing of participants’ characteristics and effective ways of implementing them in the trials should help to reduce the potential research waste. Finally, putting the findings of IPD meta-analysis into a context of the totality of evidence is paramount for the validity of results.5 Currently, guidelines recommend adding non-IPD studies to IPD meta-analysis when a substantial proportion of trials IPD was not obtained at the beginning of the project. In addition, in some areas of medical research, the amount of evidence generated annually makes it difficult to stay up-to-date while conducting IPD meta-analysis. Therefore, adding newly published trials is as important as incorporating the not shared ones.

Acknowledgments

We wouldlike to acknowledge all members of Collaborative Group who contributed theirtrial data to the i-WIP IPD meta-analysis funded by The National Institute forHealth Research (NIHR) Health Technology Assessment (HTA) programme (No.12/01/50).

References

1.↵
2. Riley RD ,
3. Lambert PC ,
4. Abo-Zaid G
. Meta-analysis of individual participant data: rationale, conduct, and reporting. BMJ 2010;340:c221. doi:10.1136/bmj.c221
OpenUrl FREE Full Text
2.↵
2. Lambert PC ,
3. Sutton AJ ,
4. Abrams KR , et al
. A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. J Clin Epidemiol 2002;55:86–94. doi:10.1016/S0895-4356(01)00414-0
OpenUrl CrossRef PubMed Web of Science
3.↵
2. Stewart LA ,
3. Tierney JF
. To IPD or not to IPD? Advantages and disadvantages of systematic reviews using individual patient data. Eval Health Prof 2002;25:76–97. doi:10.1177/0163278702025001006
OpenUrl CrossRef PubMed Web of Science
4.↵
2. Tudur Smith C ,
3. Marcucci M ,
4. Nolan SJ , et al
. Individual participant data meta-analyses compared with meta-analyses based on aggregate data. Cochrane Database Syst Rev 2016;9:MR000007. doi:10.1002/14651858.MR000007.pub3
OpenUrl
5.↵
2. Tierney JF ,
3. Vale C ,
4. Riley R , et al
. Individual Participant Data (IPD) meta-analyses of randomised controlled trials: guidance on their use. PLoS Med 2015;12:e1001855. doi:10.1371/journal.pmed.1001855
6.↵
2. Rogozińska E ,
3. Marlin N ,
4. Jackson L , et al
. Effects of antenatal diet and physical activity on maternal and fetal outcomes: individual patient data meta-analysis and health economic evaluation. Health Technol Assess 2017;21:1–158. doi:10.3310/hta21410
OpenUrl
7.↵
2. Ruifrok AE ,
3. Rogozinska E ,
4. van Poppel MN , et al
. Study protocol: differential effects of diet and physical activity based interventions in pregnancy on maternal and fetal outcomes–individual patient data (IPD) meta-analysis and health economic evaluation. Syst Rev 2014;3:131. doi:10.1186/2046-4053-3-131
OpenUrl CrossRef PubMed
8.↵
2. Williamson PR ,
3. Gamble C
. Identification and impact of outcome selection bias in meta-analysis. Stat Med 2005;24:1547–61. doi:10.1002/sim.2025
OpenUrl CrossRef PubMed Web of Science
9.↵
2. Williamson PR ,
3. Gamble C ,
4. Altman DG , et al
. Outcome selection bias in meta-analysis. Stat Methods Med Res 2005;14:515–24. doi:10.1191/0962280205sm415oa
OpenUrl CrossRef PubMed Web of Science
10.↵
2. Kirkham JJ ,
3. Dwan KM ,
4. Altman DG , et al
. The impact of outcome reporting bias in randomised controlled trials on a cohort of systematic reviews. BMJ 2010;340:c365. doi:10.1136/bmj.c365
OpenUrl Abstract/FREE Full Text
11.↵
2. Chan AW ,
3. Altman DG
. Identifying outcome reporting bias in randomised trials on PubMed: review of publications and survey of authors. BMJ 2005;330:753. doi:10.1136/bmj.38356.424606.8F
OpenUrl Abstract/FREE Full Text
12.↵
2. Moher D ,
3. Hopewell S ,
4. Schulz KF , et al
. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. BMJ 2010;340:c869. doi:10.1136/bmj.c869
OpenUrl FREE Full Text
13.↵
2. Clarke M
. Standardising outcomes for clinical trials and systematic reviews. Trials 2007;8:39. doi:10.1186/1745-6215-8-39
OpenUrl CrossRef PubMed
14.↵
2. Clarke M ,
3. Williamson PR
. Core outcome sets and systematic reviews. Syst Rev 2016;5:11. doi:10.1186/s13643-016-0188-6
OpenUrl CrossRef PubMed
15.↵
2. Nevitt SJ ,
3. Marson AG ,
4. Davie B , et al
. Exploring changes over time and characteristics associated with data retrieval across individual participant data meta-analyses: systematic review. BMJ 2017;357:j1390. doi:10.1136/bmj.j1390
OpenUrl Abstract/FREE Full Text
16.↵
2. Thangaratinam S ,
3. Rogozinska E ,
4. Jolly K , et al
. Effects of interventions in pregnancy on maternal weight and obstetric outcomes: meta-analysis of randomised evidence. BMJ 2012;344:e2088. doi:10.1136/bmj.e2088
17.↵
2. Rogozinska E ,
3. Marlin N ,
4. Betran AP , et al
. Effect of diet and physical activity based interventions in pregnancy on gestational weight gain and pregnancy outcomes: meta-analysis of individual participant data from randomised trials. BMJ 2017;358:j3119.
OpenUrl Abstract/FREE Full Text
18.↵
2. Ganzevoort W ,
3. Alfirevic Z ,
4. von Dadelszen P , et al
. STRIDER: Sildenafil therapy in dismal prognosis early-onset intrauterine growth restriction – a protocol for a systematic review with individual participant data and aggregate data meta-analysis and trial sequential analysis. Syst Rev 2014;3:23. doi:10.1186/2046-4053-3-23
OpenUrl CrossRef PubMed

Footnotes

Twitter Follow Ewelina Rogozinska @EaRogozinska
Contributors ER wrote the initial draft of the manuscript and all subsequent drafts after critical review by JZ, NM and KSK. ER is a guarantor for the manuscript.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.

[1] 1.↵

Riley RD ,
Lambert PC ,
Abo-Zaid G
. Meta-analysis of individual participant data: rationale, conduct, and reporting. BMJ 2010;340:c221. doi:10.1136/bmj.c221
OpenUrl FREE Full Text

[3] Riley RD ,

[4] Lambert PC ,

[5] Abo-Zaid G

[6] 2.↵

Lambert PC ,
Sutton AJ ,
Abrams KR , et al
. A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. J Clin Epidemiol 2002;55:86–94. doi:10.1016/S0895-4356(01)00414-0
OpenUrl CrossRef PubMed Web of Science

[8] Lambert PC ,

[9] Sutton AJ ,

[10] Abrams KR , et al

[11] 3.↵

Stewart LA ,
Tierney JF
. To IPD or not to IPD? Advantages and disadvantages of systematic reviews using individual patient data. Eval Health Prof 2002;25:76–97. doi:10.1177/0163278702025001006
OpenUrl CrossRef PubMed Web of Science

[13] Stewart LA ,

[14] Tierney JF

[15] 4.↵

Tudur Smith C ,
Marcucci M ,
Nolan SJ , et al
. Individual participant data meta-analyses compared with meta-analyses based on aggregate data. Cochrane Database Syst Rev 2016;9:MR000007. doi:10.1002/14651858.MR000007.pub3
OpenUrl

[17] Tudur Smith C ,

[18] Marcucci M ,

[19] Nolan SJ , et al

[20] 5.↵

Tierney JF ,
Vale C ,
Riley R , et al
. Individual Participant Data (IPD) meta-analyses of randomised controlled trials: guidance on their use. PLoS Med 2015;12:e1001855. doi:10.1371/journal.pmed.1001855

[22] Tierney JF ,

[23] Vale C ,

[24] Riley R , et al

[25] 6.↵

Rogozińska E ,
Marlin N ,
Jackson L , et al
. Effects of antenatal diet and physical activity on maternal and fetal outcomes: individual patient data meta-analysis and health economic evaluation. Health Technol Assess 2017;21:1–158. doi:10.3310/hta21410
OpenUrl

[27] Rogozińska E ,

[28] Marlin N ,

[29] Jackson L , et al

[30] 7.↵

Ruifrok AE ,
Rogozinska E ,
van Poppel MN , et al
. Study protocol: differential effects of diet and physical activity based interventions in pregnancy on maternal and fetal outcomes–individual patient data (IPD) meta-analysis and health economic evaluation. Syst Rev 2014;3:131. doi:10.1186/2046-4053-3-131
OpenUrl CrossRef PubMed

[32] Ruifrok AE ,

[33] Rogozinska E ,

[34] van Poppel MN , et al

[35] 8.↵

Williamson PR ,
Gamble C
. Identification and impact of outcome selection bias in meta-analysis. Stat Med 2005;24:1547–61. doi:10.1002/sim.2025
OpenUrl CrossRef PubMed Web of Science

[37] Williamson PR ,

[38] Gamble C

[39] 9.↵

Williamson PR ,
Gamble C ,
Altman DG , et al
. Outcome selection bias in meta-analysis. Stat Methods Med Res 2005;14:515–24. doi:10.1191/0962280205sm415oa
OpenUrl CrossRef PubMed Web of Science

[41] Williamson PR ,

[42] Gamble C ,

[43] Altman DG , et al

[44] 10.↵

Kirkham JJ ,
Dwan KM ,
Altman DG , et al
. The impact of outcome reporting bias in randomised controlled trials on a cohort of systematic reviews. BMJ 2010;340:c365. doi:10.1136/bmj.c365
OpenUrl Abstract/FREE Full Text

[46] Kirkham JJ ,

[47] Dwan KM ,

[48] Altman DG , et al

[49] 11.↵

Chan AW ,
Altman DG
. Identifying outcome reporting bias in randomised trials on PubMed: review of publications and survey of authors. BMJ 2005;330:753. doi:10.1136/bmj.38356.424606.8F
OpenUrl Abstract/FREE Full Text

[51] Chan AW ,

[52] Altman DG

[53] 12.↵

Moher D ,
Hopewell S ,
Schulz KF , et al
. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. BMJ 2010;340:c869. doi:10.1136/bmj.c869
OpenUrl FREE Full Text

[55] Moher D ,

[56] Hopewell S ,

[57] Schulz KF , et al

[58] 13.↵

Clarke M
. Standardising outcomes for clinical trials and systematic reviews. Trials 2007;8:39. doi:10.1186/1745-6215-8-39
OpenUrl CrossRef PubMed

[60] Clarke M

[61] 14.↵

Clarke M ,
Williamson PR
. Core outcome sets and systematic reviews. Syst Rev 2016;5:11. doi:10.1186/s13643-016-0188-6
OpenUrl CrossRef PubMed

[63] Clarke M ,

[64] Williamson PR

[65] 15.↵

Nevitt SJ ,
Marson AG ,
Davie B , et al
. Exploring changes over time and characteristics associated with data retrieval across individual participant data meta-analyses: systematic review. BMJ 2017;357:j1390. doi:10.1136/bmj.j1390
OpenUrl Abstract/FREE Full Text

[67] Nevitt SJ ,

[68] Marson AG ,

[69] Davie B , et al

[70] 16.↵

Thangaratinam S ,
Rogozinska E ,
Jolly K , et al
. Effects of interventions in pregnancy on maternal weight and obstetric outcomes: meta-analysis of randomised evidence. BMJ 2012;344:e2088. doi:10.1136/bmj.e2088

[72] Thangaratinam S ,

[73] Rogozinska E ,

[74] Jolly K , et al

[75] 17.↵

Rogozinska E ,
Marlin N ,
Betran AP , et al
. Effect of diet and physical activity based interventions in pregnancy on gestational weight gain and pregnancy outcomes: meta-analysis of individual participant data from randomised trials. BMJ 2017;358:j3119.
OpenUrl Abstract/FREE Full Text

[77] Rogozinska E ,

[78] Marlin N ,

[79] Betran AP , et al

[80] 18.↵

Ganzevoort W ,
Alfirevic Z ,
von Dadelszen P , et al
. STRIDER: Sildenafil therapy in dismal prognosis early-onset intrauterine growth restriction – a protocol for a systematic review with individual participant data and aggregate data meta-analysis and trial sequential analysis. Syst Rev 2014;3:23. doi:10.1186/2046-4053-3-23
OpenUrl CrossRef PubMed

[82] Ganzevoort W ,

[83] Alfirevic Z ,

[84] von Dadelszen P , et al

Log in using your username and password

Main menu

Log in using your username and password

You are here

Statistics from Altmetric.com

Request Permissions

Introduction

Supplementary Material

Standardisation of data across trials

Supplementary Material

Unreported outcomes

Supplementary Material

Role of IPD meta-analysis in dynamic research areas

Summary

Acknowledgments

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password