Rating the certainty in evidence in the absence of a single estimate of effect

M Hassan Murad; Reem A Mustafa; Holger J Schünemann; Shahnaz Sultan; Nancy Santesso

doi:10.1136/ebmed-2017-110668

Article Text

PDF

XML

EBM Primer

Rating the certainty in evidence in the absence of a single estimate of effect

M Hassan Murad1,
Reem A Mustafa2,3,
Holger J Schünemann3,
Shahnaz Sultan4,
Nancy Santesso3

¹ Evidence-based Practice Center, Mayo Clinic, Rochester, Minnesota, USA
² Department of Biomedical and Health Informatics, University of Missouri-Kansas City, Kansas City, Missouri, USA
³ Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
⁴ Division of Gastroenterology, Department of Medicine, University of Minnesota, and Minneapolis Veterans Affairs Health Care System, Minneapolis, Minnesota, USA

Correspondence to Dr M Hassan Murad
, Evidence-based Practice Center, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA; murad.mohammad{at}mayo.edu

https://doi.org/10.1136/ebmed-2017-110668

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Background

Practitioners of evidence-based medicine need to know the level of certainty in the evidence they are applying to patient care. Whether they are using recommendations from a clinical practice guideline based on a systematic review of the literature or using the results directly from a systematic review, they need to know how trustworthy the evidence is with regard to the benefits and harms of a treatment or a diagnostic test. This construct is called certainty or quality of evidence. The GRADE (Grading of Recommendation, Assessment, Development and Evaluation) approach is a modern framework for rating the certainty in evidence.1 Using GRADE, randomised controlled trials and observational studies are considered to generate high and low certainty evidence, respectively. This initial grade that is based on study design is modified using several key domains such as the methodological limitations of the studies, indirectness of the evidence to the question at hand, imprecision of estimates, inconsistency of the evidence, and the likelihood of publication bias. A body of evidence about a specific outcome is downgraded or upgraded to a final rating of high, moderate, low or very low. High certainty in evidence means that the investigators are very confident that the effect they found across studies is close to the true effect, and very low means that they have very little confidence in the effect.1

Often, a single pooled effect estimate from a meta-analysis is available and is used for assessing the certainty in evidence. However, when studies measure outcomes differently or report outcomes in ways that cannot be standardised and meta-analysed, or in situations of urgency, only a narrative synthesis might be available. Consider a systematic review of self-management programmes in patients with chronic obstructive pulmonary disease.2 There were five randomised trials that informed the effect of the intervention on respiratory symptoms. The individual studies presented their results using different tools and measures which precluded pooling. Two trials3 ,4 used the Borg scale to assess respiratory symptoms. One of these trials3 also presented results for respiratory symptom severity. The third trial5 presented results as the proportion of days rated in patients' diaries as having mild, moderate or severe respiratory symptoms. The fourth trial6 presented results about mean breathlessness and sputum production scores over 2-week periods and the fifth trial7 presented results as breathlessness, sputum volume and sputum colour during exacerbations. These studies could not be pooled, but the evidence could be summarised narratively. There is some guidance about how to synthesise the effects of interventions narratively.8 Guidance about how to grade certainty in this evidence is needed. A judgement on the certainty in evidence is still required because certainty is a key component of decision-making. Providing decision makers (patients, clinicians and policymakers) with evidence of unknown trustworthiness compromises their ability to transform evidence to action.9 Decision makers would want to know how confident we are in the effect of these programmes to improve respiratory symptoms before offering such programmes to patients with chronic obstructive pulmonary disease.

The approach

We provide suggestions on the use of GRADE to rate the certainty of evidence when a meta-analysis has not been performed, and instead a narrative summary of the effect was provided. The approach leverages the meaning of the constructs that represent GRADE domains to produce judgements on how these constructs affect our certainty. In table 1, we explain how the GRADE domains (methodological limitations of the studies or risk of bias, indirectness, imprecision, inconsistency and the likelihood of publication bias) can be applied without a single pooled estimate. Note that this guidance does not address meta-narrative reviews10–13 (which answer questions about conceptual underpinnings and understanding of a phenomenon) or qualitative systematic reviews14 (which summarise themes from focus groups and interviews); rather, we address evidence synthesis of quantitative estimates of effect not amenable to meta-analysis (and thus summarised narratively).

View this table:

Table 1

Applying the GRADE approach when evidence for an effect is summarised narratively (a meta-analysis is not available)

Example

In table 2, we again refer to the systematic review of self-management programmes in patients with chronic obstructive pulmonary disease2 and illustrate how we applied the GRADE approach. The outcome of interest in this table is respiratory symptoms which were not pooled in meta-analysis. Evidence derived from five randomised trials showed small to no reductions in respiratory symptoms and was judged to warrant low certainty (rated down for methodological limitations of the included studies and inconsistency). Based on this assessment, decision makers can conclude that self-management programmes may slightly reduce respiratory symptoms. This evidence could also be presented to decision makers in a summary of findings table (typically used in guideline development and generated using GRADEpro which allows narrative summaries of the evidence; https://gradepro.org). Table 3 shows one row of a summary of findings table with explanatory notes. The certainty of evidence in table 3 summarises the GRADE judgements about the different domains (all detailed in table 2) that collectively determined the certainty in evidence for one outcome (respiratory symptoms).

View this table:

Table 2

Illustrative example of rating the certainty in evidence in the absence of a single estimate of effect

View this table:

Table 3

Illustrative example of how the summary of findings can be presented to guideline developers

Discussion

Evidence-based practice is founded on making decisions using the best available evidence, whether it is based on a pooled single effect estimate, or on a narrative review of the individual studies informing each outcome. Stakeholders require that such evidence is appraised and the certainty in the effect is determined in order to inform decision-making. One of the greatest strengths of the GRADE approach is that it provides a systematic method to assess the certainty in evidence and a transparent documentation of the judgements used to assess the body of evidence. While typically it is thought to only apply to results that have been statistically aggregated, evaluating the certainty of evidence can also be performed when results have been narratively summarised.16 In this setting, some certainty domains can be applied directly. For other domains, we have provided additional guidance in which the meaning and connotation of those domains can be used. Taken together, an overall assessment of the evidence can be determined. Stakeholders engaged in shared decision-making in a patient–physician dyad, in guidelines development, or in public health and policy, can then use the summarised effect and the certainty in the evidence to make informed decisions.

References

↵
2. Balshem H ,
3. Helfand M ,
4. Schunemann HJ , et al
. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol 2011;64:401–6. doi:10.1016/j.jclinepi.2010.07.015
OpenUrl CrossRef PubMed Web of Science
↵
2. Effing T ,
3. Monninkhof EM ,
4. van der Valk PD , et al
. Self-management education for patients with chronic obstructive pulmonary disease. Cochrane Database Syst Rev 2007;(4):CD002990. doi:10.1002/14651858.CD002990.pub2
↵
2. Gourley GA ,
3. Portner TS ,
4. Gourley DR , et al
. Humanistic outcomes in the hypertension and COPD arms of a multicenter outcomes study. J Am Pharm Assoc (Wash) 1998;38:586–97. doi:10.1016/S1086-5802(16)30372-2
OpenUrl PubMed
↵
2. Boxall AM ,
3. Barclay L ,
4. Sayers A , et al
. Managing chronic obstructive pulmonary disease in the community. A randomized controlled trial of home-based pulmonary rehabilitation for elderly housebound patients. J Cardiopulm Rehabil 2005;25:378–85.
OpenUrl CrossRef PubMed
↵
2. Watson PB ,
3. Town GI ,
4. Holbrook N , et al
. Evaluation of a self-management plan for chronic obstructive pulmonary disease. Eur Respir J 1997;10:1267–71. doi:10.1183/09031936.97.10061267
OpenUrl Abstract/FREE Full Text
↵
2. Monninkhof E ,
3. van der Valk P ,
4. Schermer T , et al
. Economic evaluation of a comprehensive self-management programme in patients with moderate to severe chronic obstructive pulmonary disease. Chron Respir Dis 2004;1:7–16. doi:10.1191/1479972304cd005oa
OpenUrl CrossRef PubMed
↵
2. Bourbeau J ,
3. Julien M ,
4. Maltais F , et al
. Reduction of hospital utilization in patients with chronic obstructive pulmonary disease: a disease-specific self-management intervention. Arch Intern Med 2003;163:585–91. doi:10.1001/archinte.163.5.585
OpenUrl CrossRef PubMed Web of Science
↵
2. Popay J ,
3. Roberts H ,
4. Sowden A , et al
. Guidance on the conduct of narrative synthesis in systematic reviews: a product from the ESRC Methods Programme. 2006. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.178.3100&rep=rep1&type=pdf (accessed 11 Jan 2017).
↵
2. Alonso-Coello P ,
3. Schunemann HJ ,
4. Moberg J , et al
. GRADE Evidence to Decision (EtD) frameworks: a systematic and transparent approach to making well informed healthcare choices. 1: Introduction. BMJ 2016;353:i2016. doi:10.1136/bmj.i2016
OpenUrl FREE Full Text
↵
2. Abu Dabrh AM ,
3. Firwana B ,
4. Cowl CT , et al
. Health assessment of commercial drivers: a meta-narrative systematic review. BMJ Open 2014;4:e003434. doi:10.1136/bmjopen-2013-003434
OpenUrl Abstract/FREE Full Text
↵
2. Domecq JP ,
3. Prutsky G ,
4. Elraiyah T , et al
. Patient engagement in research: a systematic review. BMC Health Serv Res 2014;14:89. doi:10.1186/1472-6963-14-89
OpenUrl CrossRef PubMed
↵
2. Mohammed K ,
3. Nolan MB ,
4. Rajjo T , et al
. Creating a patient-centered health care delivery system: a systematic review of health care quality from the patient perspective. Am J Med Qual 2016;31:12–21. doi:10.1177/1062860614545124
OpenUrl CrossRef PubMed
↵
2. Wong G ,
3. Greenhalgh T ,
4. Westhorp G , et al
. RAMESES publication standards: meta-narrative reviews. BMC Med 2013;11:20. doi:10.1186/1741-7015-11-20
OpenUrl CrossRef PubMed
↵
2. Lewin S ,
3. Glenton C ,
4. Munthe-Kaas H , et al
. Using qualitative evidence in decision making for health and social interventions: an approach to assess confidence in findings from qualitative evidence syntheses (GRADE-CERQual). PLoS Med 2015;12:e1001895. doi:10.1371/journal.pmed.1001895
OpenUrl
2. Guyatt GH ,
3. Oxman AD ,
4. Kunz R , et al
. GRADE guidelines 6. Rating the quality of evidence—imprecision. J Clin Epidemiol 2011;64:1283–93. doi:10.1016/j.jclinepi.2011.01.012
OpenUrl CrossRef PubMed
↵
Thayer KA, Schünemann HJ. Using GRADE to respond to health threats with different levels of urgency. Environment International 2016;92-93:585–9. doi:10.1016/j.envint.2016.03.027

Footnotes

Competing interests None declared.
Provenance and peer review Not commissioned; internally peer reviewed.

[1] ↵

Balshem H ,
Helfand M ,
Schunemann HJ , et al
. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol 2011;64:401–6. doi:10.1016/j.jclinepi.2010.07.015
OpenUrl CrossRef PubMed Web of Science

[3] Balshem H ,

[4] Helfand M ,

[5] Schunemann HJ , et al

[6] ↵

Effing T ,
Monninkhof EM ,
van der Valk PD , et al
. Self-management education for patients with chronic obstructive pulmonary disease. Cochrane Database Syst Rev 2007;(4):CD002990. doi:10.1002/14651858.CD002990.pub2

[8] Effing T ,

[9] Monninkhof EM ,

[10] van der Valk PD , et al

[11] ↵

Gourley GA ,
Portner TS ,
Gourley DR , et al
. Humanistic outcomes in the hypertension and COPD arms of a multicenter outcomes study. J Am Pharm Assoc (Wash) 1998;38:586–97. doi:10.1016/S1086-5802(16)30372-2
OpenUrl PubMed

[13] Gourley GA ,

[14] Portner TS ,

[15] Gourley DR , et al

[16] ↵

Boxall AM ,
Barclay L ,
Sayers A , et al
. Managing chronic obstructive pulmonary disease in the community. A randomized controlled trial of home-based pulmonary rehabilitation for elderly housebound patients. J Cardiopulm Rehabil 2005;25:378–85.
OpenUrl CrossRef PubMed

[18] Boxall AM ,

[19] Barclay L ,

[20] Sayers A , et al

[21] ↵

Watson PB ,
Town GI ,
Holbrook N , et al
. Evaluation of a self-management plan for chronic obstructive pulmonary disease. Eur Respir J 1997;10:1267–71. doi:10.1183/09031936.97.10061267
OpenUrl Abstract/FREE Full Text

[23] Watson PB ,

[24] Town GI ,

[25] Holbrook N , et al

[26] ↵

Monninkhof E ,
van der Valk P ,
Schermer T , et al
. Economic evaluation of a comprehensive self-management programme in patients with moderate to severe chronic obstructive pulmonary disease. Chron Respir Dis 2004;1:7–16. doi:10.1191/1479972304cd005oa
OpenUrl CrossRef PubMed

[28] Monninkhof E ,

[29] van der Valk P ,

[30] Schermer T , et al

[31] ↵

Bourbeau J ,
Julien M ,
Maltais F , et al
. Reduction of hospital utilization in patients with chronic obstructive pulmonary disease: a disease-specific self-management intervention. Arch Intern Med 2003;163:585–91. doi:10.1001/archinte.163.5.585
OpenUrl CrossRef PubMed Web of Science

[33] Bourbeau J ,

[34] Julien M ,

[35] Maltais F , et al

[36] ↵

Popay J ,
Roberts H ,
Sowden A , et al
. Guidance on the conduct of narrative synthesis in systematic reviews: a product from the ESRC Methods Programme. 2006. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.178.3100&rep=rep1&type=pdf (accessed 11 Jan 2017).

[38] Popay J ,

[39] Roberts H ,

[40] Sowden A , et al

[41] ↵

Alonso-Coello P ,
Schunemann HJ ,
Moberg J , et al
. GRADE Evidence to Decision (EtD) frameworks: a systematic and transparent approach to making well informed healthcare choices. 1: Introduction. BMJ 2016;353:i2016. doi:10.1136/bmj.i2016
OpenUrl FREE Full Text

[43] Alonso-Coello P ,

[44] Schunemann HJ ,

[45] Moberg J , et al

[46] ↵

Abu Dabrh AM ,
Firwana B ,
Cowl CT , et al
. Health assessment of commercial drivers: a meta-narrative systematic review. BMJ Open 2014;4:e003434. doi:10.1136/bmjopen-2013-003434
OpenUrl Abstract/FREE Full Text

[48] Abu Dabrh AM ,

[49] Firwana B ,

[50] Cowl CT , et al

[51] ↵

Domecq JP ,
Prutsky G ,
Elraiyah T , et al
. Patient engagement in research: a systematic review. BMC Health Serv Res 2014;14:89. doi:10.1186/1472-6963-14-89
OpenUrl CrossRef PubMed

[53] Domecq JP ,

[54] Prutsky G ,

[55] Elraiyah T , et al

[56] ↵

Mohammed K ,
Nolan MB ,
Rajjo T , et al
. Creating a patient-centered health care delivery system: a systematic review of health care quality from the patient perspective. Am J Med Qual 2016;31:12–21. doi:10.1177/1062860614545124
OpenUrl CrossRef PubMed

[58] Mohammed K ,

[59] Nolan MB ,

[60] Rajjo T , et al

[61] ↵

Wong G ,
Greenhalgh T ,
Westhorp G , et al
. RAMESES publication standards: meta-narrative reviews. BMC Med 2013;11:20. doi:10.1186/1741-7015-11-20
OpenUrl CrossRef PubMed

[63] Wong G ,

[64] Greenhalgh T ,

[65] Westhorp G , et al

[66] ↵

Lewin S ,
Glenton C ,
Munthe-Kaas H , et al
. Using qualitative evidence in decision making for health and social interventions: an approach to assess confidence in findings from qualitative evidence syntheses (GRADE-CERQual). PLoS Med 2015;12:e1001895. doi:10.1371/journal.pmed.1001895
OpenUrl

[68] Lewin S ,

[69] Glenton C ,

[70] Munthe-Kaas H , et al

[71] Guyatt GH ,
Oxman AD ,
Kunz R , et al
. GRADE guidelines 6. Rating the quality of evidence—imprecision. J Clin Epidemiol 2011;64:1283–93. doi:10.1016/j.jclinepi.2011.01.012
OpenUrl CrossRef PubMed

[73] Guyatt GH ,

[74] Oxman AD ,

[75] Kunz R , et al

[76] ↵
Thayer KA, Schünemann HJ. Using GRADE to respond to health threats with different levels of urgency. Environment International 2016;92-93:585–9. doi:10.1016/j.envint.2016.03.027

Log in using your username and password

Main menu

Log in using your username and password

You are here

Statistics from Altmetric.com

Request Permissions

Background

The approach

Example

Discussion

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password