| HOME | HELP | E-MAIL ALERTS | SEARCH |
|
PEDIATRICS Vol. 114 No. 3 September 2004, pp. 874-877 (doi:10.1542/peds.2004-1260)
| ||||||||||||||||||||||||||||||||||||||||||||||||
POLICY STATEMENT |
| ABSTRACT |
|---|
|
|
|---|
Key Words: practice guidelines evidence-based recommendation classification system
Abbreviations: AAP, American Academy of Pediatrics
| INTRODUCTION |
|---|
|
|
|---|
The contemporary, evidence-based approach to guideline development differs from other methods of creating policy in several ways, including:
Like all scientists, evidence-based guideline developers define their methods first and then allow their methods to lead to the results rather than deciding first on the outcome.
A variety of systems have been used to convey information to guideline readers regarding the quality of evidence that supports a given recommendation and the strength assigned to the recommendation by guideline developers. A large number of numeric and alphabetic codes contribute to general confusion about the meaning of these scales. A recent evidence report prepared by the Agency for Healthcare Research and Quality found 121 scales, checklists, or other types of instruments for rating evidence quality.1
The American Academy of Pediatrics (AAP) develops clinical practice guidelines internally through various entities and in collaboration with other organizations and also considers for endorsement guidelines developed by external organizations. Clearly, the method used to classify recommendations in clinical practice guidelines should be consistent and explicit. A unified approach will facilitate communication between guideline developers and users and promote appropriate interpretation and application of guidelines by clinicians.
The objective of this statement is to describe a system for defining recommendation strength for AAP evidence-based practice guidelines that is clear, informative, and helpful to users. A common current approach to indicate the strength of a recommendation is to append a term to describe the "level of consensus" or, in some instances, fervor achieved. Because this judgment may not reflect unanimity of the development team and can be influenced disproportionately by vociferous or persuasive team members, it represents a lapse in an otherwise explicit system to link guideline statements to the strength of evidence and recommendation.
The proposed system was derived from existing systems that support the principle of using explicit criteria for guideline development. This system is intended for use by the AAP in the process of developing evidence-based clinical practice guidelines. This statement describes 3 sequential processes in evidence-based policy setting: 1) determination of evidence quality in support of a proposed recommendation; 2) evaluation of the balance between anticipated benefits and harms when the recommendation is carried out; and 3) designation of recommendation strength.
| CLASSIFICATION PROCEDURE |
|---|
|
|
|---|
Highest-quality evidence for therapeutic interventions comes from well-designed and well-conducted randomized, controlled trials performed on a population similar to the guideline's target population. The lowest-quality evidence is derived from case reports, reasoning from first principles of pathophysiology, and expert opinion based on ill-defined "clinical experience." Intermediate-quality evidence is associated with a randomized, controlled trial with "nonfatal flaws" or methodologic limitations (for example, one performed on a group from a population different from the target population, therefore requiring that findings be extrapolated) or with an observational design such as a case-control or cohort study. For studies of diagnostic tests, the representativeness of the population studied, the adequacy of the description of the test, the appropriateness of the reference standard against which the test is compared, and the methods used to avoid bias in interpretation of results (such as blinded comparison with the reference standard) are criteria for judging quality.3
After systematically reviewing the literature for studies that bear on a policy decision, evidence-based guideline developers must carefully review each study, extract the findings, and appraise both the quality of the study design and its execution. The specific quality criteria applied depend on the design and type of study. For example, appraisal of controlled trials requires consideration of the adequacy of randomization and blinding and loss to follow-up; assessing case-control studies requires consideration of the appropriateness of matching cases and controls.
Next, guideline developers must consider the quality of the aggregate of studies that bear on the issue. Judging the strength of a body of evidence requires careful consideration of the consistency of the results of individual studies, the magnitude of the effect that the studies detect, and the individual and aggregate sample sizes of these studies.1
2.Assess Anticipated Balance Between Benefits and Harms
The anticipated benefit, harm, risk, and cost of adherence to a guideline recommendation constitute the second factor that influences the strength of a recommendation. Guyatt et al4 suggest looking at the clarity of the balance between benefits and harms. When the evidence indicates a clear benefit not offset by important harms or costs or a clear harm not mitigated by important benefit, stronger recommendations are possible. On the other hand, when the magnitude of the benefit is small or benefits are present but offset by important adverse consequences, the equilibrium between benefits and harms prevents a strong recommendation. A clear preponderance of benefit or harm supports stronger statements for or against a course of action. When the benefit-harm assessment is balanced, no matter how good the studies, practitioners should be offered options rather than recommendations. Such cases mirror the situation described by Bass,5 in which "there is adequate evidence at hand to support those who wish to treat...yet the evidence is not so overwhelming as to suggest that those who do not choose to use this form of therapy are in error."
3.Assign Recommendation Strength
Recommendation strength communicates the guideline developers' (and the sponsoring organizations') assessment of the importance of adherence to a particular recommendation and is based on both the quality of the supporting evidence and the magnitude of the potential benefit or harm. The proposed classification defines 4 levels of policy (strong recommendation, recommendation, option, and no recommendation) based on:
|
When the evidence is of low quality and the benefit-harm equilibrium is balanced, guideline developers generally should not constrain the clinician's discretion by making a recommendation but instead should designate acceptable alternatives as options. Although options do not direct clinicians' actions toward one activity or another, they may place boundaries by delineating appropriate alternative practices.
When the evidence is scant and the balance of benefit and harm is unknown, as for example with some complementary and alternative medicine practices, no recommendation regarding therapy may be possible. Stating that no recommendation is possible provides information but little direction to the clinician. No-recommendation statements are therefore of limited utility and should be discouraged. In some cases, guideline developers still may be able to make policy or offer options based on evidence. For example, although there may be no evidence of effectiveness of a complementary and alternative medicine practice, developers may be able to recommend that clinicians should inquire about use of complementary and alternative medicine and counsel about potential interactions. In other circumstances, guideline developers might suggest individualization on the basis of risk and values.
In some cases, a recommendation or strong recommendation may be made when analysis of the balance of benefits and harms demonstrates an exceptional preponderance of benefit or harm and it would be unethical to perform clinical trials to "prove" the point. These are almost exclusively situations of a medical (not social or political) nature. For example, the anticipated benefit of a recommendation for prescribing anthrax prophylaxis to exposed patients clearly outweighs the expected harms and calls for a strong recommendation, although studies do not exist to support the practice. Such situations with poor evidence but a highly unbalanced benefit-harm equation must be unmistakably differentiated from other circumstances in which high-quality evidence supports strong recommendations. Requiring the authors to explicitly state the benefit-harm proposition opens it up for constructive debate.
| INTERPRETING GUIDELINE RECOMMENDATIONS |
|---|
|
|
|---|
A strong recommendation means that the committee believes that the benefits of the recommended approach clearly exceed the harms of that approach (or, in the case of a strong negative recommendation, that the harms clearly exceed the benefits) and that the quality of the evidence supporting this approach is either excellent or impossible to obtain. Clinicians should follow such guidance unless a clear and compelling rationale for acting in a contrary manner is present.
A recommendation means that the committee believes that the benefits exceed the harms (or, in the case of a negative recommendation, that the harms exceed the benefits), but the quality of the evidence on which this recommendation is based is not as strong. Clinicians also generally should follow such guidance but also should be alert to new information and sensitive to patient preferences.
An option means either that the evidence quality that exists is suspect or that well-designed, well-conducted studies have demonstrated little clear advantage to one approach versus another. Options offer clinicians flexibility in their decision-making regarding appropriate practice, although they may set boundaries on alternatives. Patient preference should have a substantial role in influencing clinical decision-making, particularly when policies are expressed as options.
No recommendation is made when there is both a lack of pertinent evidence and an unclear balance between benefits and harms. Clinicians should feel little constraint in their decision-making when addressing areas with insufficient evidence. Patient preference should have a substantial role in influencing clinical decision-making.
The AAP believes that adoption of an explicit, consistent classification of recommendations will facilitate communication between the AAP entities that develop guidelines (committees, sections, and task forces and the board of directors) and pediatricians who apply them to clinical practice. We recognize that any classification system may be considered by payors in making reimbursement decisions. This classification is intended only to increase transparency and to enhance the utility and credibility of AAP clinical practice guidelines. Direct linkage of this classification to reimbursement decisions would be overly simplistic, because recommendation strength is one of many factors that should be considered in developing reimbursement policy. Experience and new knowledge will likely require periodic revision of the proposed classification system.
| Steering Committee on Quality Improvement and Management, 20032004 |
|---|
|
|
|---|
Carole M. Lannon, MD, MPH, Director
Norman Harbaugh, MD
Elizabeth Susan Hodgson, MD
*Edgar K. Marcuse, MD, MPH
*Richard N. Shiffman, MD
Lisa Simpson, MB, BCh
| Liaisons |
|---|
|
|
|---|
Paul Darden, MD
Section on Epidemiology
Denise Dougherty, PhD
Agency for Healthcare Research and Quality
Ellen Schwalenstocker, MBA
National Association of Children's Hospitals and Related Institutions
| Staff |
|---|
|
|
|---|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. M. Perrin, R. A. Friedman, T. K. Knilans, the Black Box Working Group, and the Section on Cardiology and Cardiac Surgery Cardiovascular Monitoring and Stimulant Drugs for Attention-Deficit/Hyperactivity Disorder Pediatrics, August 1, 2008; 122(2): 451 - 453. [Full Text] [PDF] |
||||
![]() |
V. A. Moyer and D. Nelson Pediatricians and the US Preventive Services Task Force: A Natural Partnership to Enhance the Health of Children Pediatrics, July 1, 2008; 122(1): 174 - 176. [Full Text] [PDF] |
||||
![]() |
M. G. Kahn and J. Todd Comparative Quality Measures: Putting Evidence Above Expediency Pediatrics, July 1, 2008; 122(1): 182 - 183. [Full Text] [PDF] |
||||
![]() |
Steering Committee on Quality Improvement and Mana Febrile Seizures: Clinical Practice Guideline for the Long-term Management of the Child With Simple Febrile Seizures Pediatrics, June 1, 2008; 121(6): 1281 - 1286. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. T. Laven Insufficient Evidence for Committee Recommendations on Obesity Pediatrics, May 1, 2008; 121(5): 1077 - 1078. [Full Text] [PDF] |
||||
![]() |
Steering Committee on Quality Improvement and Mana Toward Transparent Clinical Policies Pediatrics, March 1, 2008; 121(3): 643 - 646. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. N. Shiffman and A. S. Lieberthal The Importance of Disclaimers: Distinction Between Optimal Care and Standard of Care: In Reply Pediatrics, August 1, 2007; 120(2): 455 - 455. [Full Text] [PDF] |
||||
![]() |
Subcommittee on Diagnosis and Management of Bronch Diagnosis and Management of Bronchiolitis Pediatrics, October 1, 2006; 118(4): 1774 - 1793. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Lieberthal and T. G. Ganiats Thoughts on the American Academy of Pediatrics/American Academy of Family Physicians Clinical Practice Guideline on Acute Otitis Media: A Different Perspective: In Reply Pediatrics, May 1, 2005; 115(5): 1444 - 1445. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | E-MAIL ALERTS | SEARCH |