CHE
About Us
What's New
Info Cycle
Projects
Users' Guides
Desktops
Contact Us
Site Map

Applying Clinical Trial Results.
Part B: Guidelines for Determining Whether a Drug is Exerting (More Than) a Class Effect

Finlay A. McAlister, Andreas Laupacis, George A. Wells, David L. Sackett, for the Evidence-Based Medicine Working Group

Based on the Users' Guides to Evidence-based Medicine and reproduced with permission from JAMA. (1999;282(14):1371-1377). Copyright 1999, American Medical Association.


Most classes of drugs include multiple compounds, and the interests of clinicians, manufacturers, and purchasers may conflict around questions of whether a particular drug is more efficacious, safer, or more cost effective than others in its class [1]. In this article, we review the types of evidence commonly cited to support the use of one drug over another of the same class and provide a hierarchy for grading studies which compare one drug with another from the same class, expanding upon our discussion in Part A of this User’s Guide.

Up


Clinical Scenarios

The Clinician

As a busy clinician, you care for many patients with elevated serum cholesterol levels. A speaker at a recent continuing medical education event reviewed the benefits of cholesterol lowering therapy, particularly with hydroxymethylglutaryl-Coenzyme A reductase inhibitors (statins), in the primary and secondary prevention of ischemic heart disease, but did not recommend a particular statin. You decide to consider statin therapy for all of your patients with elevated cholesterol, but are uncertain which of the six statins on the market is the best. You ask your local general internist, cardiologist, and endocrinologist for their opinions and each suggests a different statin, citing different reasons. You contact your local pharmaceutical representatives to provide you with the evidence that their statin is better than that of their competitors. Although you use the JAMA series on "Users guides to the medical literature" to assess the validity of published studies, faced with a variety of competing claims, you realize that you need a framework for grading the strength of these studies.

The Policy Maker

Your colleague, a purchaser for a large Health Maintenance Organization (HMO), is faced with a similar dilemma when she is asked to consider replacing the statin on her HMO’s formulary with a newer one. She wonders whether there is enough evidence to support the contention that the new statin is as good as, or better than, the one currently on formulary. While this statin is cheaper, it has only been evaluated in short-term trials with cholesterol lowering as the solitary endpoint.

Up


Introduction

Although there is no uniformly accepted definition of a drug class, and some argue that it can’t be defined at all, drugs are generally said to belong to the same class for one of three reasons [Table 1]. In this paper, we will define a drug class as those drugs which share a similar structure and mechanism of action. Most classes of drugs include multiple compounds, and because of their similar mechanisms of action, they are generally thought to confer similar pharmacologic effects and clinical outcomes ("class effects"). This assumption is a key medical heuristic [2] and underlies clinical practice guidelines in which evidence from studies involving one or more drugs within a class is extrapolated to other drugs of the same class. For example, it is recommended that “beta-blockers” be prescribed to survivors of myocardial infarction (MI) or “angiotensin converting enzyme (ACE) inhibitors” to patients with heart failure. In this circumstance, clinicians are likely to be interested in the drug within each class with the most attractive efficacy/safety ratio; purchasers the most cost-effective drug from a class; and manufacturers in ensuring that their drug is prescribed as much as possible.

Up

Table 1: Different definitions of drug classes

Definition Example
A group of drugs with similar chemical structure Dihydropyridine CCB have a dihydropyridine ring.
A group of drugs with similar mechanism of action CCB block the voltage-dependent calcium channels on the surfaces of cell membranes.
A group of drugs with similar pharmacologic effects Antihypertensives (for example, calcium channel blockers, ACE inhibitors, beta blockers, thiazides, alpha blockers) lower blood pressure.

ACE inhibitors= angiotensin-converting enzyme inhibitors

The absolute treatment effects seen with a drug (defined by the absolute risk reduction [ARR] or number needed to treat [NNT]) are influenced by the baseline risk or control event rate (CER) of those patients in whom it is used. Thus, the ARR varies considerably among different groups of patients. On the other hand, the relative treatment effect of a drug (defined by the relative risk reduction[RRR]) is often (but not always [3]) similar irrespective of the baseline risk of trial participants [4] [5]. If two drugs are tested in separate placebo-controlled trials, only proportional effects such as the RRR seen with each drug can be compared (and then only under the assumption of constant RRR over different CERs). Although the point estimates of effect size vary with the play of chance, a class effect is considered to be present when drugs with similar mechanisms of action generate RRRs (or odds ratios [ORs]) that are similar in direction and magnitude. For example, the Collaborative Group on ACE Inhibitor Trials [6] suggested that there is a class effect for ACE inhibitors in patients with symptomatic heart failure despite the fact that the OR point estimates for effects on total mortality ranged from 0.14 (95% confidence intervals [CI] 0 to 7.6) for perindopril (one trial, 125 patients) to 0.78 (95% CI 0.67 to 0.91) for enalapril (seven trials, 3381 patients). Our confidence in this class effect stems from the recognition that the overall OR in 32 trials involving 7105 patients was 0.77 (CI 0.67-0.88), the confidence intervals for each of the ACE inhibitors overlapped, and there was no statistical heterogeneity between trials of different agents.

Up


The Risks of Assuming a Class Effect

Although drugs of the same class typically exhibit similar pharmacological effects and clinical outcomes, this may not always be the case (witness the current controversy over the safety of sotalol in MI survivors with CHF after the publication of the SWORD Trial [7] which suggested an increase in mortality with sotalol, in contrast to the decrease in mortality with other beta blockers). However, in this context, it is useful to recall a previous controversy over the efficacy of beta-blockers with intrinsic sympathetic activity (ISA) in patients with MI: while an earlier meta-analysis [8] suggested that the treatment effect was larger with non-ISA beta-blockers than those with ISA, subsequent trials [9] failed to confirm this and the totality of the evidence [10] suggests there is little difference between beta-blocker subgroups. Thus, it would seem reasonable to accept a priori that drugs within the same class exert similar effects, unless there is clear evidence of important differences.

However, this assumption can lead to two important errors in extrapolation with major clinical consequences. Firstly, when a class of drugs (such as the thiazide diuretics) all produce similar pharmacological effects (blood pressure lowering) and similar clinical effects (stroke reduction), a second class of drugs (for example, the calcium-channel blockers) that produce the same pharmacological effect might be assumed to produce the same clinical benefit. In the absence of randomized trials verifying that final step, this type of extrapolation may be in error: for example, consider the issue raised in Part A of this Guide- some calcium channel blockers have unfavourable effects on total mortality [11]. Secondly, even within the same class, individual drugs may have different physiologic effects other than the mechanism of action which defined them as being from the same class. To the extent that this is so, it may be inaccurate to extrapolate the clinical outcomes shown in randomized trials of one drug in a class to another member in that class that has not been subjected to similar outcome-centered trials. For example, some authors have argued that, although all of the statins act on the HMG-CoA reductase enzyme, they may have different non-lipid effects on the atherothrombotic process which may influence their clinical efficacy [12].

In order to reduce the risk of faulty extrapolation and to maximize the optimal selection of treatments within a class of drugs, we believe it may be useful to develop and apply a hierarchy of evidence when making decisions about the comparative clinical efficacy and safety of drugs within a class. As pointed out in Part A of this User’s Guide, no matter how strong the pathophysiologic rationale or indirect evidence, the efficacy and safety of a new drug must be established in clinical outcome studies that test more than just biological plausibility.

Up


Levels of Evidence

Levels of evidence are increasingly used by groups which make recommendations about patient-care [13] [14] [15], and we have used some of them as a starting point to develop guidelines for comparing one drug against other drugs within the same class [Table 2]. This comparison should occur as part of a systematic review of all the relevant evidence on the effects of a treatment, identified and assessed by sound and transparent methods such as those employed in the Cochrane Collaboration [The Cochrane Library, Issue 2, 1998. Oxford: Update Software]. We will describe each level in turn, using the choice of statin drugs as an example to illustrate their use (the relevant statin trials are summarized in Table 3).

Up

Table 2: Levels of Evidence for comparing the efficacy of drugs within the same class

Level Comparison Study Patients Outcomes Threats to Validity
1 Within a “head-to-head” randomized trial (RCT) Identical (by definition) Clinically Important Outcomes -failure to conceal randomization scheme
-failure to achieve complete follow-up
-failure to achieve double-blinding
-soundness of outcome assessment
2 Within a “head-to-head” randomized trial (RCT) Identical (by definition) Validated Surrogate Outcomes -those of level 1 PLUS
-validity of surrogate outcome for clinically important outcomes
2 Across RCTs of different drugs vs. placebo Similar or different (in disease and risk factor status) Clinically Important Outcomes or Validated Surrogate Outcomes -those of level 1 PLUS differences between trials in:
-methodologic quality (adequacy of blinding, allocation concealment, etc.)
-endpoint definitions
-compliance rates
-baseline risk of outcomes
3 Across subgroup analyses from RCTs of different drugs vs. placebo Similar or different Clinically Important Outcomes or surrogate outcomes -those of level 1 (+ 2) PLUS:
-multiple comparisons, post-hoc data dredging
-under-powered subgroups
-misclassification into subgroups
3 Across RCTs of different drugs vs. placebo Similar or different Unvalidated Surrogate Outcomes -surrogate outcomes may not capture all of the effects (beneficial or hazardous) of a therapeutic agent
4 Between non-randomized studies (observational studies and administrative database research) Similar or different Clinically Important Outcomes -confounding by indication, compliance, and/or calendar time
-unknown/unmeasured confounders
-measurement error
-for outcomes research: limited databases, coding systems not suitable for research

Up

Clinically Important Outcomes refer to long-term efficacy data and the particular endpoints depend on the condition being treated. For statins used to prevent/treat atherosclerotic disease, clinically important outcomes would include all-cause mortality, myocardial infarction, and stroke.

Surrogate Outcomes are considered “Validated” only when the relationship between the surrogate outcome and clinically important outcomes has been firmly established in long-term RCTs.

Up

Table 3: Features of randomized, placebo-controlled, statin trials designed to detect differences in clinically important endpoints

Trial: 4S20 WOSCOPS21 CARE22 AFCAPS/
TexCAPS
23
LIPID24
Study design Secondary prevention, Multicentre Primary prevention, One centre Secondary prevention, Multicentre Primary prevention, Multicentre Secondary prevention, Multicentre
Treatment (dose od) Simvastatin 20 mg Pravastatin 40 mg Pravastatin 40 mg Lovastatin 40 mg Pravastatin 40 mg
Patient Inclusion Criteria 35-70 yrs, prior angina or AMI, fasting total chol 5.5-8.0 mmol/L 45-64 yrs, no prior AMI, fasting LDL chol 4.0-6.0 mmol/L 21-75 yrs, prior AMI, fasting LDL chol 3.0-4.5 mmol/L 45-73 yrs (males) or 55-73 (females), no prior AMI, fasting LDL chol 3.4-4.9 mmol/L 31-75 yrs, prior AMI or unstable angina, fasting total chol 4-7 mmol/L
Co-interventions ASA (37%)
Beta-blockers (57%)
None ASA (83%)
Beta-blockers (40%)
None ASA (82%)
Beta-blockers (47%)
Duration of followup (yrs) 5.4
(median)
4.9
(mean)
5.0
(median)
5.2
(mean)
6.1
(mean)
Patients:
Number
4444 6595 4159 6605 9014
Mean Age (yrs) 58.6 55.2 59 58 62
Males (%) 81 100 86 85 83
Smokers (%) 26 44 21 12 10
Diabetes Mellitus (%) 5 1 15 2 9
Baseline Cholesterol (mean)
-Total
(mmol/L)
-LDL
(mmol/L)
6.8

4.9

7.0

5.0

5.4

3.6

5.7

3.9

5.6

3.9

Control Event Rates:
-for death
-for AMI
11.5%
22.6%
4.1%
7.9%
9.4%
10%
0.44%
0.56%
14.1%
10.3%
TREATMENT EFFECTS:
% Change in lipids (active treatment vs. placebo) - 25%
(Total chol)
- 35%
(LDL chol)
+ 8%
(HDL chol)
- 10%
(Trig)
- 20%
(Total chol)
- 26%
(LDL chol)
+ 5%
(HDL chol)
- 12%
(Trig)
- 20%
(Total chol)
- 28%
(LDL chol)
+ 5%
(HDL chol)
- 14%
(Trig)
- 18%
(Total chol)
- 25%
(LDL chol)
+ 6%
(HDL chol)
- 15%
(Trig)
- 18%
(Total chol)
- 25%
(LDL chol)
+ 5%
(HDL chol)
- 11%
(Trig)
Relative Risk Reductions:
-for death
(95% CI)
-for AMI
(95% CI)
30%
(15-42%)
27%
(20-34%)
22%
(0-40%)
31%
(17-43%)
9%
(-12-26%)
25%
(8-39%)
-4%
(not given)
40%
(17-57%)
22%
(13-31%)
29%
(18-38%)
Number Needed to Treat‡:
-to prevent
one death
-to prevent
one AMI
27 (5 yrs)

10 (5 yrs)

111 (5 yrs)

42 (5 yrs)

125 (5 yrs)

40 (5 yrs)

5000 to harm*

435 (5 yrs)

32 (6 yrs)

34 (6 yrs)

Legend:

‡ point estimates only

* since all-cause mortality was non-significantly increased in active treatment arm, results are presented as number needed to treat to cause one death

AMI = acute myocardial infarction

4S = Scandinavian Simvastatin Survival Study
WOSCOPS = West of Scotland Coronary Prevention Study
CARE = Cholesterol and Recurrent Events Trial
AFCAPS/TexCAPS = Air Force/Texas Coronary Atherosclerosis Prevention Study
LIPID = Long-term Intervention with Pravastatin in Ischaemic Disease Study

Up

LEVEL 1: (randomized clinical trials providing head-to-head comparisons of the drug of interest with other drugs of the same class for their effects on clinically important outcomes): This would generate the strongest evidence for the decision maker; however, there are potential threats to validity [Table 2] and several methodologic issues unique to these trials which the reader must consider. Firstly, at least one of the drugs should have been shown to have a clinically important impact versus placebo in previous trials carried out in a similar population to the current trial. Secondly, the choice of appropriate dose for each drug is a complicated issue as this will affect the outcomes and safety profiles for both drugs. Finally, one must carefully consider the trial size and methods before concluding equivalence of two drugs- equivalence trials require much larger sample sizes than standard trials [16] and any laxity in trial conduct or patient compliance will tend to mask any real differences between drugs.

The choice of clinically important outcomes for level 1 studies depends upon the target intervention. In the case of therapies designed to prevent or arrest atherosclerosis (such as statins), this implies long-term efficacy data on events such as myocardial infarction, stroke, and all-cause mortality. On the other hand, for interventions designed to treat symptomatic diseases (such as gastroesophageal reflux disease), clinically important outcomes could include symptom scores and other quality of life measures.

Although there are examples of level 1 evidence in other branches of medicine (such as head-to-head trials of the selective 5-hydroxytryptamine type 3 receptor antagonists for postoperative nausea and vomiting [17] [18]), they are rare in the cardiovascular literature. Our literature search failed to find any level 1 evidence for statins.

LEVEL 2: (randomized clinical trials providing head-to-head comparisons of the drug of interest with other drugs of the same class for their effects on validated surrogate outcomes OR comparisons across two or more placebo-controlled trials for effects on clinically important outcomes or validated surrogate outcomes): Part A of this User’s Guide discussed criteria for deciding whether to accept results of trials based on surrogate outcomes. Ecologic studies, cohort studies, and RCTs with pre-statin lipid lowering agents were supportive of the lipid-lowering hypothesis[19] (that lowering LDL cholesterol lowers the risk of atherosclerotic heart disease); however, it was not until the publication of the large scale statin trials [20] [21] [22] [23] [24] (see Table 3 for full description of trials and acronyms used in this article) consistently linking reductions in LDL-cholesterol to reductions in morbidity and mortality that we felt comfortable in accepting the surrogate endpoint of LDL-cholesterol lowering as a proxy for clinically important outcomes. Thus, in order to accept head-to-head comparisons for surrogate outcomes as level 2 evidence, at least one of the comparators must have demonstrated efficacy in long-term trials with clinically important outcomes.

While a randomized trial [25] comparing four statins for their effects on LDL-cholesterol, HDL-cholesterol, and triglyceride over an eight week period would be an example of level 2 evidence, it is important to also incorporate considerations of the size and duration of trials in the decision-making process (as discussed later).

On the other hand, a number of level 2 comparisons can be made between various statins- for example, one can compare the treatment effects seen with simvastatin versus pravastatin in secondary prevention trials (such as the 4S20 and LIPID24 studies, see Table 3). While consistency of effects in such comparisons would be strong evidence for the presence of a class effect, these comparisons are less useful in determining whether one drug is more efficacious than another since the advantages of randomization are lost and the comparison is essentially that between two or more cohorts. In addition to the potential biases outlined in Table 2, there is also the real possibility of confounding between a subject’s risk/responsiveness and exposure to a particular treatment in those situations where the subjects from different trials have different risk status- for example, if one were to compare the statin used in a primary prevention trial (such as lovastatin in AFCAPS/TexCAPS [23]) with another statin tested in a secondary prevention trial (such as simvastatin in 4S). In fact, comparisons such as these would only be valid when the drug efficacy is known to be independent of baseline risk, an assumption which appears valid in some situations (such as antiplatelet [5] or antihypertensive [4] therapy) but has been questioned for the statins [26] [27] [28] [29] [30] [31].

It is theoretically possible to compare the efficacy of two drugs tested in separate placebo-controlled trials. As outlined by Bucher et al [32], an indirect estimate of the association between drugs A and B can be obtained by comparing the odds ratio (or relative risk) from studies of drug A versus placebo (p) and from studies comparing drug B versus placebo: ORA vs. B= ORA vs. p / ORB vs. p . However, this assumes that none of the potential biases outlined in Table 2 are operative and that an intervention’s treatment effect is consistent across different patient subgroups. Furthermore, these indirect estimates may provide substantially different effect size estimates than direct comparisons of drug A against drug B. For example, a systematic overview of strategies to prevent Pneumocystis carinii pneumonia in HIV positive patients documented that the indirect comparison of trimethoprim-sulphamethoxazole (T-S) versus dapsone/pyrimethamine (D/P) suggested a much larger effect size from T-S (OR 0.37, 95% CI 0.21-0.65) than was seen in the direct comparisons (overall OR 0.64 in the nine trials of T-S versus D/P, 95% CI 0.45-0.90) [32]. Thus, the strength of inference from indirect comparisons is limited.

Level 3 and 4 studies have numerous flaws as outlined below and are best viewed as exercises in hypothesis generation.

LEVEL 3: (comparisons across subgroups from different placebo-controlled trials OR comparisons across placebo-controlled trials in which outcomes are restricted to unvalidated surrogate markers): In addition to the biases that affect higher level studies, comparisons based on subgroup analysis are potentially flawed [Table 2]. Both simple statistics and experience have taught us that many initial subgroup conclusions (especially when they are the result of “data-dredging”) are subsequently shown to be wrong [33] [34]. An example of such a comparison would be looking at the efficacy of simvastatin in the 4S subgroup with the lowest lipid levels (241 patients with total cholesterol 5.5 to 6.24 mmol/L) [27] versus the efficacy of pravastatin in the CARE subgroup with comparable lipid profiles (2087 patients with total cholesterol 5.4 to 6.21 mmol/L) [22].

Level 3 evidence may also include the use of surrogate markers which, although they may lie along a recognized pathogenetic pathway from mechanisms of action to important clinical outcomes, have not been validated in long-term randomized clinical trials. Using an example from Part A of this User’s Guide, this would involve making inferences about reduction in fractures from the effects on bone density of two different bisphosphanates in two independent randomized trials.

LEVEL 4: (comparisons involving or confined to non-randomized evidence): This type of evidence is only possible for conditions such as hypertension or hyperlipidemia in which there are a large number of potential treatments and they are commonly used by practitioners. Non-randomized evidence can include cohort or case control studies, modelling studies (using risk prediction equations such as that derived from the Framingham data [35]), and/or outcomes research employing administrative databases. Although these type of analyses can provide useful insights (particularly with respect to dose-response relationships) [36], we suggest that they are best viewed as exercises in hypothesis-generation. In particular, outcomes research studies, originally developed to determine whether the efficacy of interventions proven in randomized trials have their anticipated impacts at a population level, have sometimes been used to pursue the primary determination of efficacy- a purpose for which they were not intended. When used for this latter purpose, in addition to the limitations common with the other observational data listed in Table 2, they present unique problems in interpretation that restrict the validity of inferences drawn from them about the relative efficacy of medications from the same class.[37]

An example of level 4 evidence is a recent re-analysis of the WOSCOPS database in which the observed coronary event rates in pravastatin-treated patients were compared to those predicted from the Framingham coronary risk equation (using the constellation of risk factors and mean on-treatment cholesterol levels observed in the trial) to determine whether the treatment benefit with pravastatin exceeded that expected from the degree of cholesterol lowering achieved (leading to inferences about whether pravastatin’s efficacy exceeds that expected of other statins) [28].

Up


Other Considerations

Amount of Efficacy Evidence

While we have thus far focused on the validity of the evidence, it is important to also recognize that the number, size, and duration of studies are essential factors to be considered in the decision-making process. Certainly, the superiority of one drug within a class can only be definitively established with level 1 evidence. However, while level 1 evidence would be ideal for establishing that a group of drugs exert a class effect (by showing narrow confidence limits around the difference between drugs), we recognize that this is rarely available and is unlikely to ever be available for many classes of drugs (due to the difficulties in funding and conducting such large trials that are unlikely to appeal to researchers, manufacturers, or funders). In this situation, the amount of level 2 evidence becomes important. For instance, one would feel more comfortable in concluding a class effect if there were a number of placebo-controlled trials demonstrating that various drugs from the same class had similar treatment effects. However, our intent here is not to set a single level that must be achieved before a drug can be claimed to be superior to others in its class or before a class effect can be established. These are decisions which individual clinicians or policy makers must make, taking into account their local circumstances and individual comfort levels.

Safety

In the past decade, there have been numerous examples of drugs within the same class that have been shown to have different safety profiles (for example, practolol causes sclerosing peritonitis and keratoconjunctivitis while other beta-blockers do not; ticlopidine causes more neutropenia than clopidogrel; phenylbutazone causes agranulocytosis while other non-steroidal antiinflammatory agents do not). Although not the primary focus of this paper, considerations of drug safety are part of any treatment or purchasing decision, and we therefore offer a set of levels of evidence for determining safety in Table 4. The first tests of a drug in humans (phase I studies) are designed to determine the maximally tolerated dose, and clinical trials are generally designed to determine the efficacy of the drug. As such, the sample sizes of neither are adequate to detect uncommon adverse effects. The reader’s attention is drawn to the inverse rule of 3: to be 95% sure of seeing at least 1 adverse drug reaction that occurs once in every x patients, you need to follow 3x patients [38]. Given the size and duration of most clinical trials, adverse effects that occur in less than 1 in 1000 participants or take more than six months to appear will generally remain undetected [2]. However, randomized clinical trials are still the strongest design for detecting real differences in adverse effects (such as the different rates of intracranial bleeding with different thrombolytic agents [39] [40]) and meta-analysis of such trials can give an unbiased estimate of excess hazards. In the absence of clinical trials, we believe that premarketing safety data must be considered preliminary and large phase IV studies or systematic post-marketing surveillance data are necessary to confirm the safety of new drugs.

Up

Table 4: Levels of evidence for comparing the safety of drugs within the same class

Level Properties Advantages Threats to Validity
1 RCT(s) Only design which permits the detection of adverse effects when the adverse effect is similar to the event which treatment is trying to prevent Underpowered for detecting adverse effects
2 Cohort Studies Prospective data collection, defined cohort Critically depends on followup, classification and measurement accuracy
3 Case-control Studies Cheap and fast to perform Selection and recall bias
Temporal relationship may not be clear
4 Phase IV Studies If sufficiently large, can detect rare, but important, adverse effects No, or unmatched, control group
Critically depends on followup, classification and measurement accuracy
5 Case series Cheap and fast to perform Small sample size
Selection bias
No control group
6 Case Report(s) Cheap and fast to perform Small sample size
Selection bias
No control group

Up

Convenience/compliance

While once-a-day medications are more convenient and usually have higher compliance rates, evidence on drug compliance derived from trials may translate poorly into clincial practice. For instance, while compliance with the various statins during the course of the trials described in Table 3 ranged from 90% to 94%, analyses of administrative databases in Canada and the United States [41] revealed that only half of statin-treated patients were still taking their medication one year after it was prescribed.

Cost

Faced with a decision as to whether a new drug from a class should be offered to eligible patients within the population, clinicians and policy makers have different (although not mutually exclusive) perspectives. For clinicians, this decision usually hinges on the efficacy, safety, convenience/compliance, and cost of the new drug versus the old, and the applicability of the trial evidence to their patient [42]. However, for policy makers these issues will form only one piece of the puzzle and they must also evaluate the efficiency ("the effects or end results achieved in relation to the effort expended in terms of money, resources, and time") [43], affordability, and opportunity costs of any new drugs. The efficiency of any intervention is determined by formal economic analyses, and the Users’ Guide series has offered criteria for evaluating their methodological quality [44]. Although cost-minimization analysis is the simplest and least controversial of the economic analysis techniques, it requires proof that the outcomes with both alternatives are the same. As this rarely exists, the policy maker must rely on other types of analyses (cost-effectiveness, cost-benefit, or cost-utility analyses) which involve varying degrees of assumption and guesswork. As pointed out by Naylor and colleagues [45], economic analyses should be viewed as "promising, clearly helpful, still in need of refinement and open, like any new technology, to both wise use and well-intentioned abuse."

Further hampering the policy-makers’ task, the decision as to whether a new drug is efficient enough to warrant its adoption depends critically on the social, political, and economic realities of their particular health care setting. Thus, attempts to establish universal cut-points (using cost/QALY ratios) have been largely unsuccessful [46]. While there are occasions where there is compelling evidence for adoption (the new drug is as effective or more effective than others of its class and is less costly) or rejection (the new drug is less effective than others of its class and is more costly) of a new drug, much of the time the policy maker is operating in a cost- utility grey zone between these two extremes[45].

Up


Resolution of Scenarios

The Clinician

Given the qualitative consistency of the RRR for acute myocardial infarction in patients treated with three of the statins in large trials with clinically important outcomes [Table 3] and the convincing nature of LDL-cholesterol lowering as a surrogate outcome [19] [29] [47] [48] [49], our clinician concludes that there is a class effect of statin drugs on the occurrence of ischemic heart disease. In the apparent absence of differences in safety or compliance profile between the various statins, he decides to pursue a cost-minimization strategy. While the newer statin has only been evaluated for cholesterol-lowering efficacy in a short-term trial (less than six months), he decides to prescribe it as it is the cheapest statin in his local setting.

The Policy Maker

The policy maker agrees with the clinician that the statins appear to exert a class effect in terms of efficacy. However, she is concerned that the efficacy of the newer statin has not been evaluated in long-term trials with clinically important outcomes or validated surrogate outcomes. Thus, she decides to keep the older (and more expensive) statin on her formulary until level 1 or long-term level 2 evidence is available proving that the newer statin is as good as, or better, than the currently provided statin.

Up


Conclusion

While it would be preferable that every drug in each class (and indeed every dose and every formulation) be evaluated in RCTs with active comparators from the same class for their effects on clinically important outcomes, this has not been accomplished for several important classes of drugs. We believe that advocates of newer drugs within a class must provide evidence of equivalence (or superiority) to the older agents and "randomized comparative trials … remain the preferred evidentiary standard" [50]. Recognizing that this gold standard is not always attainable (in the case of the statins, such RCTs would require very large sample sizes and long follow-up to detect significant differences in myocardial infarction or death between two different statins), we suggest that discussions about class effects will benefit from citing the levels of evidence behind the arguments and recognizing the strengths and weaknesses inherent in each study design.

Up


Acknowledgements

FAM is supported by the Medical Research Council of Canada and the Alberta Heritage Foundation for Medical Research, AL is supported by the Medical Research Council of Canada, and DLS is supported by the Research and Development Programme of the National Health Service, United Kingdom. The authors gratefully acknowledge the input of various members of the University of Ottawa Clinical Epidemiology Unit and attendees of the EQUINOX symposium in earlier discussions of this topic, Dr. J. Glennie in providing information on the approaches of regulators to these issues, and Drs. I. Chalmers and F. Lawson in reviewing earlier versions of this manuscript. The authors also thank the anonymous reviewers for their helpful input.

Up


References

1. Drug firm suit fails to halt publication of Canadian Health Technology Report. JAMA 1998;280:683-4.

2. McDonald CJ. Medical heuristics: the silent adjudicators of clinical practice. Ann Intern Med 1996;124:56-62.

3. Rothwell P. Can overall results of clinical trials be applied to all patients? Lancet 1995;345:1616-9.

4. Collins R, Peto R, MacMahon S, Hebert P, Fiebich NH, Eberlein KA, et al. Blood pressure, stroke, and coronary heart disease. Part 2. Short-term reductions in blood pressure: overview of randomised trials in their epidemiological context. Lancet 1990;335:827-38.

5. Antiplatelet Trialists’ Collaboration. Collaborative overview of randomised trials of antiplatelet therapy- I: Prevention of death, myocardial infarction, and stroke by prolonged antiplatelet therapy in various categories of patients. BMJ 1994;308:81-106.

6. Garg R, Yusuf S, for the Collaborative Group on ACE Inhibitor Trials. Overview of randomized trials of angiotensin-converting enzyme inhibitors on mortality and morbidity in patients with heart failure. JAMA 1995;273:1450-56.

7. Waldo AL, Camm AJ, de Ruyter H, et al. Effect of d-sotalol on mortality in patients with left ventricular dysfunction after recent and remote myocardial infarction. Lancet 1996;348:7-12.

8. Yusuf S, Peto R, Lewis J, Collins R, Sleight P. Beta blockade during and after myocardial infarction: an overview of randomized trials. Prog Cardiovasc Dis 1985;27:335-71.

9. Boissel JP, Leizorovicz A, Picolet H, Peyrieux JC. Secondary prevention after high-risk acute myocardial infarction with low-dose acebutalol. Am J Cardiol 1990;66:251-60.

10. McAlister FA, Teo KK. Antiarrhythmic therapies for the prevention of sudden cardiac death. Drugs 1997;54:235-52.

11. Furberg CD, Psaty BM. Calcium antagonists: not appropriate as first line antihypertensive agents. Am J Hypertens 1996;9:122-25.

12. Rosenson RS, Tangney CC. Antiatherothrombotic properties of statins. JAMA 1998;279:1643-50.

13. Guyatt GH, Sackett DL, Sinclair JC, Hayward RC, Cook DJ, Cook RJ, for the Evidence-Based Medicine Working Group. Users’ guides to the medical literature. IX. A method for grading health care recommendations. JAMA 1995;274:1800-04.

14. Canadian Task Force on the Periodic Health Examination. The periodic health examination. Can Med Assoc J 1988;138:618-26.

15. Cook DJ, Guyatt GH, Laupacis A, Sackett DL, Goldberg RJ. Clinical recommendations using levels of evidence for antithrombotic agents. Chest 1995;108(Suppl):227S-230S.

16. Donner A. Approaches to sample size estimation in the design of clinical trials- a review. Stat Med 1984;3:199-214.

17. Naguib M, El Bakry AK, Khoshim MHB, Channa AB, El Gammal M, El Gammal K, et al. Prophylactic antiemetic therapy with ondansetron, tropisetron, granisetron and metoclopramide in patients undergoing laparoscopic cholecystectomy: a randomized, double-blind comparison with placebo. Can J Anaesth 1996;43:226-31.

18. Korttila K, Clergue F, Leeser F, Feiss P, Olthoff D, Payeur-Michel C, et al. Intravenous dolasetron and ondansetron in prevention of postoperative nausea and vomiting: a multicenter, double-blind, placebo-controlled study. Acta Anaesthesiol Scand 1997;41:914-22.

19. Law MR, Wald NJ, Thompson SG. By how much and how quickly does reduction in serum cholesterol concentration lower risk of ischaemic heart disease? BMJ 1994;308:367-73.

20. Scandinavian Simvastatin Survival Study Group. Randomised trial of cholesterol lowering in 4444 patients with coronary heart disease: the Scandinavian Simvastatin Survival Study (4S). Lancet 1994;344:1383-9.

21. Shepherd J, Cobbe SM, Ford I, Isles CG, Lorimer AR, MacFarlane PW, et al for the West of Scotland Coronary Prevention Study Group. Prevention of coronary heart disease with pravastatin in men with hypercholesterolemia. N Engl J Med 1995;333:1301-07.

22. Sacks FM, Pfeffer MA, Moye LA, Rouleau JL, Rutherford JD, Cole TG, et al for the Cholesterol and Recurrent Events Trial Investigators. The effect of pravastatin on coronary events after myocardial infarction in patients with average cholesterol levels. N Engl J Med 1996;335:1001-9.

23. Downs JR, Clearfield M, Weis S, et al for the AFCAPS/TexCAPS Research Group. Primary prevention of acute coronary events with lovastatin in men and women with average cholesterol levels. Results of AFCAPS/TexCAPS. JAMA 1998;279:1615-22.

24. The Long-term Intervention with Pravastatin in Ischaemic Disease (LIPID) Study Group. Prevention of cardiovascular events and death with pravastatin in patients with coronary heart disease and a broad range of initial cholesterol levels. N Engl J Med 1998;339:1349-57.

25. Jones P, Kafonek S, Laurora I, et al for the CURVES Investigators. Comparative dose efficacy study of atorvastatin versus simvastatin, pravastatin, lovastatin, and fluvastatin in patients with hypercholesterolemia (The CURVES Study). Am J Cardiol 1998;81:582-7.

26. Sacks FM, Moye LA, Davis BR, Cole TG, Rouleau JL, Nash DT, et al. Relationship between plasma LDL concentrations during treatment with pravastatin and recurrent coronary events in the cholesterol and recurrent events trial. Circulation 1998;97:1446-52.

27. Scandinavian Simvastatin Survival Study Group. Baseline serum cholesterol and treatment effect in the Scandinavian Simvastatin Survival Study (4S). Lancet 1995;345:1274-75.

28. West of Scotland Coronary Prevention Study Group. Influence of pravastatin and plasma lipids on clinical events in the West of Scotland Coronary Prevention Study (WOSCOPS). Circulation 1998;97:1440-5.

29. Fager G, Wiklund O. Cholesterol reduction and clinical benefit. Are there limits to our expectations? Arterioscler Thromb Vasc Biol 1997;17:3527-33.

30. Davey Smith G, Song F, Sheldon TA. Cholesterol lowering and mortality: the importance of considering initial level of risk. BMJ 1993;306:1367-73.

31. Sacks FM, Gibson CM, Rosner B, Pasternak RC, Stone PH, for the Harvard Atherosclerosis Reversibility Project Research Group. The influence of pretreatment low density lipoprotein cholesterol concentrations on the effect of hypocholesterolemic therapy on coronary atherosclerosis in angiographic trials. Am J Cardiol 1995;76:78C-85C.

32. Bucher HC, Guyatt GH, Griffith LE, Walter SD. The results of direct and indirect treatment comparisons in meta-analysis of randomized controlled trials. J Clin Epidemiol 1997;50:683-91.

33. Oxman AD, Guyatt GH. A consumers’ guide to subgroup analyses. Ann Intern Med 1992;116:78-84.

34. Yusuf S, Wittes J, Probstfield J, Tyroler HA. Analysis and interpretation of treatment effects in subgroups of patients in randomized clinical trials. JAMA 1991;266:93-8.

35. Anderson KM, Wilson PWF, Odell PM, Kannel WB. An updated coronary risk profile: a statement for health professionals. Circulation 1991;83:356-62.

36. Psaty BM, Siscovick DS, Weiss NS, Koepsell TD, Rosendaal FR, Lin D, et al. Hypertension and outcomes research. From clinical trials to clinical epidemiology. Am J Hypertens 1996;9:178-83.

37. Marshall WJS. Administrative databases: fact or fiction? Can Med Assoc J 1998;158:489-90.

38. Sackett DL, Haynes RB, Gent M, Taylor DW: Compliance. In: Inman WH (ed): Monitoring for Drug Safety, Lancaster: MTP, 1980.

39. Grupo Italiano per lo Studio Della Sopravvivenza Nell Infarcto Miocardico. GISSI-2: A factorial randomised trial of alteplase versus streptokinase and heparin versus no heparin among 12 490 patients with acute myocardial infarction. Lancet 1990;336:65-71.

40. Third International Study of Infarct Survival Collaborative Group. ISIS-3: A randomised comparison of streptokinase vs. tissue plasminogen activator vs. anistreplase and of aspirin plus heparin vs aspirin alone among 41 299 cases of suspected acute myocardial infarction. Lancet 1992;339:753-70.

41. Avorn J, Monette J, Lacour A, Bohn RL, Monane M, Mogun H, et al. Persistence of use of lipid-lowering medications. A cross-national study. JAMA 1998;279:1458-62.

42. Dans AL, Dans LF, Guyatt GH, Richardson S, for the Evidence-Based Medicine Working Group. Users guides to the medical literature. XIV. How to decide on the applicability of clinical trial results to your patient. JAMA 1998;279:545-9.

43. Last JM. A dictionary of epidemiology. Oxford: Oxford University Press, 1995.

44. Drummond MF, Richardson WS, O’Brien BJ, Levine M, Heyland D, for the Evidence-Based Medicine Working Group. Users’ guides to the medical literature. XIII. How to use an article on economic analysis of clinical practice. A. Are the results of the study valid? JAMA 1997;277:1552-7.

45. Naylor CD, Williams JI, Basinski A, Goel V. Technology assessment and cost-effectiveness analysis: misguided guidelines? Can Med Assoc J 1993;148:921-4.

46. Laupacis A, Feeny D, Detsky AS, Tugwell PX. How attractive does a new technology have to be to warrant adoption and utilization? Tentative guidelines for using clinical and economic evaluations. Can Med Assoc J 1992;146:473-81

47. Gotto AM. Cholesterol management in theory and practice. Circulation 1997;96:4424-30.

48. Grover SA, Paquet S, Levinton C. Coupal L, Zowall H. Estimating the benefits of modifying cardiovascular risk factors: a comparison of primary versus secondary prevention. Arch Intern Med 1998;(in press)

49. Lacour A, Derderian F, LeLorier J. Comparison of efficacy and cost among lipid-lowering agents in patients with primary hypercholesterolemia. Can J Cardiol 1998;14:355-61

50. Tu JV, Naylor CD. Choosing among drugs of different price for similar indications. Can J Cardiol 1998;14:349-51.

Up


© 2001 Evidence-Based Medicine Informatics Project