How effective are common medications: a perspective based on meta-analyses of major drugs
© Leucht et al. 2015
Received: 28 May 2015
Accepted: 18 September 2015
Published: 2 October 2015
The vastness of clinical data and the progressing specialization of medical knowledge may lead to misinterpretation of medication efficacy. To show a realistic perspective on drug efficacy we present meta-analyses on some of the most commonly used pharmacological interventions. For each pharmacological intervention we present statistical indexes (absolute risk or response difference, percentage response ratio, mean difference, standardized mean difference) that are often used to represent efficacy. We found that some of the medications have relatively low effect sizes with only 11 out of 17 of them showing a minimal clinically important difference. Efficacy was often established based on surrogate outcomes and not the more relevant patient-oriented outcomes. As the interpretation of the efficacy of medication is complex, more training for physicians might be needed to get a more realistic view of drug efficacy. That could help prevent harmful overtreatment and reinforce an evidence-based, but personalized medicine.
KeywordsAbsolute risk or response difference Common medications Drug classes Drug efficacy Mean difference Medication efficacy Meta-analysis Percentage response ratio Pharmacological interventions Standardized mean difference Schizophrenia Depression
Medicine is becoming so highly specialized and the clinical literature is growing so fast, that few doctors let alone the lay public have a working knowledge of the detailed evidence on drugs outside their specialty . This is despite the fact that clinicians must often evaluate comparative risks and benefits of treatments for patients with multiple maladies. Studies show that decision making can be distorted by various cognitive biases such as a physician’s tendency to remember dramatically successful cases and forget ones that failed or to misinterpret the statistical indices used in clinical trials and meta-analyses . This may lead the physician to overestimate the efficacy of treatments, which in turn may be one of the causes of harmful overtreatment .
Common pharmacological treatments
We would like to present a realistic perspective on the general efficacy of common pharmacological treatments. Following the general methods of a previous overview of reviews , we identified systematic reviews of randomized controlled trials with meta-analysis comparing drugs used in specific therapy types with placebo. We included 20 most common therapy types as measured by the number of on-therapy patients in the US, according to the IMS Institute for Healthcare Informatics . For each therapy type listed there we identified primary pharmacological treatments and their primary indications (as suggested by the IMS review and verified by national and international treatment guidelines). Then using PubMed we searched (last search: 5 August 2014, see Additional file 1) for the broadest and most recent meta-analysis on that treatment. If possible, we included meta-analyses on monotherapy rather than combination therapy, on all patients rather than a sub-group of patients (for example, we preferred reviews on all age groups, over ones restricted to adults or children) and on broad drug classes rather than narrow ones or single drugs (for example, we preferred a meta-analysis on all antihypertensive drugs, over ones on ACE inhibitors or enalapril). If a meta-analysis on the whole therapy type (for example, any narcotic) was not available, we included a frequently used example (for example, oxycodone + paracetamol, which is the most frequently used painkiller according to the IMS report for which we found a meta-analysis fulfilling our inclusion criteria). For a more detailed description of our methods, please refer to the protocol (see Additional file 2).
Measures of medication efficacy
Absolute risk or response difference (ARD) is the risk or percentage of responders in group B subtracted from the risk or percentage of responders in group A. For example, mortality was 2 % for drug treatment and 4 % for placebo, which gives an ARD = |-2 %|. For responder rates, if 45 % of patients responded in the drug group and 30 % in the placebo group, the ARD is 15 %.
Percentage response ratio (PRR) is the percentage of responders in group A divided by the percentage responders in group B. For example, if 45 % of participants responded to drug treatment in group A and 30 % to placebo in group B, the PRR is 50 %, because 0.45/0.3 = 1.5. This means that there were 50 % more responders in group A compared to group B.
Mean difference (MD) is the mean from group B subtracted from the mean in group A. For example, if the mean total sleep time at the end of treatment in the drug group was 5 hours and 10 minutes and in the placebo group 4 hours and 55 minutes, the MD is 15 minutes.
Standardized mean difference (SMD) is the mean from group B subtracted from the mean in group A and divided by the pooled standard deviation (SD). For example, if the average weight of participants at the end of treatment was 79 kg in the drug group and 83 kg in the placebo group and the pooled SD was 8 kg, the SMD is 0.5.
Effect sizes at Fig. 1 are expressed graphically as SMDs and are ranked as “small” (0.2), “medium” (around 0.5) or “large” (above 0.8) . We also present the percentage of responders in the drug and placebo group and, if appropriate, the number of trials (N) and patients (n) for each meta-analysis, as well as the AMSTAR score, which is a measure of methodological quality of systematic reviews .
The efficacy of common medications
Differences larger than one standard deviation (that is, SMD >1) between the drug and placebo groups are uncommon, examples being proton pump inhibitors for reflux esophagitis  or oxycodone plus paracetamol for postoperative pain . For many other medications the effect sizes were much smaller. For example, antihypertensive drugs reduced systolic and diastolic blood pressure by only 10 mmHg and 5 mmHg, respectively , the ARD between aspirin and placebo for primary prevention of cardiovascular events was only 0.07 % per year , and the ARD for antidepressants and placebo for major depressive disorder was 17 % .
For an outcome affecting quality of life, ½ of a standard deviation is considered to be a minimal clinically important difference . Out of 17 common pharmacological treatments examined, only 11 met this threshold. In four of them efficacy was represented by surrogate outcomes, such as diastolic blood pressure or fasting plasma glucose, and not patient-oriented outcomes, such as pain, mortality or adverse events. Therefore, patients might not have experienced substantial benefits related to their well-being and quality of life after therapy with some of these drugs. Moreover many of the included meta-analyses had a low methodological quality as represented by median AMSTAR score of 7/11 (interquartile range 5 to 9).
Surrogate outcomes versus patient-oriented outcomes
Figure 1 also illustrates that surrogate outcomes often show dramatic effects, while the effects on patient-oriented outcomes are much smaller. For example, statins reduce cholesterol by 30 % on average . However, high cholesterol alone does not directly produce pain or disability. For long-term consequences, such as cardiovascular events and mortality, the effects are smaller (ARD between statins and placebo of 4 % for cardiovascular events and 1.2 % for mortality within 5 years ). In hypertension, medium effect sizes for reductions of hypertension  lead to comparatively small reductions of cardiovascular events , and metformin strongly reduces glucose , but there is no evidence of a reduction in mortality . Among the seven outcomes that can be both objectively measured and are patient-oriented (marked in red color in Fig. 1) only one shows a big effect size (remission of reflux esophagitis by proton pump inhibitors ).
Statistical indices can be misleading
In general, relative risk reductions suggest larger differences than ARDs. For example, statins reduced the number of patients with major cardiovascular events from 18 % to 14 % . The relative risk reduction of 21 % (100 % - (14 %/18 %) = 21 %) is more impressive than the ARD of 4 % (14 % - 18 % = |-4 %|). Findings consistently show that a mere reporting of a relative risk reduction can be misleading, because many clinicians will interpret it as an absolute difference .
There are many limitations in an overview of meta-analyses . For example, the meta-analyses differed in methods and publication year. We preferred reviews of drug classes which may obscure superiorities of single drugs. Many outcomes may accumulate over time if the studies had longer durations. For example, the evidence on mortality reduction by statins is based on 5-year studies, but the effect could get larger if patients took them for 20 years. Or a patient with depression may have ten episodes in his life which could be reduced by medication to five . Finally, whether the increment of improvement by a drug is important depends on many factors, such as the seriousness of the disease, side-effects, cost and, most importantly, the short- and long-term outcome in question. For mortality, the “baseline risk” (that is, mortality in the no-treatment group) is often low, leading to a relatively low maximally possible absolute risk reduction. For example, within 5 years without treatment only 9.7/100 participants with hypercholesterolemia died , limiting the maximally possible absolute mortality reduction to 9.7 %. Nevertheless, since mortality is such an important outcome, even a small reduction can be clinically meaningful. In other words, a large effect size for a transitory rash is less important than a small reduction of death. For all these reasons, this article is only a perspective and not a full review of the evidence for every possible aspect.
We feel that we need to be more realistic about drug efficacy. Doctors may believe that all patients respond to drugs and none to placebo, but neither statement is true because there is no ideal drug and many disorders remit spontaneously due to their natural course. Our preference for black or white over shades of grey is convenient but it can offer only a “false clarity” . The psychologist Daniel Kahneman received the Nobel Prize in economics for research on cognitive bias and decision making, seen in the context of an initial perception of an idea, which takes place in less than a second versus the more logical thinking through of ideas, which often takes hours or days . The initial rapid intuitions can be biased by many factors such as recency, frequency and vividness of prior personal experiences, but does not take into account statistics very well. Pharmaceutical company advertising takes full advantage of this. We feel these quantitative benchmarks will help clinicians learn how to interpret the latest drug findings and reflect on their limitations. We do not strive to therapeutic nihilism, but rather believe that drug data is complex and requires thoughtful consideration regarding which medications and therapies are best suited for certain situations and patients.
A measurement scale for the assessment of the methodological quality of systematic reviews
Absolute response or risk difference
Percentage of patients with the outcome in the drug group
Mean difference in original units
Number of participants
Number of trials
Percentage of patients with the outcome in the placebo group
Percentage response ratio
Standardized mean difference
We would like to thank Dawn R Gartlehner for proofreading of the manuscript.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Bastian H, Glasziou P, Chalmers I. Seventy-five trials and eleven systematic reviews a day: how will we ever keep up? PLoS Med. 2010;7:e1000326. doi:10.1371/journal.pmed.1000326.View ArticlePubMedPubMed CentralGoogle Scholar
- Gigerenzer G, Muir Gray J. Better doctors, better patients, better decisions: Envisioning health care 2020. Cambridge, MA: MIT Press; 2011.View ArticleGoogle Scholar
- Lenzer J. Unnecessary care: are doctors in denial and is profit driven healthcare to blame? BMJ. 2012;345:e6230. doi:10.1136/bmj.e6230.View ArticlePubMedGoogle Scholar
- Leucht S, Hierl S, Kissling W, Dold M, Davis JM. Putting the efficacy of psychiatric and general medicine medication into perspective: review of meta-analyses. Br J Psychiatry. 2012;200:97–106. doi:10.1192/bjp.bp.111.096594.View ArticlePubMedGoogle Scholar
- Aitken M, Kleinrock M, Lyle J, Caskey L. Medicine use and shifting costs of healthcare. A review of the use of medicines in the United States in 2013. Danbury, CT: IMS Institute for Healthcare Informatics; 2014.Google Scholar
- Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum Associates; 1988.Google Scholar
- Shea BJ, Hamel C, Wells GA, Bouter LM, Kristjansson E, Grimshaw J, et al. AMSTAR is a reliable and valid measurement tool to assess the methodological quality of systematic reviews. J Clin Epidemiol. 2009;62:1013–20. doi:10.1016/j.jclinepi.2008.10.009.View ArticlePubMedGoogle Scholar
- Khan M, Santana J, Donnellan C, Preston C, Moayyedi P. Medical treatments in the short term management of reflux oesophagitis. Cochrane Database Syst Rev. 2007;2:CD003244. doi:10.1002/14651858.CD003244.pub2.PubMedGoogle Scholar
- Moore RA, Derry S, McQuay HJ, Wiffen PJ. Single dose oral analgesics for acute postoperative pain in adults. Cochrane Database Syst Rev. 2011;9:CD008659. doi:10.1002/14651858.CD008659.pub2.PubMedGoogle Scholar
- Law M, Morris JK, Jordan R, Wald N. Headaches and the treatment of blood pressure: results from a meta-analysis of 94 randomized placebo-controlled trials with 24,000 participants. Circulation. 2005;112:2301–6. doi:10.1161/CIRCULATIONAHA.104.529628.View ArticlePubMedGoogle Scholar
- Antithrombotic Trialists C, Baigent C, Blackwell L, Collins R, Emberson J, Godwin J, et al. Aspirin in the primary and secondary prevention of vascular disease: collaborative meta-analysis of individual participant data from randomised trials. Lancet. 2009;373:1849–60. doi:10.1016/S0140-6736(09)60503-1.View ArticleGoogle Scholar
- Undurraga J, Baldessarini RJ. Randomized, placebo-controlled trials of antidepressants for acute major depression: thirty-year meta-analytic review. Neuropsychopharmacology. 2012;37:851–64. doi:10.1038/npp.2011.306.View ArticlePubMedGoogle Scholar
- Norman GR, Sloan JA, Wyrwich KW. Interpretation of changes in health-related quality of life: the remarkable universality of half a standard deviation. Med Care. 2003;41:582–92. doi:10.1097/01.MLR.0000062554.74615.4C.PubMedGoogle Scholar
- Law MR, Wald NJ, Rudnicka AR. Quantifying effect of statins on low density lipoprotein cholesterol, ischaemic heart disease, and stroke: systematic review and meta-analysis. BMJ. 2003;326:1423. doi:10.1136/bmj.326.7404.1423.View ArticlePubMedPubMed CentralGoogle Scholar
- Baigent C, Keech A, Kearney PM, Blackwell L, Buck G, Pollicino C, et al. Efficacy and safety of cholesterol-lowering treatment: prospective meta-analysis of data from 90,056 participants in 14 randomised trials of statins. Lancet. 2005;366:1267–78. doi:10.1016/S0140-6736(05)67394-1.View ArticlePubMedGoogle Scholar
- Turnbull F, Blood Pressure Lowering Treatment Trialists’ Collaboration. Effects of different blood-pressure-lowering regimens on major cardiovascular events: results of prospectively-designed overviews of randomised trials. Lancet. 2003;362:1527–35.View ArticlePubMedGoogle Scholar
- Saenz A, Fernandez-Esteban I, Mataix A, Ausejo M, Roque M, Moher D. Metformin monotherapy for type 2 diabetes mellitus. Cochrane Database Syst Rev. 2005;3:CD002966. doi:10.1002/14651858.CD002966.pub3.PubMedGoogle Scholar
- Boussageon R, Supper I, Bejan-Angoulvant T, Kellou N, Cucherat M, Boissel JP, et al. Reappraisal of metformin efficacy in the treatment of type 2 diabetes: a meta-analysis of randomised controlled trials. PLoS Med. 2012;9:e1001204. doi:10.1371/journal.pmed.1001204.View ArticlePubMedPubMed CentralGoogle Scholar
- Covey J. A meta-analysis of the effects of presenting treatment benefits in different formats. Med Decis Making. 2007;27:638–54. doi:10.1177/0272989X07306783.View ArticlePubMedGoogle Scholar
- Glue P, Donovan MR, Kolluri S, Emir B. Meta-analysis of relapse prevention antidepressant trials in depressive disorders. Aust N Z J Psychiatry. 2010;44:697–705. doi:10.3109/00048671003705441.View ArticlePubMedGoogle Scholar
- van Deemter K. Not exactly: In praise of vagueness. Oxford: Oxford University Press; 2010.Google Scholar
- Kahneman D. Thinking fast and slow. New York, NY: Farrar, Straus and Giroux; 2011.Google Scholar
- Birks J. Cholinesterase inhibitors for Alzheimer’s disease. Cochrane Database Syst Rev. 2006;1:CD005593. doi:10.1002/14651858.CD005593.PubMedGoogle Scholar
- Faraone SV, Buitelaar J. Comparing the efficacy of stimulants for ADHD in children and adolescents using meta-analysis. Eur Child Adolesc Psychiatry. 2010;19:353–64. doi:10.1007/s00787-009-0054-3.View ArticlePubMedGoogle Scholar
- Holbrook AM, Crowther R, Lotter A, Cheng C, King D. Meta-analysis of benzodiazepine use in the treatment of insomnia. CMAJ. 2000;162:225–33.PubMedPubMed CentralGoogle Scholar
- Leucht S, Arbter D, Engel RR, Kissling W, Davis JM. How effective are second-generation antipsychotic drugs? A meta-analysis of placebo-controlled trials. Mol Psychiatry. 2009;14:429–47. doi:10.1038/sj.mp.4002136.View ArticlePubMedGoogle Scholar
- MacLean C, Newberry S, Maglione M, McMahon M, Ranganath V, Suttorp M, et al. Systematic review: comparative effectiveness of treatments to prevent fractures in men and women with low bone density or osteoporosis. Ann Intern Med. 2008;148:197–213.View ArticlePubMedGoogle Scholar
- Nabi G, Cody JD, Ellis G, Herbison P, Hay-Smith J. Anticholinergic drugs versus placebo for overactive bladder syndrome in adults. Cochrane Database Syst Rev. 2006;4:CD003781. doi:10.1002/14651858.CD003781.pub2.PubMedGoogle Scholar
- Sin DD, Man J, Sharpe H, Gan WQ, Man SF. Pharmacological management to reduce exacerbations in adults with asthma: a systematic review and meta-analysis. JAMA. 2004;292:367–76. doi:10.1001/jama.292.3.367.View ArticlePubMedGoogle Scholar
- Barr RG, Bourbeau J, Camargo CA, Ram FS. Tiotropium for stable chronic obstructive pulmonary disease: A meta-analysis. Thorax. 2006;61:854–62. doi:10.1136/thx.2006.063271.View ArticlePubMedPubMed CentralGoogle Scholar
- Derry CJ, Derry S, Moore RA. Sumatriptan (oral route of administration) for acute migraine attacks in adults. Cochrane Database Syst Rev. 2012;2:CD008615. doi:10.1002/14651858.CD008615.pub2.PubMedPubMed CentralGoogle Scholar
- Fahn S, Oakes D, Shoulson I, Kieburtz K, Rudolph A, Lang A, et al. Levodopa and the progression of Parkinson’s disease. N Engl J Med. 2004;351:2498–508. doi:10.1056/NEJMoa033447.View ArticlePubMedGoogle Scholar