- Research article
- Open Access
- Open Peer Review
Comparative efficacy and safety of second-line treatments for advanced non-small cell lung cancer with wild-type or unknown status for epidermal growth factor receptor: a systematic review and network meta-analysis
BMC Medicine volume 15, Article number: 193 (2017)
The Commentary to this article has been published in BMC Medicine 2018 16:24
Docetaxel, pemetrexed, erlotinib, and gefitinib are recommended as second-line treatment for advanced non-small cell lung cancer (NSCLC) with wild-type or unknown status for epidermal growth factor receptor (EGFR). However, the number of published randomized clinical trials (RCTs) on this topic is increasing. Our objective was to assess the comparative effectiveness and tolerability of all second-line treatments for advanced NSCLC with wild-type or unknown status for EGFR by a systematic review and network meta-analysis.
MEDLINE, EMBASE, CENTRAL, ClinicalTrials.gov, and the US Food and Drug Administration website, as well as other sources, were searched for available reports up to June 6, 2017. Two reviewers independently selected published and unpublished reports of RCTs comparing any second-line treatments, extracted data and assessed the risk of bias of all included trials. We performed a Bayesian network meta-analysis. The primary outcomes were overall survival (OS) and progression-free survival (PFS). Secondary outcomes included objective response (ObR), the number of serious adverse events, and quality of life.
We included 102 RCTs involving 36,058 patients (62% male, median age 61 years, 81% with stage IV cancer, 80% smokers, and 92% with performance status 0–1). We revealed a differential reporting of outcomes between efficacy and safety outcomes. Half of the trials reported safety outcomes and less than 20% quality of life. For OS, nivolumab was more effective than docetaxel (hazard ratio (HR) 0.69, 95% credible interval (CrI) 0.56–0.83), pemetrexed (0.67, 0.52–0.83), erlotinib (0.68, 0.53–0.86), and gefitinib (0.66, 0.53–0.83). Pembrolizumab, atezolizumab, and pemetrexed plus erlotinib were also significantly more effective than docetaxel, pemetrexed, erlotinib, and gefitinib. For PFS, erlotinib plus cabozantinib was more effective than docetaxel (HR 0.39, 95% CrI 0.18–0.84), pemetrexed (0.38, 0.18–0.82), erlotinib (0.37, 0.18–0.78), and gefitinib (0.38, 0.18–0.82). Cabozantinib and pemetrexed plus erlotinib were also significantly more effective than the four recommended treatments. For ObR, no treatment was significantly more effective. The effectiveness of the four recommended treatments was similar and they were ranked among the 25 less-effective treatments. For safety, evidence is insufficient to draw certain conclusions.
Nivolumab, pembrolizumab, atezolizumab, and pemetrexed plus erlotinib may be the most effective second-line treatments for NSCLC in terms of OS. The four recommended treatments seem to have relatively poor performance. However, the impact on life expectancy of immunotherapy versus other treatments should be further explored by future analyses, and more trials comparing the novel treatments are needed to reduce uncertainty in these results.
Registration number: PROSPERO (CRD42015017592)
Lung cancer remains the leading cause of cancer-related death worldwide, with a lower than 15% 5-year survival . Further, it is the fifth leading cause of disability-adjusted life years in developed countries . Non-small cell lung cancer (NSCLC) represents approximately 85% of lung cancer cases, with most patients in Western populations having wild-type or unknown status for epidermal growth factor receptor (EGFR) and do not present anaplastic lymphoma kinase gene rearrangement . Most patients receive a diagnosis of advanced-stage disease and are candidates for palliative systemic therapy.
Patients with Eastern Cooperative Oncology Group performance status 0–2 with disease progression after first-line chemotherapy are given second-line treatments. In this setting, the American Society of Clinical Oncology (ASCO) clinical practice guidelines, updated in August 2015, recommend two cytotoxic drugs, docetaxel and pemetrexed (only for non-squamous cell carcinoma (NSCC) currently), and two EGFR-tyrosine kinase inhibitors, erlotinib and gefitinib . However, several new treatments have been approved by the US Food and Drug Administration (FDA), including a combination of docetaxel and ramucirumab, nivolumab, pembrolizumab, and atezolizumab, and more than 40 treatments have been assessed in randomized clinical trials (RCTs) for second-line treatment of advanced NSCLC .
Conventional meta-analyses have only partially captured the available evidence for treatment of advanced NSCLC. Specifically, 29 systematic reviews were published between 2009 and 2015, but did not incorporate approximately 40% of the existing evidence in terms of alternative treatments and available trials . Therefore, we need to consider the broader picture and simultaneously assess all treatments evaluated for this condition . The aforementioned treatments approved recently have only been compared to docetaxel, and no trial has compared them with each other. Network meta-analysis (NMA) bypasses this limitation and can inform about these contrasts via indirect evidence, given that at least one trial has compared these treatments against other comparators. Two NMAs have been published, but they focused on small subsets of treatments (only four and six treatments, respectively) [7, 8]. Thus, we lack a comparative synthesis in this setting to assist clinical decision-making.
In this paper, we present an NMA comparing the relative efficacy and safety of all available treatments for second-line treatment of advanced NSCLC in patients with wild-type or unknown status for EGFR.
The present report has been prepared following the recommendations of the Cochrane Comparing Multiple Interventions Methods Group  and the PRISMA extension statement for systematic reviews incorporating NMAs . The protocol of this NMA was also registered with PROSPERO (no. CRD42015017592).
We considered RCTs assessing second-line treatments in patients with advanced NSCLC (stage IIIB unsuitable for radical radiotherapy or surgery and stage IV) with wild-type or unknown status for EGFR. Trials in which patients in the control arm received chemotherapy (e.g., docetaxel or pemetrexed) at the investigators’ discretion were included for the secondary analysis considering treatment categories. We also considered trials including both second- and third-line therapy, because there is no clinical reason to presume that these minority patients in third-line could not be randomized to any of the treatments. If a multi-arm trial compared one drug to two different dosages of another drug, we retained the usual treatment dosage or that corresponding to the 3-week scheme of administration. We excluded trials comparing different administration schemes of the same drug and those assessing combinations of different chemotherapy drugs.
The primary outcomes were overall survival (OS) and progression-free survival (PFS). Secondary outcomes were objective response (ObR), defined as a complete response or a partial response according to the Response Evaluation Criteria in Solid Tumors (RECIST) , the number of serious adverse events (SAEs) as defined at ClinicalTrials.gov (see Additional file 1: Appendix 1 for a description), and quality of life (QoL) considering any scale used in eligible trials.
Data sources and searches
We searched for eligible RCTs up to June 6, 2017, in the Cochrane Central Register of Controlled Trials (CENTRAL), MEDLINE, and EMBASE with no restriction on language, status, or year of publication. To identify additional trials and unpublished data, we screened previous systematic reviews, reference lists of all selected trials, conference abstracts, both industry and non-industry trial registries and results databases, regulatory agency online databases, and health technology assessment agencies . The full search strategy is available in Additional file 1: Appendix 2.
Selection of trial and data extraction
Two investigators (PC and AY) independently examined titles, abstracts, and full texts to assess the eligibility of each report. Disagreements were discussed with a third author (LT). The same authors extracted data independently by using a standardized form. With several reports pertaining to the same trial, we gave priority to the first available source among regulatory agency reports, results posted at ClinicalTrials.gov, published articles, reports from pharmaceutical companies, and conference abstracts. For conference abstracts and trials without published results, we contacted trial investigators to request the final report, if available, and the outcome data when missing from the available reports. The data extracted for each trial are available in Additional file 1: Appendix 3. When attempts to retrieve data from trialists failed, we obtained the necessary data from published Kaplan–Meier curves . For each eligible trial, we extracted information related to the publication, trial and patient characteristics, description of the interventions, and outcome data (Additional file 1: Appendix 3). To evaluate the synthesis assumption for NMA, we extracted data on the effect modifiers of sex, histology, ethnicity, and smoking status.
Risk of bias assessment
Two investigators (PC and AY) assessed the risk of bias of the trials by using the risk of bias tool of The Cochrane Collaboration (Additional file 1: Appendix 4) . Disagreements were discussed with a third author (LT). The considerations for defining the overall risk of bias for each trial are available in Additional file 1: Appendix 4.
We produced network diagrams to show the amount of evidence available for each outcome. The size of the nodes is proportional to the total number of patients allocated to the corresponding treatment across all trials and the width of the lines is proportional to the total number of RCTs evaluating the corresponding treatment comparison. Our primary analyses considered treatments separately and in a secondary analysis, we grouped treatments into five categories, namely cytotoxic monochemotherapy, targeted treatment, immunotherapy, combination of a cytotoxic monochemotherapy and a targeted treatment, and combination of two targeted treatments (see Additional file 1: Appendix 5 for details on the classification). Trials in which patients in the control arm received chemotherapy (e.g., docetaxel or pemetrexed) at the investigators’ discretion contributed to the secondary analyses only.
We assessed the clinical and methodological heterogeneity of the eligible trials within and across the different comparisons in terms of trial and population characteristics. Results from the NMA were valid if the assumption of transitivity (i.e., one can infer the relative effect between two interventions via one or more intermediate comparators) is plausible [15,16,17]. To infer the plausibility of transitivity, we considered the similarity of the distribution of the potential effect modifiers across the different pairwise comparisons.
We used the hazard ratio (HR) as an effect measure for survival outcomes and the odds ratio (OR) for ObR and SAEs, estimating 95% credible intervals (95% CrIs). Trial- and arm-level data were used for survival and binary outcomes, respectively. For ObR, we assumed that participants with missing outcome data did not respond to the intervention. First, we conducted random-effects pairwise meta-analyses for every comparison involving at least two trials. Statistical heterogeneity was assessed by visual inspection of the forest plots and by considering the I2 statistic and the magnitude of the between-trial standard deviation τ. Then, we performed NMA by using a random-effects hierarchical model. We assumed that the different comparisons within each outcome shared the same heterogeneity parameter. For clarity, we present the relative effects between a subset of treatments (recommended and approved treatments or treatments with unexpected results) in league tables, with other relative effects reported in forest plots in the supplementary files. Additionally, for every treatment, we estimated the probability of being at each possible rank and used the SUrface under the Cumulative RAnking (SUCRA) curves to infer the relative ranking of the treatments . We assessed statistical inconsistency (i.e., the agreement between direct and indirect evidence) by two approaches, namely (1) the loop-specific approach, which evaluates inconsistency in every closed loop of evidence [19, 20], and (2) the design-by-treatment interaction model, which assesses the presence of inconsistency in the entire network [19, 20].
We performed pre-specified subgroup analyses considering histologic subtypes (NSCC vs. squamous cell carcinoma (SCC)), ethnicity (Asian vs. non-Asian), and EGFR mutation status (wild-type vs. unknown status).
Reporting bias and credibility of the evidence
To assess the presence of small-study effects in the network for the primary outcomes we drew ‘comparison-adjusted’ funnel plots considering that small trials might give more favorable results for the sponsored drugs than larger trials and used network meta-regression models with the variance of the trials as a covariate [21, 22]. We defined a drug as being ‘sponsored’ if the drug was developed and marketed by the pharmaceutical company that sponsored the trial. We also evaluated the credibility of the evidence from network meta-analysis using the approach suggested by Salanti et al. , which extends the GRADE system into NMA. Details on the method and its implementations are available in Additional file 2: Appendix Figure S7.
All analyses involved the use of R v3.0.2  (metafor package) , WinBUGS v1.4.3 , and Stata v13 (network  and network graphs packages ). The WinBuGS codes can be found in Additional file 1: Appendix 12.
Selection of trials
We included 102 trials involving a total of 36,058 patients (Fig. 1, Additional file 1: Appendix 6), of which data for 87 trials were published and 15 (15%) unpublished (Additional file 1: Appendix 7). For 13 trials (13%), we obtained outcome data from available survival curves or the authors (Additional file 1: Appendix 3). Two eligible trials were excluded from the quantitative analyses since one was disconnected from the rest of the network (HANSHIN, 2015) and another involved suspected research misconduct (Zhang, 2015) . In the latter trial, the reported HR and survival curves were contradictory and the authors did not answer our questions.
Characteristics of eligible trials
The characteristics of the 102 individual trials are available in Additional file 1: Appendix 8 and their summary characteristics are provided in Table 1. Most trials were multi-center with industry funding. The majority of patients (29,864 patients, 83%) were randomized in phase III trials. Overall, 62% of patients were male, the mean age was 61 years, 81% had stage IV cancer, 80% were smokers, and 92% had a performance status score 0–1. In all, 26 trials (27%), including 4659 patients (13%), involved only Asians and 78 (76%) were of patients with both SCC or NSCC. The main results of these trials are provided in Additional file 1: Appendix 9.
Risk of bias
Only 47 trials (46%) described an adequate random sequence generation and 37 (36%) an adequate treatment allocation concealment. Patients and care providers were blinded in 29 trials (28%), and outcome assessors in 41 trials (40%). A detailed description of the risk of bias results is provided in Additional file 1: Appendix 4.
Figure 2 shows six network diagrams representing the global network (i.e., all trials regardless of data availability) and the networks with data for ObR, OS, PFS, SAEs, and QoL. The four recommended treatments (docetaxel, pemetrexed, erlotinib, and gefitinib) were thoroughly compared against each other. However, novel treatments were compared against only one alternative intervention. ObR and OS were reported in almost all trials (91 trials (95%) and 85 trials (88%), respectively) and PFS in 79 trials (82%), whereas SAEs were reported in 55 trials (57%) and QoL in only 18 trials (19%). Moreover, more than six different scales for QoL were used in these trials, so we did not synthesize the data for QoL.
Relative effects and ranking
We did not observe any important clinical differences in distribution of effect modifiers between trials comparing different sets of interventions; thus, we considered that the transitivity assumption was likely met (Table 1, see Additional file 1: Appendix 10 for patient characteristics).
Pairwise meta-analysis suggested a statistically significant OS benefit of nivolumab, atezolizumab, and docetaxel plus ramucirumab against docetaxel and pemetrexed plus erlotinib against pemetrexed. Additional file 1: Appendix 10 provides the results from pairwise meta-analyses for all available direct comparisons. For five comparisons, the heterogeneity standard deviation was greater than 0.10.
Table 2 presents all relative effects for the pre-specified treatments from the NMA. Relative effects of all other treatments against these treatments are presented in forest plots in Additional file 2: Appendix Figure S1. Eighteen treatments were significantly more effective than placebo in terms of OS. Nivolumab was more effective than docetaxel (HR 0.69, 95% CrI 0.56–0.83), pemetrexed (HR 0.67, 95% CrI 0.52–0.83), erlotinib (HR 0.68, 95% CrI 0.53–0.86), and gefitinib (HR 0.66, 95% CrI 0.53–0.83). Pembrolizumab, atezolizumab, and pemetrexed plus erlotinib were also significantly more effective than docetaxel, pemetrexed, erlotinib, and gefitinib. The τ value was close to 0 (τ = 0.04). Erlotinib plus cabozantinib, nivolumab, pembrolizumab, atezolizumab, and pemetrexed plus erlotinib represented the five most effective treatments in terms of OS (Additional file 1: Appendix Figure S2). The four recommended treatments were ranked in the 30th position (Additional file 1: Appendix Figure S3). Proper methodology to translate these results into absolute effects still remains to be developed. Nevertheless, we could consider as an illustrative – but not yet conclusive – example of the absolute effects the difference in restricted mean survival at 18 months of immunotherapy versus docetaxel (Additional file 1: Appendix 13). As the evidence presented in this systematic review does not allow for any definitive conclusions, the actual impact on life expectancy should be further explored by future analyses.
Pairwise meta-analyses suggested that, for PFS, treatment combinations often performed better when compared to a single treatment (Additional file 1: Appendix 10). For most comparisons, heterogeneity was 0. The largest heterogeneity was for gefitinib versus pemetrexed (τ = 0.51). According to the NMA results, erlotinib plus cabozantinib was more effective than docetaxel (HR 0.39, 95% CrI 0.18–0.84), pemetrexed (HR 0.38, 95% CrI 0.18–0.82), erlotinib (HR 0.37, 95% CrI 0.18–0.78), and gefitinib (HR 0.38, 95% CrI 0.18–0.82) (Tables 2 and 3, Additional file 2: Appendix Figure S1). Cabozantinib and pemetrexed plus erlotinib were also significantly more effective than docetaxel, pemetrexed, erlotinib, and gefitinib. The heterogeneity was larger as compared with OS (τ = 0.15). The four recommended treatments were ranked around the 40th position (Additional file 2: Appendix Figure S2). Additionally, combinations of dual-targeted therapies (erlotinib plus pazopanib) or chemotherapy plus targeted therapy (paclitaxel plus bevacizumab) appeared to be among the most effective treatments (Additional file 2: Appendix Figure S3).
Pairwise meta-analysis suggested a statistically significant benefit for ObR of nivolumab and some treatment combinations against a single treatment (Additional file 1: Appendix 10). For SAE, docetaxel plus selumetinib appeared more toxic than docetaxel (Additional file 1: Appendix 10). The heterogeneity was generally larger than that for the two primary outcomes for all comparisons. For ObR and SAEs, no pair of the previously mentioned treatments showed a statistically significant effect (Tables 2 and 3, Additional file 2: Appendix Figure S1). The τ value was moderate to high for ObR and moderate for SAEs . Cabozantinib combined with erlotinib or alone and docetaxel plus selumetinib seemed to be among the most effective for ObR but also the most toxic (Additional file 2: Appendix Figures S2 and S3).
The design times treatment interaction model did not suggest statistical inconsistency for any outcome. Nevertheless, the P value was less than 10% for ObR, which indicates some inconsistency for this outcome. These results in general agreed with the loop-specific approach, finding no inconsistent loop SAEs, one loop for OS and PFS, and two loops for ObR (Additional file 2: Appendix Figure S4). A possible explanation for this inconsistency in PFS and ObR is the presence of one particular trial in these loops (the WJOG5108L trial for the loop docetaxel-erlotinib-gefitinib and the loop erlotinib-gefitinib-vandetanib, and the Lee, 2013 trial for docetaxel-erlotinib-pemetrexed), which differed greatly in patient characteristics (ethnicity, histology, and smoking status).
In subgroup analyses, we did not find any important differences in treatment effect estimates between trials of SCC and NSCC, only Asians, and of only wild-type patients (Additional file 2: Appendix Figure S5). For NSCC, immunotherapy seemed more efficacious for OS than erlotinib plus cabozantinib, docetaxel plus ramucirumab, and docetaxel plus nintedanib. For SCC, immunotherapy seemed better afatinib and docetaxel plus ramucirumab. According to the treatment categories analysis, immunotherapy and chemotherapy plus targeted therapy seemed the most effective treatments (Additional file 1: Appendix 11). Nevertheless, chemotherapy plus targeted therapy seemed more toxic.
Reporting bias and credibility of the evidence
The funnel plot appeared symmetrical for OS but rather asymmetrical for PFS, which suggests that the sponsored treatments were favored more in small rather than larger trials (Additional file 2: Appendix Figure S6). However, the regression coefficient was not significant in the network meta-regression model that accounted for differences in trial size. This result might be explained by small-study effects not operating consistently across all comparisons, but the scarcity of the data did not allow us to explore this possibility. The GRADE evaluation suggests that the available evidence is of moderate credibility for the majority of the comparisons with respect to OS. For comparisons between immunotherapies and the combination pemetrexed plus erlotinib against the four recommended treatments, information came from low risk of bias trials with a contribution varying from 60% to 100%. On the contrary, for comparisons including either cabozantinib alone or in combination with erlotinib, information came mainly from moderate risk of bias trials. Less confidence can be placed on the results for PFS as most comparisons were rated at low or very low credibility.
Our review is, to date, the most comprehensive comparative effectiveness review for second-line treatments for advanced NSCLC with wild-type or unknown status for EGFR, involving 102 RCTs (36,058 patients). In terms of OS, immunotherapy (nivolumab, pembrolizumab, and atezolizumab) and the combination pemetrexed plus erlotinib seem to be more efficacious than the four recommended treatments, with no difference in effectiveness between the four recommended treatments. Other approved treatments did not have a statistically significant benefit as compared to the four recommended treatments. Our review highlighted that only half of the trials reported safety outcomes; thus, results for safety were very uncertain. Moreover, for OS, all relative effects were informed by trials at low and moderate risk of overall bias, and therefore the quality of evidence was moderate.
Our NMA has advantages over the two previous NMAs, which focused on a subset of the available treatments [7, 8]. Our exhaustive search strategy allowed us to identify (1) a large number of unpublished trials, which represented one-sixth of our data, and (2) several reports for the same trial, with a median of two reports by trial (range 1–17), which allowed us to compare results from each report and to give priority to the report corresponding to the best level of evidence. The unpublished trials were mainly phase III trials (9/15, 60%) and corresponded to negative trials assessing unsuccessfully licensed drugs or small trials (less than 100 patients); including these trials decreased the risk of publication bias and increased the power of treatment categories analysis.
Our NMA also provides the most up-to-date evidence synthesis results with last search date on June 6, 2017. Considering all the available evidence on any treatment for NSCLC that has appeared in the literature allowed us to (1) confirm the superiority of immunotherapies over all other treatments; (2) reveal highly efficacious treatment combinations (such as the combination pemetrexed plus erlotinib), which can be considered as equivalent alternatives to the new drugs although they are underrepresented in trials partly due to contradicting interests of the two pharmaceutical companies that market the two drugs; (3) investigate subgroups of patients considering histologic subtypes, such as the superiority of immunotherapies over the combination of docetaxel plus ramucirumab for NSCC, and ethnicity.
We also performed a detailed assessment of the credibility of the evidence to critically appraise our results. With respect to the five most efficacious treatments for OS, the level of evidence was higher (i.e., moderate) for nivolumab, pembrolizumab, atezolizumab, and pemetrexed plus erlotinib when compared to the four recommended treatments than for erlotinib plus cabozantinib, for which the level of evidence was low. For PFS, most comparisons were rated at low or very low credibility mainly due to lack of blinding and absence of an independent clinical endpoint adjudication committee to assess subjective outcomes, which led to very serious concerns for study limitation for many comparisons.
Moreover, considering all trials performed for a given condition facilitates the research planning analysis. Such analysis revealed a lack of head-to-head trials comparing novel treatments to each other in our network of trials. This apparent lack of direct evidence between novel treatments may help researchers plan subsequent trials (e.g., for choosing a control group). In their discussion, Neal et al.  wondered about the best comparator treatment for erlotinib plus cabozantinib in a phase III trial; in considering our results, nivolumab, atezolizumab, or pembrolizumab should be the best choice in considering treatment efficacy. Finally, our findings can be useful for policy-makers to prioritize future research and to facilitate the development of high-quality guidelines. Recently, the FDA modified the indication for erlotinib, restricting its use to patients with EGFR mutation . Nevertheless, in our analysis, erlotinib as second-line therapy appeared to be as effective as docetaxel for OS.
Our study has several limitations. First, we did not distinguish between the different types of data; namely, we considered the 11 trials (11%) only identified through a conference abstract as the same level of evidence as published trials in the quantitative analysis. However, when available, results obtained from the different reports were very similar. Second, we could not formally assess the assumption of transitivity because, for most of treatment comparisons, there are very few trials included. However, we observed small differences in terms of tumor histology and ethnicity, which are unlikely to violate the transitivity assumption. These differences might explain the small inconsistencies found for ObR and PFS. Nevertheless, our subgroup analyses did not reveal any important differences in the relative effects when these populations were analyzed separately (Additional file 1: Appendix Figure S5). Third, for safety assessment, we focused on the number of SAEs because this allows a reproducible global assessment of severe toxicity. This outcome was actually the most frequently reported (52 of 102 trials, 51%), contrary to the number of grade 3–4 SAEs, which were only reported in 30 trials (29%). Reporting of specific adverse events is very heterogeneous across trials and did not allow the synthesis of evidence for each specific adverse event. Additionally, because of the lack of reporting of the number of SAEs across trials, we could not even infer on the effect of these treatments. Fourth, providing a broad panorama of all available evidence by considering trials performed over a 10-year period does not allow to take into account some parameters only described recently, such as some predictive biomarkers. Among the 102 selected trials, only nine were designed in a specific population in terms of biomarkers. Moreover, for NSCLC research, even in 2015, only one third of trials proposed an enrichment design and the majority of trials remained performed in an unselected population. Finally, we performed a NMA on the hazard ratios, which remain the most widely used measure of treatment effect in oncology, although this measure is probably not the most informative for patients. Alternatives to the hazard ratio have been proposed, such as the difference in restricted mean survival times [33, 34] or difference in survival rates at a pre-specified time-point . However, these alternatives have never been applied, to date, to NMA as several issues remain to be solved (such as the choice of the time-point for defining the restricted mean survival or the survival rate).
Our comparative effectiveness review of second-line treatments for advanced NSCLC with wild-type or unknown status for EGFR compared 61 treatments assessed in 102 trials (36,058 patients). Our NMA revealed that immunotherapy (nivolumab, pembrolizumab, and atezolizumab) and pemetrexed plus erlotinib might be more efficacious for OS than the four recommended treatments (docetaxel, pemetrexed, erlotinib, and gefitinib) and highlighted the relatively poor performance of these four treatments. The assessment of safety and patient reporting outcomes was uncertain because of a lack of reporting.
Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer J Int Cancer. 2015;136(5):e359–386.
Global Burden of Disease Cancer Collaboration, Fitzmaurice C, Dicker D, Pain A, Hamavid H, Moradi-Lakeh M, et al. The Global Burden of Cancer 2013. JAMA Oncol. 2015;1(4):505–27.
Barlesi F, Mazieres J, Merlio J-P, Debieuvre D, Mosser J, Lena H, et al. Routine molecular profiling of patients with advanced non-small-cell lung cancer: results of a 1-year nationwide programme of the French Cooperative Thoracic Intergroup (IFCT). Lancet. 2016;387(10026):1415–26.
Masters GA, Temin S, Azzoli CG, Giaccone G, Baker S, Brahmer JR, et al. Systemic therapy for stage IV non-small-cell lung cancer: American Society of Clinical Oncology Clinical Practice Guideline Update. J Clin Oncol. 2015;33(30):3488–515.
Créquit P, Trinquart L, Yavchitz A, Ravaud P. Wasted research when systematic reviews fail to provide a complete and up-to-date evidence synthesis: the example of lung cancer. BMC Med. 2016;14:8.
Ioannidis JPA, Karassa FB. The need to consider the wider agenda in systematic reviews and meta-analyses: breadth, timing, and depth of the evidence. BMJ. 2010;341:c4875.
Hawkins N, Scott DA, Woods BS, Thatcher N. No study left behind: a network meta-analysis in non-small-cell lung cancer demonstrating the importance of considering all relevant data. Value Health J Int Soc Pharmacoeconomics Outcomes Res. 2009;12(6):996–1003.
Popat S, Mellemgaard A, Fahrbach K, Martin A, Rizzo M, Kaiser R, et al. Nintedanib plus docetaxel as second-line therapy in patients with non-small-cell lung cancer: a network meta-analysis. Future Oncol. 2015;11(3):409–20.
Chaimani A, Caldwell D, Li T, Higgins JPT, Salanti G. Additional considerations are required when preparing a protocol for a systematic review with multiple interventions. J Clin Epidemiol. 2017;83:65–74.
Hutton B, Salanti G, Caldwell DM, Chaimani A, Schmid CH, Cameron C, et al. The PRISMA extension statement for reporting of systematic reviews incorporating network meta-analyses of health care interventions: checklist and explanations. Ann Intern Med. 2015;162(11):777–84.
Therasse P, Arbuck SG, Eisenhauer EA, Wanders J, Kaplan RS, Rubinstein L, et al. New guidelines to evaluate the response to treatment in solid tumors. European Organization for Research and Treatment of Cancer, National Cancer Institute of the United States, National Cancer Institute of Canada. J Natl Cancer Inst. 2000;92(3):205–16.
Chan A-W. Out of sight but not out of mind: how to search for unpublished clinical trial evidence. BMJ. 2012;344:d8013.
Guyot P, Ades AE, Ouwens MJNM, Welton NJ. Enhanced secondary analysis of survival data: reconstructing the data from published Kaplan-Meier survival curves. BMC Med Res Methodol. 2012;12:9.
Higgins JPT, Altman DG, Gøtzsche PC, Jüni P, Moher D, Oxman AD, et al. The Cochrane Collaboration’s tool for assessing risk of bias in randomised trials. BMJ. 2011;343:d5928.
Salanti G. Indirect and mixed-treatment comparison, network, or multiple-treatments meta-analysis: many names, many benefits, many concerns for the next generation evidence synthesis tool. Res Synth Methods. 2012;3(2):80–97.
Cipriani A, Higgins JPT, Geddes JR, Salanti G. Conceptual and technical challenges in network meta-analysis. Ann Intern Med. 2013;159(2):130–7.
Caldwell DM, Ades AE, Higgins JPT. Simultaneous comparison of multiple treatments: combining direct and indirect evidence. BMJ. 2005;331(7521):897–900.
Salanti G, Ades AE, Ioannidis JPA. Graphical methods and numerical summaries for presenting results from multiple-treatment meta-analysis: an overview and tutorial. J Clin Epidemiol. 2011;64(2):163–71.
Bucher HC, Guyatt GH, Griffith LE, Walter SD. The results of direct and indirect treatment comparisons in meta-analysis of randomized controlled trials. J Clin Epidemiol. 1997;50(6):683–91.
Higgins JPT, Jackson D, Barrett JK, Lu G, Ades AE, White IR. Consistency and inconsistency in network meta-analysis: concepts and models for multi-arm studies. Res Synth Methods. 2012;3(2):98–110.
Chaimani A, Higgins JPT, Mavridis D, Spyridonos P, Salanti G. Graphical tools for network meta-analysis in STATA. PLoS One. 2013;8(10):e76654.
Chaimani A, Salanti G. Using network meta-analysis to evaluate the existence of small-study effects in a network of interventions. Res Synth Methods. 2012;3(2):161–76.
Salanti G, Del Giovane C, Chaimani A, Caldwell DM, Higgins JPT. Evaluating the quality of evidence from a network meta-analysis. PLoS One. 2014;9(7):e99682.
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2016. https://www.R-project.org/. Accessed 13 June 2017.
Viechtbauer W. metaphor: Meta-Analysis Package for R. The R project. https://www.r-project.org/. Accessed 13 June 2017.
Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility. Stat Comput. 2000;10(4):325–37.
White IR. Network meta-analysis. Stata J. 2015;15(4):951–85.
Chaimani A, Salanti G. Visualizing assumptions and results in network meta-anlaysis: the network graphs package. Stata J. 2015;15(4):905–50.
Elia N, von Elm E, Chatagner A, Pöpping DM, Tramèr MR. How do authors of systematic reviews deal with research malpractice and misconduct in original studies? A cross-sectional analysis of systematic reviews and survey of their authors. BMJ Open. 2016;6(3):e010442.
Turner RM, Davey J, Clarke MJ, Thompson SG, Higgins JP. Predicting the extent of heterogeneity in meta-analysis, using empirical data from the Cochrane Database of Systematic Reviews. Int J Epidemiol. 2012;41(3):818–27.
Neal JW, Dahlberg SE, Wakelee HA, Aisner SC, Bowden M, Huang Y, et al. Erlotinib, cabozantinib, or erlotinib plus cabozantinib as second-line or third-line treatment of patients with EGFR wild-type advanced non-small-cell lung cancer (ECOG-ACRIN 1512): a randomised, controlled, open-label, multicentre, phase 2 trial. Lancet Oncol. 2016;17(12):1661–71.
Approved Drugs - Erlotinib (Tarceva). http://www.fda.gov/Drugs/InformationOnDrugs/ApprovedDrugs/ucm525739.htm. Accessed 19 Dec 2016.
Uno H, Claggett B, Tian L, et al. Moving beyond the hazard ratio in quantifying the between group difference in survival analysis. J Clin Oncol. 2014;32:2380–5.
Trinquart L, Jacot J, Conner SC, Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol. 2016;34(15):1813–9.
Blumenthal GM, Zhang L, Zhang H, Kazandjian D, Khozin S, Tang S, Goldberg K, Sridhara R, Keegan P, Pazdur R. Milestone analyses of immune checkpoint inhibitors, targeted therapy, and conventional therapy in metastatic non-small cell lung cancer trials: a meta-analysis. JAMA Oncol. 2017;3(8):e171029.
The authors thank Drs. Porcher and Vivot for helping in revising the manuscript, and Ivan Lerner for his work as second reviewer for the update of the systematic review. The authors thank Prof. Maitland, Prof. Vernenegre, Dr. Bhatnagar, and Dr. Kumbhaj for providing their data. The authors also thank Laura Smales (BioMedEditing, Toronto, Canada) for language revision of the manuscript and Elise Diard for the layout of the figures.
This study was supported by a grant from the French National Cancer Institute (Institut National du Cancer, INCa) (N° 2016-020/058/AB-KA), and a LEGS POIX 2015 grant (from Chancellerie des Universités de Paris). The funding sources had no role in the design of this study, its execution, analyses, interpretation of the data, and decision to submit results.
Availability of data and materials
All the data supporting our findings are presented in Additional file 1: Appendix 7: Identified reports for the eligible trials (p. 22–42), Additional file 1: Appendix 8: Characteristics of the 102 individual trials (p. 43–49), Additional file 1: Appendix 9: Results of individual trials (p. 50–59), Additional file 1: Appendix 4: Risk of bias assessment (p. 10–18).
Ethics approval and consent to participate
Consent for publication
Dr. Cadranel reports grants and personal fees from Pfizer, Novartis, and Boerhinger, and personal fees from AZ, Lilly, BMS, and Roche outside the submitted work. All other authors have no potential competing interests to disclose.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Definition used for objective response and serious adverse events. Appendix 2. Full search strategy. Appendix 3. Data extraction process. Appendix 4. Risk of bias assessment. Appendix 5. Classification of second-line treatments. Appendix 6. Reasons for excluding full texts and conference abstracts. Appendix 7. Identified reports for the eligible trials. Appendix 8. Characteristics of the 98 individual trials. Appendix 9. Results of individual trials. Appendix 10. Results of pairwise meta-analyses and patient characteristics across trials within each comparison. Appendix 11. Treatment categories analysis. Appendix 12. WinBUGS codes. Appendix 13. Difference of restricted mean survival times (RMST) at 18 months and the 1-year overall survival for trials comparing immunotherapy to docetaxel. (PDF 2899 kb)
Forest plots for four outcomes. Figure S2. Cumulative ranking curves and SUCRA values for four outcomes. Figure S3. Rankograms for four outcomes. Figure S4. Evaluation of inconsistency. Figure S5. Subgroup analyses by histology and ethnicity: network graphs of trials for OS, forest plots, and SUCRA values for OS and PFS. Figure S6. Funnel plots of small-study effects. Figure S7. Overall assessment of bias. (PDF 1930 kb)