First-line treatment of EGFR-mutated nonsmall cell lung cancer: critical review on study methodology
- 1Dept of Hematology/Oncology, University Hospital Frankfurt, Frankfurt, 2Hematology, Oncology, Infectious Diseases, Aerzteforum Seestrasse, Medizinisches Versorgungszentrum, Berlin, and 3LungClinic Grosshansdorf, Member of the German Centre for Lung Research (DZL), Grosshansdorf, Germany.
- M. Sebastian, Dept of Hematology/Oncology, University Hospital Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Frankfurt, Germany. E-mail: martin.sebastian{at}kgu.de
Abstract
Recent advances in understanding the mechanisms of nonsmall cell lung cancer (NSCLC) has led to the development of targeted treatments, including the reversible epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors gefitinib and erlotinib, and the irreversible ErbB family blocker afatinib. Several important activating EGFR mutations have now been identified, which correlate strongly with response to treatment with these agents. Multiple randomised controlled trials have confirmed the association between the presence of activating EGFR mutations and objective response to gefitinib, erlotinib and afatinib, thus demonstrating their superiority over platinum-based chemotherapy as first-line treatment for NSCLC patients with EGFR mutation-positive tumours, and resulting in approval of these agents for use in this setting. It can be tempting to compare outcome data across multiple clinical trials and agents; however, substantial differences in methodology between studies, including investigator versus independent assessment and differences in patient eligibility, makes such comparisons fraught with difficulty. This critical review provides an overview of the evolution of the methodology used in eight phase III trials investigating first-line targeted treatment of NSCLC, identifies key differences in methodology and reporting, and critically assesses how these differences should be taken into account when interpreting the findings from such trials.
Introduction
Despite recent advances in therapy for advanced lung adenocarcinoma, there continues to be an unmet medical need for effective treatment of stage IIIb/IV nonsmall cell lung cancer (NSCLC). In recent years, our understanding of the mechanisms of this disease has substantially increased in parallel with the development of the reversible epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) gefitinib and erlotinib, and afatinib, which binds irreversibly to EGFR as well as to the other members of the ErbB family.
Early trials with gefitinib and erlotinib revealed subsets of patients achieving prolonged responses to treatment not seen with standard chemotherapy [1, 2]. Females, nonsmokers, Japanese patients and patients with lung adenocarcinoma were found to have higher response rates than patients who were of European origin, male, smokers or who had other NSCLC histology [3–5]. In light of this, further investigations identified several important activating EGFR mutations occurring in specific patient types that correlate strongly with response to gefitinib and erlotinib treatment [2, 6, 7].
The first randomised clinical trial to specifically compare EGFR TKI therapy with chemotherapy in patients with EGFR mutation-positive tumours was IPASS (Iressa Pan-Asia Study) [8]. In East Asian patients with stage IIIB/IV lung adenocarcinoma who never smoked tobacco (or only smoked lightly), initial treatment with an EGFR TKI was found to be superior to standard platinum-based chemotherapy [8]. Patients with EGFR mutation-positive tumours achieved significantly longer progression-free survival (PFS) with gefitinib versus those receiving chemotherapy (hazard ratio (HR) for progression or death 0.48 (95% CI 0.36–0.64); p<0.001) [8].
Subsequently, a number of trials, published within a relatively short period, have specifically addressed first-line treatment options in patients with stage IIIb/IV NSCLC and suspected or known EGFR mutations, confirming the association between the presence of activating EGFR mutations and objective response to gefitinib, erlotinib and afatinib. These trials are as follows: EURTAC (European Randomised Trial of Tarceva versus Chemotherapy) [9], OPTIMAL [10, 11], NEJ002 (North East Japan 002) [12], West Japan Thoracic Oncology Group (WJTOG) 3405 [13], IPASS [8, 14], LUX-Lung 3 [15], LUX-Lung 6 [16] and ENSURE [17]. As a result, erlotinib, gefitinib and, most recently, afatinib have received approval for first-line treatment of EGFR mutation-positive NSCLC [18–20]. Furthermore, recognition of the significance of acquired genetic mutations in therapeutic targets, including EGFR, has led to alterations in the NSCLC treatment paradigm, with upfront molecular testing for EGFR and other mutations now recommended [21].
Although results from these trials are frequently compared, it is important to recognise that direct comparisons do not take into account substantial differences in trial methodology, e.g. mutation testing, assessment of progression (independent versus investigator) and differences in patient inclusion criteria. For example, inclusion of local populations and differences in EGFR mutation status have the potential to impact on extrapolation of the findings and the generalisability of the conclusions, and may, therefore, have a bearing on regulatory processes and approval. Furthermore, differences in trial documentation can impact on the utility of trial data.
The objective of this review is to provide an overview of the evolution of methodology of phase III trials investigating first-line treatment of NSCLC over time, and to assess how differences in methodology should be taken into account when interpreting the findings from such trials. The results of the chemotherapy arms are not discussed extensively because all trials concluded that in patients with EGFR mutation-positive NSCLC, the chemotherapy comparator was inferior.
Methodology
Clinical trials were searched using www.ClinicalTrials.gov and www.citeline.com. The results of identified trials were obtained via PubMed, American Society of Clinical Oncology and European Society for Medical Oncology/European CanCer Organisation supplements, and World Conference on Lung Cancer abstracts. Where possible, data from fully published peer-reviewed literature were included. However, for more recent studies, LUX-Lung 6 and ENSURE [16, 17], data were only available in abstract, poster or presentation form. Phase III trials that investigated the first-line treatment of patients with stage IIIb/IV NSCLC were included. This was further restricted to trials that compared EGFR TKI monotherapy with standard platinum-based chemotherapy. Each of the trials had TKI and chemotherapy comparator arms. The comparators are relevant to understanding the trial methodology and generalisability of the study results and are, therefore, included in the methodological comparison. Phase III studies meeting these criteria, but where only a subpopulation of patients were EGFR mutation-positive, were included providing efficacy data reported for the EGFR mutation-positive subgroup were sufficient for comparison with other studies. Phase III studies meeting these criteria, but where only a subpopulation of patients had stage IIIb/IV NSCLC, were also included.
For a qualitative comparison of the studies, results were analysed using the CONSORT (Consolidated Standards of Reporting Trials) criteria (table S1) [22]. For conciseness, not all CONSORT criteria are discussed in full for all studies. For instance, the CONSORT criteria require that all trial protocols be made publically available in a registration database (e.g. www.ClinicalTrials.gov); however, since not all study protocols were available, no attempt was made to systematically retrieve study details from this source. Other CONSORT criteria, e.g. eligibility criteria, were reviewed as published but are not repeated here in detail. Only differences that were considered of relevance to the interpretation and comparison of study findings are discussed.
Quantitative and qualitative analyses of phase III first-line trials in EGFR mutation-positive NSCLC
Identification of trials
Nine trials were identified initially. Of these, eight had PFS as the primary end-point: NEJ002 [12, 23], WJTOG3405 [13], IPASS [8, 14], EURTAC [9], LUX-Lung 3 [15], OPTIMAL [10, 11], LUX-Lung 6 [16], and ENSURE [17] (table 1). The First-SIGNAL trial [27] (www.ClincalTrials.gov identifier NCT00455936) was also identified, but differed from the other trials in that the primary end-point was overall survival. The First-SIGNAL trial was conducted exclusively in South Korea and investigated first-line gefitinib versus gemcitabine-cisplatin in never-smokers with lung adenocarcinoma (stage IIIb/IV). Only 42 (14%) out of 313 patients were EGFR mutation-positive. As a result, the published results for the EGFR mutation-positive subgroup, especially for the secondary end-points of PFS and overall response rate, were limited, compromising comparison with other trials.
Qualitative analyses of the trials
Six studies were conducted in East Asia (NEJ002, WJTOG3405, IPASS, OPTIMAL, LUX-Lung 6 and ENSURE; 100% East Asian population), one was global (LUX-Lung 3) with a 72% East Asian population, and one was European (EURTAC) with a 99% Caucasian population. The earliest studies commenced in March 2006 (NEJ002, WJTOG3405 and IPASS) and were completed by June 2009, while OPTIMAL and the LUX-Lung 3 and 6 studies were conducted between 2008 and 2011. ENSURE was conducted between 2011 and 2013.
All studies were open-label randomised controlled trials. However, randomisation methodology was not reported consistently across trials (notably lacking in NEJ002 and IPASS) and stratification criteria varied widely. Most trials had a 1:1 treatment allocation ratio. Only the LUX-Lung 3 and 6 studies had a treatment allocation ratio of 2:1. The sample size of the studies varied from 154 patients in OPTIMAL to 364 patients in LUX-Lung 6. The number of protocol violations in terms of patients’ eligibility was low; four of the studies reported no violations, while EURTAC reported two, LUX-Lung 3 reported one and OPTIMAL reported four protocol violations.
Some key differences in trial conception and design were noted, including differences in EGFR mutation status between trials, ranging from not mandating EGFR mutation-positive status at baseline to requirement for specific EGFR mutations. With the exception of IPASS, all trials focused on patients whose mutation status was confirmed by various detection methods (table 2). In IPASS, the overall study population was clinically enriched for patients with an EGFR mutation-positive status (table 3); however, only a subgroup of patients had known EGFR mutation status. Of all the included studies, WJTOG3405 and IPASS differed most from the others with regard to heterogeneous patient population (table 3). Again with the exception of IPASS, all studies aimed to show superiority of the EGFR targeting TKI over chemotherapy. In contrast, with its overall study population clinically enriched for EGFR mutations, IPASS was designed to show noninferiority between treatments for the overall population.
“Measurable disease”, an important baseline criterion for the evaluation of response to treatment (according to the gold-standard RECIST (Response Evaluation Criteria In Solid Tumours) criteria), was not used consistently across studies. EURTAC included patients with “measureable or evaluable disease” and WJTOG3405 included patients with “measurable and nonmeasurable disease”. Both NEJ002 and WJTOG3405 also had limitations on patients’ age, leading to the exclusion of a relevant cohort of elderly patients. There was variation in whether investigator or independent assessments were conducted. Three studies relied on investigator review only. One trial (IPASS) did not describe the method of assessment. The key features of each trial, according to CONSORT criteria, are summarised in table 3.
Quantitative analyses of the trials
An overview of the outcomes presented across trials is provided in table 4, and in the context of EGFR mutation status in table 5. All studies significantly show the efficacy of EGFR-targeting TKIs in the EGFR mutant population, although there are differences in methodology and numerical outcome. For example, overall, PFS was shortest in WJTOG3405 (8.4 months) and longest in LUX-Lung 6 (13.7 months by investigator assessment). However, these trials differed in their use of investigator versus (blinded) independent assessment, as well as other methodologies, as described previously. The overall response rate was highest in OPTIMAL (83%) and lowest in EURTAC (58%) and LUX-Lung 3 (56%, independent review). Again, however, differences in assessment methodology were noted.
Findings for overall survival are also presented where available (table 4); however, the potential impact of crossover and the lack of assessment and reporting of outcomes with treatment post-progression further limits the comparability of these data.
In terms of safety, increases in alanine transaminase/aspartate aminotransferase were most common in the gefitinib trials (NEJ002 and WJTOG3405), while fatigue had the highest incidence in the EURTAC study, and rash and diarrhoea were most commonly reported in the afatinib studies (LUX-Lung 3 and LUX-Lung 6). However, rates of adverse event-related discontinuation were presented differently between studies (some studies only reported treatment-related adverse events/EGFR mutation-positive patients while others presented overall data). No information was available regarding the quality of adverse event reporting in terms of source data monitoring.
Comparison of the evaluation of health-related quality of life/patient-reported outcomes
Three different questionnaires were used in the five studies addressing health-related quality of life (HRQoL): Care Notebook; European Organization for Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire (QLQ); and the Functional Assessment of Cancer Therapy – Lung (FACT-L) questionnaire (including lung cancer-specific modules of the latter two). Key details of the quality of life (QoL) analyses are summarised by study in table 6. In NEJ002 [30], QoL was assessed for 20 weeks after initiation of first-line therapy using the Care Notebook [34–36], a self-administered, cancer-specific questionnaire that comprises 24 domains structured in multidimensional scales, assessed using one word or a short phrase graded on an 11-point linear analogue scale (scored 0–10). Patients complete the questionnaire before therapy and then weekly during first-line treatment. Deterioration is noted when worsening from baseline by one of 11 points (9.1%) occurs at any time-point [37, 38]. To the best of our knowledge, the Care Notebook is not used outside Japan.
In the LUX-Lung 3 and 6 studies, patient-reported outcomes (PROs) were comprehensively assessed at randomisation and then every 21 days until disease progression [26, 33, 39] using the self-administered, cancer-specific EORTC QLQ-C30 [40, 41], comprising 30 questions of both multi- and single-item measures, and the lung cancer-specific module QLQ-LC13 [42, 43], comprising 13 questions and designed for use in patients with lung cancer undergoing chemotherapy or radiotherapy. Each item utilises a four-point linear analogue scale with a seven-point scale for overall health and QoL. A linear transformation is then applied to standardise the raw score to a range from 0 to 100 (high scores represent a high/healthy level of functioning or high/severe level of symptomatology) [41, 44]. A 10-point change in an item or domain was accepted as a clinically meaningful change [38], with a ≥10-point decrease from baseline at any time during the study used to define symptom improvement. Time to deterioration in PROs was defined as months from randomisation to the first instance of symptom worsening (10 points from baseline) [38, 45], and changes in PROs scores over time were assessed using mixed-effects growth curve models [46].
Both IPASS [31] and OPTIMAL [32] assessed HRQoL using the total score of the FACT-L questionnaire and the Trial Outcome Index (TOI; sum of the physical well-being, functional well-being and lung cancer subscale (LCS) scores of the FACT-L), and lung cancer symptom improvement was assessed using the LCS domain of the FACT-L. Questionnaires were completed at baseline, week 1 and week 3, then every 3 weeks until week 18, and then every 6 weeks until tumour progression, and at treatment discontinuation. Each item uses a five-point linear analogue scale, with clinically relevant improvement/worsening in HRQoL and symptoms predefined as an increase or decrease from baseline of ≥6 points for FACT-L and TOI, and ≥2 points for LCS, maintained for ≥21 days [37].
All studies showed clinically relevant symptom improvement; however, for a direct comparison, it is critical that different definitions of what was considered to be clinically meaningful are applied. EORTC has the highest threshold for what was considered to be clinically meaningful (10%); Care Notebook has the most granular questionnaire with the most requirement of data collection. This may be why the compliance rate for patients recording these data was so low in this study. The definition of “clinically meaningful” used in EORTC questionnaires was prospectively developed by Osoba et al. [38]. This is a subjective significance questionnaire where a change in score of 5–10 points is perceived by patients as having little difference. A difference in score of 10–20 points is perceived as moderately different, and a difference of >20 points is perceived as very different. By combining systematic reviews, expert opinions and meta-analysis, Cocks et al. [44] demonstrated similar results. An improvement by 0–4 points was considered trivial, an improvement by 4–10 points was considered little difference, an improvement by 10–15 points was considered moderately different, and an improvement by >15 points was considered very different. The approach for FACT-L was a retrospective estimation. Cella et al. [37] defined criterion-related validity as the relationship of test scores to meaningful anchors such as performance status rating, weight loss and presence of primary disease symptoms, and used this information to provide meaning to scores based on group-level differences from one trial. Clinically relevant changes were estimated as 2–3 points for the LCS and 5–7 points for the TOI [37]. Furthermore, longitudinal analysis, which shows that the effects are long lasting and not only snapshots, was only available for afatinib. Interestingly, the positive impact of afatinib over chemotherapy in the LUX-Lung studies could also be shown for the time-period where patients were on drug holiday from chemotherapy but still on afatinib.
Limitations of comparisons across trials
There is no question that TKIs targeting EGFR are superior to platinum-based chemotherapy in the first-line setting of NSCLC patients with EGFR mutation-positive tumours. However, as a result of the substantial differences in methodology between studies and reporting, caution should be exercised when comparing outcomes between trials.
The high incidence of patients with EGFR mutation-positive NSCLC in Asia is reflected by the focus on Asian populations in studies conducted in this setting. While a large proportion of patients in the LUX-Lung 3 study came from Asia, LUX-Lung 3 was the only study designed to be global, with subanalyses using race (Asian versus non-Asian) as a stratification showing no significant difference between the ethnicities, hence further increasing the global relevance of the data.
Some trial design features apply to all studies. All were randomised controlled trials and all had a low rate of protocol violations; however, when looking more closely at our quantitative analysis, two studies warrant specific discussion as they differ most from the others: WJTOG3405 and IPASS. In the WJTOG3405 study important changes were implemented during study conduct with the inclusion of a heterogeneous patient population (both post-operative recurrent as well as stage IIIb/IV patients). This makes the findings from this study somewhat difficult to put into perspective. IPASS was conducted to show noninferiority of TKIs versus chemotherapy in clinically enriched patients. The subgroup analysis of patients with EGFR mutation-positive tumours was pre-planned and superiority for the whole population could be concluded from the same analysis without statistical penalty. With these exploratory analyses, IPASS was certainly a milestone for the understanding of the activity of gefitinib in patients with EGFR mutation-positive tumours, but unlike the other studies discussed here, IPASS was not designed to show superiority. The similar design and robust methodology used in LUX-Lung 3 and LUX-Lung 6 has led to a high reproducibility of the efficacy results for afatinib. This has not been observed in the erlotinib study (ENSURE, OPTIMAL or EURTAC). Currently, we cannot provide reasons for this.
Limitations of source data
Some of the included studies provided only limited information required by CONSORT. In some cases, study protocols were not included as supplementary information to the primary publications (despite being recommended by CONSORT and requested by many journals). The exceptions to this were OPTIMAL and LUX-Lung 3, both of which included study protocols as part of the primary publication.
CONSORT also requires the disclosure of randomisation methods, as these are critical for judging of the quality of a trial [24]. However, to our knowledge, this was not carried out for NEJ002 or IPASS. The term “random” is frequently used to describe treatment allocation methods that do not fit with its precise definition. These include nonrandom methods to determine allocation, such as alternation and the use of hospital numbers or birth date, which have the potential to lead to biases in the study design and, ultimately, study outcomes. Furthermore, while various different methods of sequence generation, such as the nonrandom process of minimisation, are acceptable, this cannot be determined from descriptors centred on randomness alone. As such, more detailed descriptions of randomisation and sequence generation methods are needed [24].
In some publications, results of secondary end-points and data on stratification factors were not reported. In EURTAC, data from external review were, to our knowledge, not published in a peer-reviewed journal. In the primary OPTIMAL publication, the secondary end-points duration of response and time to progression were not reported. Furthermore, the influence of stratification on PFS was not shown. In the OPTIMAL trial, only patients who “had received at least one dose of study drug” rather than the intention to treat population were included in the efficacy analyses. Furthermore, in response to the request by the European Medical Agency, no clinical study report has been made available for this trial [48]. The ongoing ENSURE study [17] was conducted in the same setting as the OPTIMAL study, and may provide robust data for the efficacy and safety of TKIs in EGFR mutation-positive NSCLC.
The independent review of PFS in the EURTAC study was conducted retrospectively. In this study, not all of the scans were available for independent review. Based on the number of available scans and relevant clinical information, 30 patients were considered to have had an event by independent review in the chemotherapy arm versus 31 patients in the erlotinib arm (data cut-off August 2, 2010). In the investigators’ assessment, the number of patients who experienced an event in the chemotherapy and erlotinib treatment arms were 47 and 45, respectively [47].
It should also be noted that for studies where not only EGFR-mutated stage IIIB/IV patients were enrolled (e.g. IPASS and WJTOG3405), safety data were reported for the whole study population (and not the subpopulation of interest), potentially leading to artificially low occurrence rates. A recently published analysis showed that some methodological aspects of adverse event collection and analysis are poorly reported in trials. Given the importance of adverse events in evaluating new treatments, authors should be encouraged to adhere to the 2004 CONSORT guidelines regarding adverse event reporting [48].
Limitations of trial methodology
Eligibility criteria
There were substantial differences in eligibility criteria across the studies, some of which have the potential to introduce bias and compromise direct comparison of trial outcomes. Three trials included patients with an Eastern Coopertaive Oncology Group (ECOG) performance status of 0–2. However, the majority of patients included in these trials had a performance status of 0 or 1, meaning that few patients with a performance status of 2 were included. Of note, the afatinib trials excluded ECOG performance status 2 patients, and two studies had restrictions regarding the inclusion of elderly patients, excluding a relevant subgroup of patients in the stage IIIb/IV NSCLC setting, [15, 49]. Most studies were not restricted to patients with adenocarcinoma. As the histology of the tumour may influence prognosis, this difference may need to be taken into account before making any direct comparisons across studies. WJTOG3405 recruited patients with “non-measurable” disease and EURTAC included patients with “evaluable” disease, which compromises the assessment of the efficacy of these studies by standard RECIST criteria [50, 51].
Four studies were limited to patients with common mutations (Del19/L858R), meaning that these study populations were more homogeneous versus studies that included patients with both common and uncommon mutations [52]. For direct comparison of study results, this difference has a significant impact. The benefit of TKI treatment in patients with common mutations is well-established. This was also shown in the LUX-Lung 3 study, where the median PFS for patients with common mutations was 13.6 months compared with 11.1 months for all patients including those with uncommon mutations [15]. Due to the low number of patients with uncommon mutations, it still remains unclear as to what is the best treatment for these individuals. The testing methods for EGFR mutation detection were of different sensitivity; PCR methods have demonstrated lower invalid rates and higher sensitivity than Sanger in the detection of EGFR mutations [53, 54]. Highly sensitive testing methods, such as the peptide nucleic acid-locked nucleic acid PCR clamp methods used in NEJ002, have the potential to identify patients with low numbers of EGFR mutation-positive tumour cells. This could be different compared with methods where patients with a high percentage of EGFR-mutated cells may be selected and who might be expected to respond better to TKI.
Choice of comparator treatment
The comparator and number of cycles used in NEJ002, EURTAC, OPTIMAL and ENSURE varied as only three or four cycles of chemotherapy were allowed. In EURTAC, a defined variety of different chemotherapy regimens was allowed. This is an important consideration, given the impact that comparator treatment has on the comparability of efficacy data across trials. The relevance of the comparator arm is illustrated by the differences in HRs for PFS in the LUX-Lung 3 and LUX-Lung 6 studies. In these studies, the choice of comparators was driven by the differences in regulatory approvals for chemotherapies across the countries in which the studies were conducted. In LUX-Lung 6, the use of cisplatin/gemcitabine resulted in a lower PFS (HR 0.26) than with cisplatin/pemetrexed in LUX-Lung 3 (HR 0.58) (table 4). The experimental arms were essentially identical but chemotherapies differed substantially regarding PFS. However, when assessing LUX-Lung 6 and ENSURE, both comparing the respective investigational compound with cisplatin/gemcitabine, and reporting the same median PFS by independent review (11.0 months and 5.5–5.6 months for EGFR TKI and chemotherapy, respectively), a difference in HRs was observed (0.25 in LUX-Lung 6 compared with 0.42 in ENSURE for patients with common mutations). However, as neither trial is fully published yet we cannot speculate about the underlying reason.
Assessment and reporting of trial outcomes
The recognition of inherent differences in assessment and reporting of trial outcomes is critical for comparison of data across studies. Not all studies had prospective independent review by blinded oncologists/radiologists, which is regarded as the most conservative approach to assessing response to therapy, and is recommended in RECIST guidelines [50, 51]. Studies in this setting also lack assessment and reporting of survival outcomes with post-progression crossover treatment. As a result, the optimal sequence of EGFR TKI/afatinib and chemotherapy in patients with EGFR mutations has yet to be clarified.
Due to the difficulty in assessing overall survival benefits in clinical trials, HRQoL is an important method of measuring response to treatment in the first-line setting. From the consistent picture, it can be concluded that in general there is a benefit with both reversible and irreversible EGFR TKIs, especially when this is indicated by robust results from studies with validated HRQoL questionnaires and the inclusion of longitudinal analysis. The robustness of EORTC questionnaires ensures high generalisability of results. In the studies analysed here, HRQoL was improved with EGFR TKIs and afatinib compared with chemotherapy, and all trials showed clinically relevant differences in time to deterioration. The high return of completed questionnaires in IPASS, OPTIMAL, LUX-Lung 3 and LUX-Lung 6 strengthens the reliability of these data; however, in NEJ002, the QoL assessment gives limited information as patients had a low compliance in completing these data. Furthermore, differences in the approach to determining clinically meaningful improvements and a lack of longitudinal analyses in most trials means that, as with the other outcomes discussed in this review, caution must be exercised when comparing HRQoL results.
Conclusions
Taken together, these clinical trials provide substantial evidence that erlotinib, gefitinib and afatinib are the standard of care for patients with EGFR mutation-positive NSCLC, and should be considered as first-line treatment options. The results of QoL analyses, as a sum of side-effects and symptom improvement, support this view. However, cross-trial comparisons generally have strong scientific limitations. This is particularly obvious when comparing differences in trial design, comparator choice, inclusion criteria and reporting standards. Without highlighting these differences, the outcomes of these studies may be misinterpreted as comparable. For a reader not familiar with the intricacies of these studies, it is tempting to relate directly to the eye-catching single values of median PFS or response rate. This review shows that such comparisons are not valid. Furthermore, the optimal sequencing of EGFR TKIs, afatinib and chemotherapy in patients with EGFR mutations requires more investigation. The head-to-head comparisons of afatinib with gefitinib in the first-line setting (LUX-Lung 7; www.ClinicalTrials.gov identifier NCT01466660, fully recruited) and dacomitinib with gefitinib (ARCHER 1050; www.ClinicalTrials.gov identifier NCT01774721, ongoing) will shed more light on how these agents compare. For oncologists and patients, it is of high importance that clinical trial results are robust and generalisable. Therefore, it is highly desirable that future studies in NSCLC make use of the appropriate tools: independent tumour assessment; most appropriate randomisation methods; clearly defined patient populations; and well-recognised QoL questionnaires, to name a few.
Acknowledgments
Medical writing assistance, supported financially by Boehringer Ingelheim Pharma GmbH & Co. KG, was provided by C. Smart (Ogilvy Healthworld, London, UK) during the preparation of this manuscript. The authors were fully responsible for all content and editorial decisions, were involved at all stages of manuscript development, and have approved the final version.
Footnotes
This article has supplementary material available from err.ersjournals.com
Provenance: Publication of this peer-reviewed article was supported by Boehringer Ingelheim, Germany (article sponsor, European Respiratory Review issue 131).
Statement of Interest: Disclosures can be found alongside the online version of this article at err.ersjournals.com
- Received November 29, 2013.
- Accepted December 20, 2013.
- ©ERS 2014
ERR articles are open access and distributed under the terms of the Creative Commons Attribution Non-Commercial Licence 3.0.